Friday, March 23, 2012

I got this issue with a Redhat5/HP Proliant,

root]# vgs
File descriptor 4 (/dev/pts/1) leaked on vgs invocation. Parent PID 28768: sh
/dev/sdb: read failed after 0 of 4096 at 0: Input/output error
Found duplicate PV QSsOVbn2hAYPzWhviPc3nK0sT89rsB1n: using /dev/sdr not /dev/sdq
/dev/sdd: read failed after 0 of 4096 at 0: Input/output error
/dev/sde: read failed after 0 of 4096 at 0: Input/output error
/dev/sdg: read failed after 0 of 4096 at 0: Input/output error
/dev/sdj: read failed after 0 of 4096 at 0: Input/output error
/dev/sdl: read failed after 0 of 4096 at 0: Input/output error
/dev/sdm: read failed after 0 of 4096 at 0: Input/output error
/dev/sdo: read failed after 0 of 4096 at 0: Input/output error
VG #PV #LV #SN Attr VSize VFree
VG01 1 4 0 wz--n- 650.05G 180.05G
VolGroup00 1 11 0 wz--n- 136.56G 42.19G
VolGroup02 3 4 0 wz--n- 399.91G 0

root]#

I had to edit /etc/lvm/lvm.conf and define a filter.


# filter = [ "a/.*/" ]
filter = [ "a/dev/cciss/.*/", "a/dev/mpath/.*/", "r/.*/" ]

After finish with editing, you need to run pvscan and vgscan to make the change,
active.

Please consult with official manual to find a proper filter for your own environment.

https://access.redhat.com/knowledge/solutions/2989

Friday, March 2, 2012

Stale Lun!!!

[root /]# multipathd -k"show paths"|egrep -v "active\]\[ready|cciss|next_check"
1:0:1:0 sdba 67:64 1 [failed][faulty] XXXX...... 2/5
1:0:1:3 sdbf 67:144 1 [active][faulty] XXXXXXXX.. 4/5

I removed two luns in our SAN and then manually deleted the luns. Now I stuck with
this situation. Let's see if a reboot in the weekend will fix the issue!

[read my previous post!]

Update: a reboot fixed the issue.