April 26, 200917 yr Hello, i build my unraid server three months ago. This was my first linux system, so i am not an expert .... Until now everything works fine, but today i check the free space and found that the web interface display one error for my disk 1. I decided to start a parity check and now after some hours i get 8 errors for the same device. In the system.log i found this Apr 24 23:30:29 nas emhttp: shcmd (372): /usr/sbin/hdparm -y /dev/sdb >/dev/null Apr 24 23:30:30 nas emhttp: shcmd (373): /usr/sbin/hdparm -y /dev/sdc >/dev/null Apr 25 09:20:09 nas kernel: ata2.00: exception Emask 0x0 SAct 0x3f SErr 0x0 action 0x0 Apr 25 09:20:09 nas kernel: ata2.00: irq_stat 0x40000008 Apr 25 09:20:09 nas kernel: ata2.00: cmd 60/08:00:9f:21:a5/00:00:22:00:00/40 tag 0 ncq 4096 in Apr 25 09:20:09 nas kernel: res 41/40:00:9f:21:a5/83:00:22:00:00/40 Emask 0x409 (media error) Apr 25 09:20:09 nas kernel: ata2.00: status: { DRDY ERR } Apr 25 09:20:09 nas kernel: ata2.00: error: { UNC } Apr 25 09:20:09 nas kernel: ata2.00: configured for UDMA/133 Apr 25 09:20:09 nas kernel: ata2: EH complete Apr 25 09:20:09 nas kernel: sd 2:0:0:0: [sdc] 1953525168 512-byte hardware sectors (1000205 MB) Apr 25 09:20:09 nas kernel: sd 2:0:0:0: [sdc] Write Protect is off Apr 25 09:20:09 nas kernel: sd 2:0:0:0: [sdc] Mode Sense: 00 3a 00 00 Apr 25 09:20:09 nas kernel: sd 2:0:0:0: [sdc] Write cache: enabled, read cache: enabled, doesn't support DPO or FUA Apr 25 09:20:12 nas kernel: ata2.00: exception Emask 0x0 SAct 0xff SErr 0x0 action 0x0 Apr 25 09:20:12 nas kernel: ata2.00: irq_stat 0x40000008 Apr 25 09:20:12 nas kernel: ata2.00: cmd 60/08:28:9f:21:a5/00:00:22:00:00/40 tag 5 ncq 4096 in Apr 25 09:20:12 nas kernel: res 41/40:00:9f:21:a5/83:00:22:00:00/40 Emask 0x409 (media error) Apr 25 09:20:12 nas kernel: ata2.00: status: { DRDY ERR } Apr 25 09:20:12 nas kernel: ata2.00: error: { UNC } Apr 25 09:20:12 nas kernel: ata2.00: configured for UDMA/133 Apr 25 09:20:12 nas kernel: ata2: EH complete Apr 25 09:20:12 nas kernel: sd 2:0:0:0: [sdc] 1953525168 512-byte hardware sectors (1000205 MB) Apr 25 09:20:12 nas kernel: sd 2:0:0:0: [sdc] Write Protect is off Apr 25 09:20:12 nas kernel: sd 2:0:0:0: [sdc] Mode Sense: 00 3a 00 00 Apr 25 09:20:12 nas kernel: sd 2:0:0:0: [sdc] Write cache: enabled, read cache: enabled, doesn't support DPO or FUA Apr 25 09:20:15 nas kernel: ata2.00: exception Emask 0x0 SAct 0x3ff SErr 0x0 action 0x0 Apr 25 09:20:15 nas kernel: ata2.00: irq_stat 0x40000008 Apr 25 09:20:15 nas kernel: ata2.00: cmd 60/08:10:9f:21:a5/00:00:22:00:00/40 tag 2 ncq 4096 in Apr 25 09:20:15 nas kernel: res 41/40:00:9f:21:a5/83:00:22:00:00/40 Emask 0x409 (media error) Apr 25 09:20:15 nas kernel: ata2.00: status: { DRDY ERR } Apr 25 09:20:15 nas kernel: ata2.00: error: { UNC } Apr 25 09:20:15 nas kernel: ata2.00: configured for UDMA/133 Apr 25 09:20:15 nas kernel: ata2: EH complete Apr 25 09:20:15 nas kernel: sd 2:0:0:0: [sdc] 1953525168 512-byte hardware sectors (1000205 MB) Apr 25 09:20:15 nas kernel: sd 2:0:0:0: [sdc] Write Protect is off Apr 25 09:20:15 nas kernel: sd 2:0:0:0: [sdc] Mode Sense: 00 3a 00 00 Apr 25 09:20:15 nas kernel: sd 2:0:0:0: [sdc] Write cache: enabled, read cache: enabled, doesn't support DPO or FUA Apr 25 09:20:18 nas kernel: ata2.00: exception Emask 0x0 SAct 0x8f SErr 0x0 action 0x0 Apr 25 09:20:18 nas kernel: ata2.00: irq_stat 0x40000008 Apr 25 09:20:18 nas kernel: ata2.00: cmd 60/08:38:9f:21:a5/00:00:22:00:00/40 tag 7 ncq 4096 in Apr 25 09:20:18 nas kernel: res 41/40:00:9f:21:a5/83:00:22:00:00/40 Emask 0x409 (media error) Apr 25 09:20:18 nas kernel: ata2.00: status: { DRDY ERR } Apr 25 09:20:18 nas kernel: ata2.00: error: { UNC } Apr 25 09:20:18 nas kernel: ata2.00: configured for UDMA/133 Apr 25 09:20:18 nas kernel: ata2: EH complete Apr 25 09:20:18 nas kernel: sd 2:0:0:0: [sdc] 1953525168 512-byte hardware sectors (1000205 MB) Apr 25 09:20:18 nas kernel: sd 2:0:0:0: [sdc] Write Protect is off Apr 25 09:20:18 nas kernel: sd 2:0:0:0: [sdc] Mode Sense: 00 3a 00 00 Apr 25 09:20:18 nas kernel: sd 2:0:0:0: [sdc] Write cache: enabled, read cache: enabled, doesn't support DPO or FUA Apr 25 09:20:22 nas kernel: ata2.00: exception Emask 0x0 SAct 0x7ff SErr 0x0 action 0x0 Apr 25 09:20:22 nas kernel: ata2.00: irq_stat 0x40000008 Apr 25 09:20:22 nas kernel: ata2.00: cmd 60/08:00:9f:21:a5/00:00:22:00:00/40 tag 0 ncq 4096 in Apr 25 09:20:22 nas kernel: res 41/40:00:9f:21:a5/83:00:22:00:00/40 Emask 0x409 (media error) Apr 25 09:20:22 nas kernel: ata2.00: status: { DRDY ERR } Apr 25 09:20:22 nas kernel: ata2.00: error: { UNC } Apr 25 09:20:22 nas kernel: ata2.00: configured for UDMA/133 Apr 25 09:20:22 nas kernel: ata2: EH complete Apr 25 09:20:22 nas kernel: sd 2:0:0:0: [sdc] 1953525168 512-byte hardware sectors (1000205 MB) Apr 25 09:20:22 nas kernel: sd 2:0:0:0: [sdc] Write Protect is off Apr 25 09:20:22 nas kernel: sd 2:0:0:0: [sdc] Mode Sense: 00 3a 00 00 Apr 25 09:20:22 nas kernel: sd 2:0:0:0: [sdc] Write cache: enabled, read cache: enabled, doesn't support DPO or FUA Apr 25 09:20:25 nas kernel: ata2.00: exception Emask 0x0 SAct 0x400 SErr 0x0 action 0x0 Apr 25 09:20:25 nas kernel: ata2.00: irq_stat 0x40000008 Apr 25 09:20:25 nas kernel: ata2.00: cmd 60/08:50:9f:21:a5/00:00:22:00:00/40 tag 10 ncq 4096 in Apr 25 09:20:25 nas kernel: res 41/40:00:9f:21:a5/83:00:22:00:00/40 Emask 0x409 (media error) Apr 25 09:20:25 nas kernel: ata2.00: status: { DRDY ERR } Apr 25 09:20:25 nas kernel: ata2.00: error: { UNC } Apr 25 09:20:25 nas kernel: ata2.00: configured for UDMA/133 Apr 25 09:20:25 nas kernel: sd 2:0:0:0: [sdc] Result: hostbyte=0x00 driverbyte=0x08 Apr 25 09:20:25 nas kernel: sd 2:0:0:0: [sdc] Sense Key : 0x3 [current] [descriptor] Apr 25 09:20:25 nas kernel: Descriptor sense data with sense descriptors (in hex): Apr 25 09:20:25 nas kernel: 72 03 11 04 00 00 00 0c 00 0a 80 00 00 00 00 00 Apr 25 09:20:25 nas kernel: 22 a5 21 9f Apr 25 09:20:25 nas kernel: sd 2:0:0:0: [sdc] ASC=0x11 ASCQ=0x4 Apr 25 09:20:25 nas kernel: end_request: I/O error, dev sdc, sector 581247391 Apr 25 09:20:25 nas kernel: ata2: EH complete Apr 25 09:20:25 nas kernel: md: disk1 read error Apr 25 09:20:25 nas kernel: handle_stripe read error: 581247328/1, count: 1 Apr 25 09:20:25 nas kernel: sd 2:0:0:0: [sdc] 1953525168 512-byte hardware sectors (1000205 MB) Apr 25 09:20:25 nas kernel: sd 2:0:0:0: [sdc] Write Protect is off Apr 25 09:20:25 nas kernel: sd 2:0:0:0: [sdc] Mode Sense: 00 3a 00 00 Apr 25 09:20:25 nas kernel: sd 2:0:0:0: [sdc] Write cache: enabled, read cache: enabled, doesn't support DPO or FUA Apr 25 10:34:36 nas emhttp: shcmd (374): /usr/sbin/hdparm -y /dev/sdd >/dev/null Apr 25 10:34:37 nas emhttp: shcmd (375): /usr/sbin/hdparm -y /dev/sde >/dev/null Apr 25 10:34:38 nas emhttp: shcmd (376): /usr/sbin/hdparm -y /dev/sdf >/dev/null Apr 25 10:34:38 nas emhttp: shcmd (377): /usr/sbin/hdparm -y /dev/sdg >/dev/null Apr 25 11:25:40 nas emhttp: shcmd (378): /usr/sbin/hdparm -y /dev/sdb >/dev/null i think this was the first error i saw this morning.
April 26, 200917 yr Author Part 2 ... For the parity check i get this Apr 26 11:25:59 nas emhttp: shcmd (392): /usr/sbin/hdparm -y /dev/sdb >/dev/null Apr 26 11:26:00 nas emhttp: shcmd (393): /usr/sbin/hdparm -y /dev/sdd >/dev/null Apr 26 11:26:01 nas emhttp: shcmd (394): /usr/sbin/hdparm -y /dev/sde >/dev/null Apr 26 12:58:44 nas kernel: mdcmd (6078): clear Apr 26 12:58:51 nas kernel: mdcmd (6080): check Apr 26 12:58:51 nas kernel: md: recovery thread woken up ... Apr 26 12:58:51 nas kernel: md: recovery thread checking parity... Apr 26 12:58:51 nas kernel: md: using 1152k window, over a total of 976762552 blocks. Apr 26 13:47:11 nas kernel: ata2.00: exception Emask 0x0 SAct 0x7 SErr 0x0 action 0x0 Apr 26 13:47:11 nas kernel: ata2.00: irq_stat 0x40000008 Apr 26 13:47:11 nas kernel: ata2.00: cmd 60/10:08:77:b1:ed/03:00:14:00:00/40 tag 1 ncq 401408 in Apr 26 13:47:11 nas kernel: res 41/40:00:4a:b4:ed/83:00:14:00:00/40 Emask 0x409 (media error) Apr 26 13:47:11 nas kernel: ata2.00: status: { DRDY ERR } Apr 26 13:47:11 nas kernel: ata2.00: error: { UNC } Apr 26 13:47:11 nas kernel: ata2.00: configured for UDMA/133 Apr 26 13:47:11 nas kernel: ata2: EH complete Apr 26 13:47:11 nas kernel: sd 2:0:0:0: [sdc] 1953525168 512-byte hardware sectors (1000205 MB) Apr 26 13:47:11 nas kernel: sd 2:0:0:0: [sdc] Write Protect is off Apr 26 13:47:11 nas kernel: sd 2:0:0:0: [sdc] Mode Sense: 00 3a 00 00 Apr 26 13:47:11 nas kernel: sd 2:0:0:0: [sdc] Write cache: enabled, read cache: enabled, doesn't support DPO or FUA Apr 26 13:47:14 nas kernel: ata2.00: exception Emask 0x0 SAct 0x7 SErr 0x0 action 0x0 Apr 26 13:47:14 nas kernel: ata2.00: irq_stat 0x40000008 Apr 26 13:47:14 nas kernel: ata2.00: cmd 60/10:08:77:b1:ed/03:00:14:00:00/40 tag 1 ncq 401408 in Apr 26 13:47:14 nas kernel: res 41/40:00:4a:b4:ed/83:00:14:00:00/40 Emask 0x409 (media error) Apr 26 13:47:14 nas kernel: ata2.00: status: { DRDY ERR } Apr 26 13:47:14 nas kernel: ata2.00: error: { UNC } Apr 26 13:47:14 nas kernel: ata2.00: configured for UDMA/133 Apr 26 13:47:14 nas kernel: ata2: EH complete Apr 26 13:47:14 nas kernel: sd 2:0:0:0: [sdc] 1953525168 512-byte hardware sectors (1000205 MB) Apr 26 13:47:14 nas kernel: sd 2:0:0:0: [sdc] Write Protect is off Apr 26 13:47:14 nas kernel: sd 2:0:0:0: [sdc] Mode Sense: 00 3a 00 00 Apr 26 13:47:14 nas kernel: sd 2:0:0:0: [sdc] Write cache: enabled, read cache: enabled, doesn't support DPO or FUA Apr 26 13:47:18 nas kernel: ata2.00: exception Emask 0x0 SAct 0x7 SErr 0x0 action 0x0 Apr 26 13:47:18 nas kernel: ata2.00: irq_stat 0x40000008 Apr 26 13:47:18 nas kernel: ata2.00: cmd 60/10:08:77:b1:ed/03:00:14:00:00/40 tag 1 ncq 401408 in Apr 26 13:47:18 nas kernel: res 41/40:00:4a:b4:ed/83:00:14:00:00/40 Emask 0x409 (media error) Apr 26 13:47:18 nas kernel: ata2.00: status: { DRDY ERR } Apr 26 13:47:18 nas kernel: ata2.00: error: { UNC } Apr 26 13:47:18 nas kernel: ata2.00: configured for UDMA/133 Apr 26 13:47:18 nas kernel: ata2: EH complete Apr 26 13:47:18 nas kernel: sd 2:0:0:0: [sdc] 1953525168 512-byte hardware sectors (1000205 MB) Apr 26 13:47:18 nas kernel: sd 2:0:0:0: [sdc] Write Protect is off Apr 26 13:47:18 nas kernel: sd 2:0:0:0: [sdc] Mode Sense: 00 3a 00 00 Apr 26 13:47:18 nas kernel: sd 2:0:0:0: [sdc] Write cache: enabled, read cache: enabled, doesn't support DPO or FUA Apr 26 13:47:22 nas kernel: ata2.00: exception Emask 0x0 SAct 0x7 SErr 0x0 action 0x0 Apr 26 13:47:22 nas kernel: ata2.00: irq_stat 0x40000008 Apr 26 13:47:22 nas kernel: ata2.00: cmd 60/10:08:77:b1:ed/03:00:14:00:00/40 tag 1 ncq 401408 in Apr 26 13:47:22 nas kernel: res 41/40:00:4a:b4:ed/83:00:14:00:00/40 Emask 0x409 (media error) Apr 26 13:47:22 nas kernel: ata2.00: status: { DRDY ERR } Apr 26 13:47:22 nas kernel: ata2.00: error: { UNC } Apr 26 13:47:22 nas kernel: ata2.00: configured for UDMA/133 Apr 26 13:47:22 nas kernel: ata2: EH complete Apr 26 13:47:22 nas kernel: sd 2:0:0:0: [sdc] 1953525168 512-byte hardware sectors (1000205 MB) Apr 26 13:47:22 nas kernel: sd 2:0:0:0: [sdc] Write Protect is off Apr 26 13:47:22 nas kernel: sd 2:0:0:0: [sdc] Mode Sense: 00 3a 00 00 Apr 26 13:47:22 nas kernel: sd 2:0:0:0: [sdc] Write cache: enabled, read cache: enabled, doesn't support DPO or FUA Apr 26 13:47:25 nas kernel: ata2.00: exception Emask 0x0 SAct 0x7 SErr 0x0 action 0x0 Apr 26 13:47:25 nas kernel: ata2.00: irq_stat 0x40000008 Apr 26 13:47:25 nas kernel: ata2.00: cmd 60/10:08:77:b1:ed/03:00:14:00:00/40 tag 1 ncq 401408 in Apr 26 13:47:25 nas kernel: res 41/40:00:4a:b4:ed/83:00:14:00:00/40 Emask 0x409 (media error) Apr 26 13:47:25 nas kernel: ata2.00: status: { DRDY ERR } Apr 26 13:47:25 nas kernel: ata2.00: error: { UNC } Apr 26 13:47:25 nas kernel: ata2.00: configured for UDMA/133 Apr 26 13:47:25 nas kernel: ata2: EH complete Apr 26 13:47:25 nas kernel: sd 2:0:0:0: [sdc] 1953525168 512-byte hardware sectors (1000205 MB) Apr 26 13:47:25 nas kernel: sd 2:0:0:0: [sdc] Write Protect is off Apr 26 13:47:25 nas kernel: sd 2:0:0:0: [sdc] Mode Sense: 00 3a 00 00 Apr 26 13:47:25 nas kernel: sd 2:0:0:0: [sdc] Write cache: enabled, read cache: enabled, doesn't support DPO or FUA Apr 26 13:47:29 nas kernel: ata2.00: exception Emask 0x0 SAct 0x7 SErr 0x0 action 0x0 Apr 26 13:47:29 nas kernel: ata2.00: irq_stat 0x40000008 Apr 26 13:47:29 nas kernel: ata2.00: cmd 60/f0:00:47:c0:ed/01:00:14:00:00/40 tag 0 ncq 253952 in Apr 26 13:47:29 nas kernel: res 41/40:00:87:c0:ed/83:00:14:00:00/40 Emask 0x409 (media error) Apr 26 13:47:29 nas kernel: ata2.00: status: { DRDY ERR } Apr 26 13:47:29 nas kernel: ata2.00: error: { UNC } Apr 26 13:47:29 nas kernel: ata2.00: configured for UDMA/133 Apr 26 13:47:29 nas kernel: ata2: EH complete Apr 26 13:47:29 nas kernel: sd 2:0:0:0: [sdc] 1953525168 512-byte hardware sectors (1000205 MB) Apr 26 13:47:29 nas kernel: sd 2:0:0:0: [sdc] Write Protect is off Apr 26 13:47:29 nas kernel: sd 2:0:0:0: [sdc] Mode Sense: 00 3a 00 00 Apr 26 13:47:29 nas kernel: sd 2:0:0:0: [sdc] Write cache: enabled, read cache: enabled, doesn't support DPO or FUA Apr 26 13:47:33 nas kernel: ata2.00: exception Emask 0x0 SAct 0x7 SErr 0x0 action 0x0 Apr 26 13:47:33 nas kernel: ata2.00: irq_stat 0x40000008 Apr 26 13:47:33 nas kernel: ata2.00: cmd 60/f0:10:47:c0:ed/01:00:14:00:00/40 tag 2 ncq 253952 in Apr 26 13:47:33 nas kernel: res 41/40:00:87:c0:ed/83:00:14:00:00/40 Emask 0x409 (media error) Apr 26 13:47:33 nas kernel: ata2.00: status: { DRDY ERR } Apr 26 13:47:33 nas kernel: ata2.00: error: { UNC } Apr 26 13:47:33 nas kernel: ata2.00: configured for UDMA/133 Apr 26 13:47:33 nas kernel: ata2: EH complete Apr 26 13:47:33 nas kernel: sd 2:0:0:0: [sdc] 1953525168 512-byte hardware sectors (1000205 MB) Apr 26 13:47:33 nas kernel: sd 2:0:0:0: [sdc] Write Protect is off Apr 26 13:47:33 nas kernel: sd 2:0:0:0: [sdc] Mode Sense: 00 3a 00 00 Apr 26 13:47:33 nas kernel: sd 2:0:0:0: [sdc] Write cache: enabled, read cache: enabled, doesn't support DPO or FUA Apr 26 13:47:37 nas kernel: ata2.00: exception Emask 0x0 SAct 0x7 SErr 0x0 action 0x0 Apr 26 13:47:37 nas kernel: ata2.00: irq_stat 0x40000008 Apr 26 13:47:37 nas kernel: ata2.00: cmd 60/f0:00:47:c0:ed/01:00:14:00:00/40 tag 0 ncq 253952 in Apr 26 13:47:37 nas kernel: res 41/40:00:87:c0:ed/83:00:14:00:00/40 Emask 0x409 (media error) Apr 26 13:47:37 nas kernel: ata2.00: status: { DRDY ERR } Apr 26 13:47:37 nas kernel: ata2.00: error: { UNC } Apr 26 13:47:37 nas kernel: ata2.00: configured for UDMA/133 Apr 26 13:47:37 nas kernel: ata2: EH complete Apr 26 13:47:37 nas kernel: sd 2:0:0:0: [sdc] 1953525168 512-byte hardware sectors (1000205 MB) Apr 26 13:47:37 nas kernel: sd 2:0:0:0: [sdc] Write Protect is off Apr 26 13:47:37 nas kernel: sd 2:0:0:0: [sdc] Mode Sense: 00 3a 00 00 Apr 26 13:47:37 nas kernel: sd 2:0:0:0: [sdc] Write cache: enabled, read cache: enabled, doesn't support DPO or FUA Apr 26 13:47:42 nas kernel: ata2.00: exception Emask 0x0 SAct 0x7 SErr 0x0 action 0x0 Apr 26 13:47:42 nas kernel: ata2.00: irq_stat 0x40000008 Apr 26 13:47:42 nas kernel: ata2.00: cmd 60/10:08:77:b1:ed/03:00:14:00:00/40 tag 1 ncq 401408 in Apr 26 13:47:42 nas kernel: res 41/40:00:4a:b4:ed/83:00:14:00:00/40 Emask 0x409 (media error) Apr 26 13:47:42 nas kernel: ata2.00: status: { DRDY ERR } Apr 26 13:47:42 nas kernel: ata2.00: error: { UNC } Apr 26 13:47:42 nas kernel: ata2.00: configured for UDMA/133 Apr 26 13:47:42 nas kernel: sd 2:0:0:0: [sdc] Result: hostbyte=0x00 driverbyte=0x08 Apr 26 13:47:42 nas kernel: sd 2:0:0:0: [sdc] Sense Key : 0x3 [current] [descriptor] Apr 26 13:47:42 nas kernel: Descriptor sense data with sense descriptors (in hex): Apr 26 13:47:42 nas kernel: 72 03 11 04 00 00 00 0c 00 0a 80 00 00 00 00 00 Apr 26 13:47:42 nas kernel: 14 ed b4 4a Apr 26 13:47:42 nas kernel: sd 2:0:0:0: [sdc] ASC=0x11 ASCQ=0x4 Apr 26 13:47:42 nas kernel: end_request: I/O error, dev sdc, sector 351122506 Apr 26 13:47:42 nas kernel: ata2: EH complete Apr 26 13:47:42 nas kernel: sd 2:0:0:0: [sdc] 1953525168 512-byte hardware sectors (1000205 MB) Apr 26 13:47:42 nas kernel: sd 2:0:0:0: [sdc] Write Protect is off Apr 26 13:47:42 nas kernel: sd 2:0:0:0: [sdc] Mode Sense: 00 3a 00 00 Apr 26 13:47:42 nas kernel: sd 2:0:0:0: [sdc] Write cache: enabled, read cache: enabled, doesn't support DPO or FUA Apr 26 13:47:42 nas kernel: md: disk1 read error Apr 26 13:47:42 nas kernel: handle_stripe read error: 351122440/1, count: 1 Apr 26 13:47:42 nas kernel: md: disk1 read error Apr 26 13:47:42 nas kernel: handle_stripe read error: 351122448/1, count: 1 Apr 26 13:47:42 nas kernel: md: disk1 read error Apr 26 13:47:42 nas kernel: handle_stripe read error: 351122456/1, count: 1 Apr 26 13:47:42 nas kernel: md: disk1 read error Apr 26 13:47:42 nas kernel: handle_stripe read error: 351122464/1, count: 1 Apr 26 13:47:42 nas kernel: md: disk1 read error Apr 26 13:47:42 nas kernel: handle_stripe read error: 351122472/1, count: 1 Apr 26 13:47:42 nas kernel: md: disk1 read error Apr 26 13:47:42 nas kernel: handle_stripe read error: 351122480/1, count: 1 Apr 26 13:47:42 nas kernel: md: disk1 read error Apr 26 13:47:42 nas kernel: handle_stripe read error: 351122488/1, count: 1 Apr 26 13:47:42 nas kernel: md: disk1 read error Apr 26 13:47:42 nas kernel: handle_stripe read error: 351122496/1, count: 1 Apr 26 13:47:46 nas kernel: ata2.00: exception Emask 0x0 SAct 0xf SErr 0x0 action 0x0 Apr 26 13:47:46 nas kernel: ata2.00: irq_stat 0x40000008 Apr 26 13:47:46 nas kernel: ata2.00: cmd 60/f0:08:37:d4:ed/01:00:14:00:00/40 tag 1 ncq 253952 in Apr 26 13:47:46 nas kernel: res 41/40:00:5c:d4:ed/83:00:14:00:00/40 Emask 0x409 (media error) Apr 26 13:47:46 nas kernel: ata2.00: status: { DRDY ERR } Apr 26 13:47:46 nas kernel: ata2.00: error: { UNC } Apr 26 13:47:46 nas kernel: ata2.00: configured for UDMA/133 Apr 26 13:47:46 nas kernel: ata2: EH complete Apr 26 13:47:46 nas kernel: sd 2:0:0:0: [sdc] 1953525168 512-byte hardware sectors (1000205 MB) Apr 26 13:47:46 nas kernel: sd 2:0:0:0: [sdc] Write Protect is off Apr 26 13:47:46 nas kernel: sd 2:0:0:0: [sdc] Mode Sense: 00 3a 00 00 Apr 26 13:47:46 nas kernel: sd 2:0:0:0: [sdc] Write cache: enabled, read cache: enabled, doesn't support DPO or FUA Apr 26 13:47:53 nas kernel: ata2.00: exception Emask 0x0 SAct 0x3f SErr 0x0 action 0x0 Apr 26 13:47:53 nas kernel: ata2.00: irq_stat 0x40000008 Apr 26 13:47:53 nas kernel: ata2.00: cmd 60/b8:08:27:f2:ed/03:00:14:00:00/40 tag 1 ncq 487424 in Apr 26 13:47:53 nas kernel: res 41/40:00:47:f4:ed/83:00:14:00:00/40 Emask 0x409 (media error) Apr 26 13:47:53 nas kernel: ata2.00: status: { DRDY ERR } Apr 26 13:47:53 nas kernel: ata2.00: error: { UNC } Apr 26 13:47:53 nas kernel: ata2.00: configured for UDMA/133 Apr 26 13:47:53 nas kernel: ata2: EH complete Apr 26 13:47:53 nas kernel: sd 2:0:0:0: [sdc] 1953525168 512-byte hardware sectors (1000205 MB) Apr 26 13:47:53 nas kernel: sd 2:0:0:0: [sdc] Write Protect is off Apr 26 13:47:53 nas kernel: sd 2:0:0:0: [sdc] Mode Sense: 00 3a 00 00 Apr 26 13:47:53 nas kernel: sd 2:0:0:0: [sdc] Write cache: enabled, read cache: enabled, doesn't support DPO or FUA Apr 26 13:47:56 nas kernel: ata2.00: exception Emask 0x0 SAct 0x3f SErr 0x0 action 0x0 Apr 26 13:47:56 nas kernel: ata2.00: irq_stat 0x40000008 Apr 26 13:47:56 nas kernel: ata2.00: cmd 60/b8:20:27:f2:ed/03:00:14:00:00/40 tag 4 ncq 487424 in Apr 26 13:47:56 nas kernel: res 41/40:00:47:f4:ed/83:00:14:00:00/40 Emask 0x409 (media error) Apr 26 13:47:56 nas kernel: ata2.00: status: { DRDY ERR } Apr 26 13:47:56 nas kernel: ata2.00: error: { UNC } Apr 26 13:47:56 nas kernel: ata2.00: configured for UDMA/133 Apr 26 13:47:56 nas kernel: ata2: EH complete Apr 26 13:47:56 nas kernel: sd 2:0:0:0: [sdc] 1953525168 512-byte hardware sectors (1000205 MB) Apr 26 13:47:56 nas kernel: sd 2:0:0:0: [sdc] Write Protect is off Apr 26 13:47:56 nas kernel: sd 2:0:0:0: [sdc] Mode Sense: 00 3a 00 00 Apr 26 13:47:56 nas kernel: sd 2:0:0:0: [sdc] Write cache: enabled, read cache: enabled, doesn't support DPO or FUA Apr 26 13:48:02 nas kernel: ata2.00: exception Emask 0x0 SAct 0x7f SErr 0x0 action 0x0 Apr 26 13:48:02 nas kernel: ata2.00: irq_stat 0x40000008 Apr 26 13:48:02 nas kernel: ata2.00: cmd 60/b8:08:27:f2:ed/03:00:14:00:00/40 tag 1 ncq 487424 in Apr 26 13:48:02 nas kernel: res 41/40:00:47:f4:ed/83:00:14:00:00/40 Emask 0x409 (media error) Apr 26 13:48:02 nas kernel: ata2.00: status: { DRDY ERR } Apr 26 13:48:02 nas kernel: ata2.00: error: { UNC } Apr 26 13:48:02 nas kernel: ata2.00: configured for UDMA/133 Apr 26 13:48:02 nas kernel: ata2: EH complete Apr 26 13:48:02 nas kernel: sd 2:0:0:0: [sdc] 1953525168 512-byte hardware sectors (1000205 MB) Apr 26 13:48:02 nas kernel: sd 2:0:0:0: [sdc] Write Protect is off Apr 26 13:48:02 nas kernel: sd 2:0:0:0: [sdc] Mode Sense: 00 3a 00 00 Apr 26 13:48:02 nas kernel: sd 2:0:0:0: [sdc] Write cache: enabled, read cache: enabled, doesn't support DPO or FUA Apr 26 13:48:05 nas kernel: ata2.00: exception Emask 0x0 SAct 0x7f SErr 0x0 action 0x0 Apr 26 13:48:05 nas kernel: ata2.00: irq_stat 0x40000008 Apr 26 13:48:05 nas kernel: ata2.00: cmd 60/b8:28:27:f2:ed/03:00:14:00:00/40 tag 5 ncq 487424 in Apr 26 13:48:05 nas kernel: res 41/40:00:47:f4:ed/83:00:14:00:00/40 Emask 0x409 (media error) Apr 26 13:48:05 nas kernel: ata2.00: status: { DRDY ERR } Apr 26 13:48:05 nas kernel: ata2.00: error: { UNC } Apr 26 13:48:05 nas kernel: ata2.00: configured for UDMA/133 Apr 26 13:48:05 nas kernel: ata2: EH complete Apr 26 13:48:05 nas kernel: sd 2:0:0:0: [sdc] 1953525168 512-byte hardware sectors (1000205 MB) Apr 26 13:48:05 nas kernel: sd 2:0:0:0: [sdc] Write Protect is off Apr 26 13:48:05 nas kernel: sd 2:0:0:0: [sdc] Mode Sense: 00 3a 00 00 Apr 26 13:48:05 nas kernel: sd 2:0:0:0: [sdc] Write cache: enabled, read cache: enabled, doesn't support DPO or FUA Apr 26 13:48:08 nas kernel: ata2.00: exception Emask 0x0 SAct 0x7f SErr 0x0 action 0x0 Apr 26 13:48:08 nas kernel: ata2.00: irq_stat 0x40000008 Apr 26 13:48:08 nas kernel: ata2.00: cmd 60/b8:08:27:f2:ed/03:00:14:00:00/40 tag 1 ncq 487424 in Apr 26 13:48:08 nas kernel: res 41/40:00:47:f4:ed/83:00:14:00:00/40 Emask 0x409 (media error) Apr 26 13:48:08 nas kernel: ata2.00: status: { DRDY ERR } Apr 26 13:48:08 nas kernel: ata2.00: error: { UNC } Apr 26 13:48:08 nas kernel: ata2.00: configured for UDMA/133 Apr 26 13:48:08 nas kernel: ata2: EH complete Apr 26 13:48:08 nas kernel: sd 2:0:0:0: [sdc] 1953525168 512-byte hardware sectors (1000205 MB) Apr 26 13:48:08 nas kernel: sd 2:0:0:0: [sdc] Write Protect is off Apr 26 13:48:08 nas kernel: sd 2:0:0:0: [sdc] Mode Sense: 00 3a 00 00 Apr 26 13:48:08 nas kernel: sd 2:0:0:0: [sdc] Write cache: enabled, read cache: enabled, doesn't support DPO or FUA Apr 26 16:29:04 nas emhttp: shcmd (395): /usr/sbin/hdparm -y /dev/sdf >/dev/null What is the best way to fix this ? greetings from germany
April 26, 200917 yr Author some more infos..... From my point SMART seems to be ok or what do you think ? Statistics for /dev/sdc WDC_WD10EADS-00L5B1_WD-WCAU46045241 smartctl version 5.38 [i486-slackware-linux-gnu] Copyright (C) 2002-8 Bruce Allen Home page is http://smartmontools.sourceforge.net/ === START OF INFORMATION SECTION === Device Model: WDC WD10EADS-00L5B1 Serial Number: WD-WCAU46045241 Firmware Version: 01.01A01 User Capacity: 1,000,204,886,016 bytes Device is: Not in smartctl database [for details use: -P showall] ATA Version is: 8 ATA Standard is: Exact ATA specification draft version not indicated Local Time is: Sun Apr 26 19:22:08 2009 GMT-1 SMART support is: Available - device has SMART capability. SMART support is: Enabled === START OF READ SMART DATA SECTION === SMART overall-health self-assessment test result: PASSED General SMART Values: Offline data collection status: (0x05) Offline data collection activity was aborted by an interrupting command from host. Auto Offline Data Collection: Disabled. Self-test execution status: ( 0) The previous self-test routine completed without error or no self-test has ever been run. Total time to complete Offline data collection: (24000) seconds. Offline data collection capabilities: (0x7b) SMART execute Offline immediate. Auto Offline data collection on/off support. Suspend Offline collection upon new command. Offline surface scan supported. Self-test supported. Conveyance Self-test supported. Selective Self-test supported. SMART capabilities: (0x0003) Saves SMART data before entering power-saving mode. Supports SMART auto save timer. Error logging capability: (0x01) Error logging supported. General Purpose Logging supported. Short self-test routine recommended polling time: ( 2) minutes. Extended self-test routine recommended polling time: ( 255) minutes. Conveyance self-test routine recommended polling time: ( 5) minutes. SCT capabilities: (0x303f) SCT Status supported. SCT Feature Control supported. SCT Data Table supported. SMART Attributes Data Structure revision number: 16 Vendor Specific SMART Attributes with Thresholds: ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE 1 Raw_Read_Error_Rate 0x002f 200 200 051 Pre-fail Always - 0 3 Spin_Up_Time 0x0027 160 158 021 Pre-fail Always - 6966 4 Start_Stop_Count 0x0032 100 100 000 Old_age Always - 335 5 Reallocated_Sector_Ct 0x0033 200 200 140 Pre-fail Always - 0 7 Seek_Error_Rate 0x002e 100 253 000 Old_age Always - 0 9 Power_On_Hours 0x0032 098 098 000 Old_age Always - 1802 10 Spin_Retry_Count 0x0032 100 100 000 Old_age Always - 0 11 Calibration_Retry_Count 0x0032 100 253 000 Old_age Always - 0 12 Power_Cycle_Count 0x0032 100 100 000 Old_age Always - 41 192 Power-Off_Retract_Count 0x0032 200 200 000 Old_age Always - 7 193 Load_Cycle_Count 0x0032 200 200 000 Old_age Always - 335 194 Temperature_Celsius 0x0022 126 107 000 Old_age Always - 24 196 Reallocated_Event_Count 0x0032 200 200 000 Old_age Always - 0 197 Current_Pending_Sector 0x0032 200 200 000 Old_age Always - 3 198 Offline_Uncorrectable 0x0030 100 253 000 Old_age Offline - 0 199 UDMA_CRC_Error_Count 0x0032 200 200 000 Old_age Always - 0 200 Multi_Zone_Error_Rate 0x0008 100 253 000 Old_age Offline - 0 SMART Error Log Version: 1 No Errors Logged SMART Self-test log structure revision number 1 No self-tests have been logged. [To run self-tests, use: smartctl -t] SMART Selective self-test log data structure revision number 1 SPAN MIN_LBA MAX_LBA CURRENT_TEST_STATUS 1 0 0 Not_testing 2 0 0 Not_testing 3 0 0 Not_testing 4 0 0 Not_testing 5 0 0 Not_testing Selective self-test flags (0x0): After scanning selected spans, do NOT read-scan remainder of disk. If Selective self-test is pending on power-up, resume after 0 minute delay.
April 26, 200917 yr Author and the last for today... I make a file system check with this result: Checking /dev/md1 (/dev/sdc) reiserfsck 3.6.19 (2003 www.namesys.com) ************************************************************* ** If you are using the latest reiserfsprogs and it fails ** ** please email bug reports to [email protected], ** ** providing as much information as possible -- your ** ** hardware, kernel, patches, settings, all reiserfsck ** ** messages (including version), the reiserfsck logfile, ** ** check the syslog file for any related information. ** ** If you would like advice on using this program, support ** ** is available for $25 at www.namesys.com/support.html. ** ************************************************************* Will read-only check consistency of the filesystem on /dev/md1 Will put log info to 'stdout' ########### reiserfsck --check started at Sun Apr 26 19:23:38 2009 ########### Replaying journal.. Reiserfs journal '/dev/md1' in blocks [18..8211]: 0 transactions replayed Checking internal tree..finished Comparing bitmaps..finished Checking Semantic tree: finished No corruptions found There are on the filesystem: Leaves 245609 Internal nodes 1503 Directories 1536 Other files 133714 Data block pointers 236713436 (0 of them are zero) Safe links 0 ########### reiserfsck finished at Sun Apr 26 20:00:25 2009 ########### /dev/md1 mounted on /mnt/disk1 Samba Started
April 26, 200917 yr It looks like you have possibly 3 bad sectors (see the RAW_VALUE of Current_Pending_Sector). They have not yet been permanently marked off, so may still be recoverable. Please do the SMART long test on that drive, found farther down in the Troubleshooting page, Obtaining a SMART report section. It takes a LONG time to run! After you start it, wait 4 or 5 hours, then obtain a SMART report and examine the percentage complete for the test. That will give a rough idea when to check again. Once complete (100%), then keep the SMART report, and compare the values with the one you displayed above. The RAW_VALUE of Current_Pending_Sector should go to zero, and the Reallocated_Sector_Ct will show how many of those sectors could not be recovered, and are now remapped. You should now be able to run one more parity check, to clean up any out-of-sync parity blocks.
April 27, 200917 yr Author Thanks for you reply!! Over night i start a second parity check. It finished with 0 errors..... As you have recommended, now i start a long smart test. I will post the result when i come back from work. My main question for now is: Have I lost data? And if so, which files are concerned? thanks for you help
April 27, 200917 yr Author Hi, after 9 hours i see this: SMART Self-test log structure revision number 1 Num Test_Description Status Remaining LifeTime(hours) LBA_of_first_error # 1 Extended offline Completed: read failure 90% 1814 351125639 i´m not sure what this mean..... did the test need 9 hour for 10% of the disk, and will be finish in 81 hours or did the test stop....
April 27, 200917 yr Hi, after 9 hours i see this: SMART Self-test log structure revision number 1 Num Test_Description Status Remaining LifeTime(hours) LBA_of_first_error # 1 Extended offline Completed: read failure 90% 1814 351125639 i´m not sure what this mean..... did the test need 9 hour for 10% of the disk, and will be finish in 81 hours or did the test stop.... I would interpret that as the test having aborted with a read failure. What does it say at the top of the SMART report. Usually it gives some status there too. In your previous post of a SMART report, it indicated the "extended" (long) test to take approx 255 minutes, so that is about the time it should take. (a bit over 4 hours) Extended self-test routine recommended polling time: ( 255) minutes. Joe L.
April 27, 200917 yr Author Hi, this is the complete output: Statistics for /dev/sdc WDC_WD10EADS-00L5B1_WD-WCAU46045241 smartctl version 5.38 [i486-slackware-linux-gnu] Copyright (C) 2002-8 Bruce Allen Home page is http://smartmontools.sourceforge.net/ === START OF INFORMATION SECTION === Device Model: WDC WD10EADS-00L5B1 Serial Number: WD-WCAU46045241 Firmware Version: 01.01A01 User Capacity: 1,000,204,886,016 bytes Device is: Not in smartctl database [for details use: -P showall] ATA Version is: 8 ATA Standard is: Exact ATA specification draft version not indicated Local Time is: Mon Apr 27 17:37:33 2009 GMT-1 SMART support is: Available - device has SMART capability. SMART support is: Enabled === START OF READ SMART DATA SECTION === SMART overall-health self-assessment test result: PASSED General SMART Values: Offline data collection status: (0x05) Offline data collection activity was aborted by an interrupting command from host. Auto Offline Data Collection: Disabled. Self-test execution status: ( 121) The previous self-test completed having the read element of the test failed. Total time to complete Offline data collection: (24000) seconds. Offline data collection capabilities: (0x7b) SMART execute Offline immediate. Auto Offline data collection on/off support. Suspend Offline collection upon new command. Offline surface scan supported. Self-test supported. Conveyance Self-test supported. Selective Self-test supported. SMART capabilities: (0x0003) Saves SMART data before entering power-saving mode. Supports SMART auto save timer. Error logging capability: (0x01) Error logging supported. General Purpose Logging supported. Short self-test routine recommended polling time: ( 2) minutes. Extended self-test routine recommended polling time: ( 255) minutes. Conveyance self-test routine recommended polling time: ( 5) minutes. SCT capabilities: (0x303f) SCT Status supported. SCT Feature Control supported. SCT Data Table supported. SMART Attributes Data Structure revision number: 16 Vendor Specific SMART Attributes with Thresholds: ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE 1 Raw_Read_Error_Rate 0x002f 200 200 051 Pre-fail Always - 0 3 Spin_Up_Time 0x0027 160 158 021 Pre-fail Always - 6975 4 Start_Stop_Count 0x0032 100 100 000 Old_age Always - 337 5 Reallocated_Sector_Ct 0x0033 200 200 140 Pre-fail Always - 0 7 Seek_Error_Rate 0x002e 100 253 000 Old_age Always - 0 9 Power_On_Hours 0x0032 098 098 000 Old_age Always - 1824 10 Spin_Retry_Count 0x0032 100 100 000 Old_age Always - 0 11 Calibration_Retry_Count 0x0032 100 253 000 Old_age Always - 0 12 Power_Cycle_Count 0x0032 100 100 000 Old_age Always - 41 192 Power-Off_Retract_Count 0x0032 200 200 000 Old_age Always - 7 193 Load_Cycle_Count 0x0032 200 200 000 Old_age Always - 337 194 Temperature_Celsius 0x0022 128 107 000 Old_age Always - 22 196 Reallocated_Event_Count 0x0032 200 200 000 Old_age Always - 0 197 Current_Pending_Sector 0x0032 200 200 000 Old_age Always - 3 198 Offline_Uncorrectable 0x0030 100 253 000 Old_age Offline - 0 199 UDMA_CRC_Error_Count 0x0032 200 200 000 Old_age Always - 0 200 Multi_Zone_Error_Rate 0x0008 100 253 000 Old_age Offline - 0 SMART Error Log Version: 1 No Errors Logged SMART Self-test log structure revision number 1 Num Test_Description Status Remaining LifeTime(hours) LBA_of_first_error # 1 Extended offline Completed: read failure 90% 1814 351125639 SMART Selective self-test log data structure revision number 1 SPAN MIN_LBA MAX_LBA CURRENT_TEST_STATUS 1 0 0 Not_testing 2 0 0 Not_testing 3 0 0 Not_testing 4 0 0 Not_testing 5 0 0 Not_testing Selective self-test flags (0x0): After scanning selected spans, do NOT read-scan remainder of disk. If Selective self-test is pending on power-up, resume after 0 minute delay.
Archived
This topic is now archived and is closed to further replies.