December 9, 2015

I have a 3TB Seagate drive that is dropping off with write errors. I'm thinking of replacing it, but the problem is that SMART looks fairly clean, and when I reboot to put it back into the array it actually shows as "No Device". Is it possible that the cabling or my server is losing connections somehow? Every time I reseat it, it works for a week or so and then errors out on writes again. It's an HP Microserver G7; the bays are just slide-in cages.

Relevant outputs:

root@Tower:~# ls -l /dev/disk/by-id/
total 0
lrwxrwxrwx 1 root root 9 2015-12-08 23:52 ata-ST3000DM001-1CH166_Z1F37YED -> ../../sdb
lrwxrwxrwx 1 root root 10 2015-12-08 23:52 ata-ST3000DM001-1CH166_Z1F37YED-part1 -> ../../sdb1
lrwxrwxrwx 1 root root 9 2015-12-08 23:52 ata-ST3000DM001-1CH166_Z1F40YWL -> ../../sdc
lrwxrwxrwx 1 root root 10 2015-12-08 23:52 ata-ST3000DM001-1CH166_Z1F40YWL-part1 -> ../../sdc1
lrwxrwxrwx 1 root root 9 2015-12-08 23:52 scsi-SATA_ST3000DM001-1CH_Z1F37YED -> ../../sdb
lrwxrwxrwx 1 root root 10 2015-12-08 23:52 scsi-SATA_ST3000DM001-1CH_Z1F37YED-part1 -> ../../sdb1
lrwxrwxrwx 1 root root 9 2015-12-08 23:52 scsi-SATA_ST3000DM001-1CH_Z1F40YWL -> ../../sdc
lrwxrwxrwx 1 root root 10 2015-12-08 23:52 scsi-SATA_ST3000DM001-1CH_Z1F40YWL-part1 -> ../../sdc1
lrwxrwxrwx 1 root root 9 2015-12-08 23:52 usb-General_USB_Flash_Disk_0354120400059990-0:0 -> ../../sda
lrwxrwxrwx 1 root root 10 2015-12-08 23:52 usb-General_USB_Flash_Disk_0354120400059990-0:0-part1 -> ../../sda1
lrwxrwxrwx 1 root root 9 2015-12-08 23:52 wwn-0x5000c50063af85f3 -> ../../sdb
lrwxrwxrwx 1 root root 10 2015-12-08 23:52 wwn-0x5000c50063af85f3-part1 -> ../../sdb1
lrwxrwxrwx 1 root root 9 2015-12-08 23:52 wwn-0x5000c50065298f6e -> ../../sdc
lrwxrwxrwx 1 root root 10 2015-12-08 23:52 wwn-0x5000c50065298f6e-part1 -> ../../sdc1
root@Tower:~#

Relevant syslog:

Dec 3 04:00:26 Tower kernel: md: disk1 write error, sector=2917467848
Dec 3 04:00:55 Tower kernel: sd 2:0:0:0: [sdc] Unhandled error code
Dec 3 04:00:55 Tower kernel: sd 2:0:0:0: [sdc]
Dec 3 04:00:55 Tower kernel: Result: hostbyte=0x04 driverbyte=0x00
Dec 3 04:00:55 Tower kernel: sd 2:0:0:0: [sdc] CDB:
Dec 3 04:00:55 Tower kernel: cdb[0]=0x8a: 8a 00 00 00 00 00 ad e5 07 10 00 00 00 08 00 00
Dec 3 04:00:55 Tower kernel: end_request: I/O error, dev sdc, sector 2917467920
Dec 3 04:00:55 Tower kernel: md: disk1 write error, sector=2917467856
Dec 3 04:00:56 Tower kernel: sd 2:0:0:0: [sdc] Unhandled error code
Dec 3 04:00:56 Tower kernel: sd 2:0:0:0: [sdc]
Dec 3 04:00:56 Tower kernel: Result: hostbyte=0x04 driverbyte=0x00
Dec 3 04:00:56 Tower kernel: sd 2:0:0:0: [sdc] CDB:
Dec 3 04:00:56 Tower kernel: cdb[0]=0x8a: 8a 00 00 00 00 00 ad e5 08 70 00 00 00 08 00 00
Dec 3 04:00:56 Tower kernel: end_request: I/O error, dev sdc, sector 2917468272
Dec 3 04:00:56 Tower kernel: md: disk1 write error, sector=2917468208
Dec 3 04:01:55 Tower kernel: sd 2:0:0:0: [sdc] Unhandled error code
Dec 3 04:01:55 Tower kernel: sd 2:0:0:0: [sdc]
Dec 3 04:01:55 Tower kernel: Result: hostbyte=0x04 driverbyte=0x00
Dec 3 04:01:55 Tower kernel: sd 2:0:0:0: [sdc] CDB:
Dec 3 04:01:55 Tower kernel: cdb[0]=0x8a: 8a 00 00 00 00 00 ad e5 08 78 00 00 00 08 00 00
Dec 3 04:01:55 Tower kernel: end_request: I/O error, dev sdc, sector 2917468280
Dec 3 04:01:56 Tower kernel: md: disk1 write error, sector=2917468216
Dec 3 04:01:56 Tower kernel: sd 2:0:0:0: [sdc] Unhandled error code
Dec 3 04:01:56 Tower kernel: sd 2:0:0:0: [sdc]
Dec 3 04:01:56 Tower kernel: Result: hostbyte=0x04 driverbyte=0x00
Dec 3 04:01:56 Tower kernel: sd 2:0:0:0: [sdc] CDB:
Dec 3 04:01:56 Tower kernel: cdb[0]=0x8a: 8a 00 00 00 00 00 ad e5 08 80 00 00 00 08 00 00
Dec 3 04:01:56 Tower kernel: end_request: I/O error, dev sdc, sector 2917468288
Dec 3 04:01:56 Tower kernel: md: disk1 write error, sector=2917468224
Dec 3 04:01:56 Tower kernel: sd 2:0:0:0: [sdc] Unhandled error code
Dec 3 04:01:56 Tower kernel: sd 2:0:0:0: [sdc]
Dec 3 04:01:56 Tower kernel: Result: hostbyte=0x04 driverbyte=0x00
Dec 3 04:01:56 Tower kernel: sd 2:0:0:0: [sdc] CDB:
Dec 3 04:01:56 Tower kernel: cdb[0]=0x8a: 8a 00 00 00 00 00 ad e5 08 88 00 00 00 08 00 00
Dec 3 04:01:56 Tower kernel: end_request: I/O error, dev sdc, sector 2917468296
Dec 3 04:01:56 Tower kernel: md: disk1 write error, sector=2917468232
Dec 3 04:02:26 Tower kernel: sd 2:0:0:0: [sdc] Unhandled error code
Dec 3 04:02:26 Tower kernel: sd 2:0:0:0: [sdc]
Dec 3 04:02:26 Tower kernel: Result: hostbyte=0x04 driverbyte=0x00
Dec 3 04:02:26 Tower kernel: sd 2:0:0:0: [sdc] CDB:
Dec 3 04:02:26 Tower kernel: cdb[0]=0x8a: 8a 00 00 00 00 00 ad e5 08 90 00 00 00 08 00 00
Dec 3 04:02:26 Tower kernel: end_request: I/O error, dev sdc, sector 2917468304
Dec 3 04:02:26 Tower kernel: md: disk1 write error, sector=2917468240
Dec 3 04:02:26 Tower kernel: sd 2:0:0:0: [sdc] Unhandled error code
Dec 3 04:02:26 Tower kernel: sd 2:0:0:0: [sdc]
Dec 3 04:02:26 Tower kernel: Result: hostbyte=0x04 driverbyte=0x00
Dec 3 04:02:26 Tower kernel: sd 2:0:0:0: [sdc] CDB:
Dec 3 04:02:26 Tower kernel: cdb[0]=0x8a: 8a 00 00 00 00 00 ad e5 08 98 00 00 00 08 00 00
Dec 3 04:02:26 Tower kernel: end_request: I/O error, dev sdc, sector 2917468312
Dec 3 04:02:26 Tower kernel: md: disk1 write error, sector=2917468248
Dec 3 04:02:26 Tower kernel: sd 2:0:0:0: [sdc] Unhandled error code
Dec 3 04:02:26 Tower kernel: sd 2:0:0:0: [sdc]
Dec 3 04:02:26 Tower kernel: Result: hostbyte=0x04 driverbyte=0x00
Dec 3 04:02:26 Tower kernel: sd 2:0:0:0: [sdc] CDB:
Dec 3 04:02:26 Tower kernel: cdb[0]=0x8a: 8a 00 00 00 00 00 ad e5 08 d8 00 00 00 08 00 00
Dec 3 04:02:26 Tower kernel: end_request: I/O error, dev sdc, sector 2917468376
Dec 3 04:02:26 Tower kernel: md: disk1 write error, sector=2917468312
Dec 3 04:03:26 Tower kernel: sd 2:0:0:0: [sdc] Unhandled error code
Dec 3 04:03:26 Tower kernel: sd 2:0:0:0: [sdc]
Dec 3 04:03:26 Tower kernel: Result: hostbyte=0x04 driverbyte=0x00
Dec 3 04:03:26 Tower kernel: sd 2:0:0:0: [sdc] CDB:
Dec 3 04:03:26 Tower kernel: cdb[0]=0x8a: 8a 00 00 00 00 00 ad e5 08 e0 00 00 00 08 00 00
Dec 3 04:03:26 Tower kernel: end_request: I/O error, dev sdc, sector 2917468384
Dec 3 04:03:26 Tower kernel: md: disk1 write error, sector=2917468320
Dec 3 04:03:26 Tower kernel: sd 2:0:0:0: [sdc] Unhandled error code
Dec 3 04:03:26 Tower kernel: sd 2:0:0:0: [sdc]
Dec 3 04:03:26 Tower kernel: Result: hostbyte=0x04 driverbyte=0x00
Dec 3 04:03:26 Tower kernel: sd 2:0:0:0: [sdc] CDB:
Dec 3 04:03:26 Tower kernel: cdb[0]=0x8a: 8a 00 00 00 00 00 ad e5 08 e8 00 00 00 08 00 00
Dec 3 04:03:26 Tower kernel: end_request: I/O error, dev sdc, sector 2917468392
Dec 3 04:03:26 Tower kernel: md: disk1 write error, sector=2917468328
Dec 3 04:03:26 Tower kernel: sd 2:0:0:0: [sdc] Unhandled error code
Dec 3 04:03:26 Tower kernel: sd 2:0:0:0: [sdc]
Dec 3 04:03:26 Tower kernel: Result: hostbyte=0x04 driverbyte=0x00
Dec 3 04:03:26 Tower kernel: sd 2:0:0:0: [sdc] CDB:
Dec 3 04:03:26 Tower kernel: cdb[0]=0x8a: 8a 00 00 00 00 00 ad e5 09 40 00 00 00 08 00 00
Dec 3 04:03:26 Tower kernel: end_request: I/O error, dev sdc, sector 2917468480
Dec 3 04:03:27 Tower kernel: md: disk1 write error, sector=2917468416
Dec 3 04:03:56 Tower kernel: sd 2:0:0:0: [sdc] Unhandled error code
Dec 3 04:03:56 Tower kernel: sd 2:0:0:0: [sdc]
Dec 3 04:03:56 Tower kernel: Result: hostbyte=0x04 driverbyte=0x00
Dec 3 04:03:56 Tower kernel: sd 2:0:0:0: [sdc] CDB:
Dec 3 04:03:56 Tower kernel: cdb[0]=0x8a: 8a 00 00 00 00 00 ad e5 09 48 00 00 00 08 00 00
Dec 3 04:03:56 Tower kernel: end_request: I/O error, dev sdc, sector 2917468488
Dec 3 04:03:56 Tower kernel: md: disk1 write error, sector=2917468424
Dec 3 04:03:57 Tower kernel: sd 2:0:0:0: [sdc] Unhandled error code
Dec 3 04:03:57 Tower kernel: sd 2:0:0:0: [sdc]
Dec 3 04:03:57 Tower kernel: Result: hostbyte=0x04 driverbyte=0x00
Dec 3 04:03:57 Tower kernel: sd 2:0:0:0: [sdc] CDB:
Dec 3 04:03:57 Tower kernel: cdb[0]=0x8a: 8a 00 00 00 00 00 ad e5 09 50 00 00 00 08 00 00
Dec 3 04:03:57 Tower kernel: end_request: I/O error, dev sdc, sector 2917468496
Dec 3 04:03:57 Tower kernel: md: disk1 write error, sector=2917468432
Dec 3 04:03:57 Tower kernel: sd 2:0:0:0: [sdc] Unhandled error code
Dec 3 04:03:57 Tower kernel: sd 2:0:0:0: [sdc]
Dec 3 04:03:57 Tower kernel: Result: hostbyte=0x04 driverbyte=0x00
Dec 3 04:03:57 Tower kernel: sd 2:0:0:0: [sdc] CDB:
Dec 3 04:03:57 Tower kernel: cdb[0]=0x8a: 8a 00 00 00 00 00 ad e5 09 58 00 00 00 08 00 00
Dec 3 04:03:57 Tower kernel: end_request: I/O error, dev sdc, sector 2917468504
Dec 3 04:03:57 Tower kernel: md: disk1 write error, sector=2917468440
Dec 3 09:30:51 Tower dhcpcd[983]: eth0: renewing lease of 192.168.1.217
Dec 3 09:30:51 Tower dhcpcd[983]: eth0: acknowledged 192.168.1.217 from 192.168.1.1
Dec 3 09:30:51 Tower dhcpcd[983]: eth0: leased 192.168.1.217 for 86400 seconds
Dec 3 20:34:06 Tower dhcpcd[983]: eth0: renewing lease of 192.168.1.217
Dec 3 20:34:06 Tower dhcpcd[983]: eth0: acknowledged 192.168.1.217 from 192.168.1.1
Dec 3 20:34:06 Tower dhcpcd[983]: eth0: leased 192.168.1.217 for 86400 seconds
Dec 4 07:34:08 Tower dhcpcd[983]: eth0: renewing lease of 192.168.1.217
Dec 4 07:34:08 Tower dhcpcd[983]: eth0: acknowledged 192.168.1.217 from 192.168.1.1
Dec 4 07:34:08 Tower dhcpcd[983]: eth0: leased 192.168.1.217 for 86400 seconds
Dec 4 18:18:52 Tower dhcpcd[983]: eth0: renewing lease of 192.168.1.217
Dec 4 18:18:52 Tower dhcpcd[983]: eth0: acknowledged 192.168.1.217 from 192.168.1.1
Dec 4 18:18:52 Tower dhcpcd[983]: eth0: leased 192.168.1.217 for 86400 seconds
Dec 5 05:11:01 Tower dhcpcd[983]: eth0: renewing lease of 192.168.1.217
Dec 5 05:11:01 Tower dhcpcd[983]: eth0: acknowledged 192.168.1.217 from 192.168.1.1
Dec 5 05:11:01 Tower dhcpcd[983]: eth0: leased 192.168.1.217 for 86400 seconds
Dec 5 16:25:14 Tower dhcpcd[983]: eth0: renewing lease of 192.168.1.217
Dec 5 16:25:14 Tower dhcpcd[983]: eth0: acknowledged 192.168.1.217 from 192.168.1.1
Dec 5 16:25:14 Tower dhcpcd[983]: eth0: leased 192.168.1.217 for 86400 seconds
Dec 6 03:13:19 Tower dhcpcd[983]: eth0: renewing lease of 192.168.1.217
Dec 6 03:13:19 Tower dhcpcd[983]: eth0: acknowledged 192.168.1.217 from 192.168.1.1
Dec 6 03:13:19 Tower dhcpcd[983]: eth0: leased 192.168.1.217 for 86400 seconds
Dec 6 14:22:15 Tower dhcpcd[983]: eth0: renewing lease of 192.168.1.217
Dec 6 14:22:15 Tower dhcpcd[983]: eth0: acknowledged 192.168.1.217 from 192.168.1.1
Dec 6 14:22:15 Tower dhcpcd[983]: eth0: leased 192.168.1.217 for 86400 seconds
Dec 7 00:52:43 Tower dhcpcd[983]: eth0: renewing lease of 192.168.1.217
Dec 7 00:52:43 Tower dhcpcd[983]: eth0: acknowledged 192.168.1.217 from 192.168.1.1
Dec 7 00:52:44 Tower dhcpcd[983]: eth0: leased 192.168.1.217 for 86400 seconds
Dec 7 11:31:22 Tower dhcpcd[983]: eth0: renewing lease of 192.168.1.217
Dec 7 11:31:22 Tower dhcpcd[983]: eth0: acknowledged 192.168.1.217 from 192.168.1.1
Dec 7 11:31:22 Tower dhcpcd[983]: eth0: leased 192.168.1.217 for 86400 seconds
Dec 7 22:34:11 Tower dhcpcd[983]: eth0: renewing lease of 192.168.1.217
Dec 7 22:34:11 Tower dhcpcd[983]: eth0: acknowledged 192.168.1.217 from 192.168.1.1
Dec 7 22:34:11 Tower dhcpcd[983]: eth0: leased 192.168.1.217 for 86400 seconds
Dec 8 09:45:43 Tower dhcpcd[983]: eth0: renewing lease of 192.168.1.217
Dec 8 09:45:43 Tower dhcpcd[983]: eth0: acknowledged 192.168.1.217 from 192.168.1.1
Dec 8 09:45:43 Tower dhcpcd[983]: eth0: leased 192.168.1.217 for 86400 seconds
Dec 8 21:02:26 Tower dhcpcd[983]: eth0: renewing lease of 192.168.1.217
Dec 8 21:02:26 Tower dhcpcd[983]: eth0: acknowledged 192.168.1.217 from 192.168.1.1
Dec 8 21:02:26 Tower dhcpcd[983]: eth0: leased 192.168.1.217 for 86400 seconds
Dec 8 23:38:31 Tower in.telnetd[1326]: connect from 192.168.1.160 (192.168.1.160)
Dec 8 23:38:32 Tower login[1327]: ROOT LOGIN on '/dev/pts/0' from 'setsuna'
Dec 8 23:39:19 Tower kernel: program smartctl is using a deprecated SCSI ioctl, please convert it to SG_IO
Dec 8 23:39:27 Tower last message repeated 11 times
Dec 8 23:39:54 Tower kernel: sd 2:0:0:0: [sdc] Unhandled error code
Dec 8 23:39:54 Tower kernel: sd 2:0:0:0: [sdc]
Dec 8 23:39:54 Tower kernel: Result: hostbyte=0x04 driverbyte=0x00
Dec 8 23:39:54 Tower kernel: sd 2:0:0:0: [sdc] CDB:
Dec 8 23:39:54 Tower kernel: cdb[0]=0x88: 88 00 00 00 00 00 00 00 00 00 00 00 00 80 00 00
Dec 8 23:39:54 Tower kernel: end_request: I/O error, dev sdc, sector 0
Dec 8 23:39:54 Tower kernel: Buffer I/O error on device sdc, logical block 0
Dec 8 23:39:54 Tower kernel: Buffer I/O error on device sdc, logical block 1
Dec 8 23:39:54 Tower kernel: Buffer I/O error on device sdc, logical block 2
Dec 8 23:39:54 Tower kernel: Buffer I/O error on device sdc, logical block 3
Dec 8 23:39:54 Tower kernel: Buffer I/O error on device sdc, logical block 4
Dec 8 23:39:54 Tower kernel: Buffer I/O error on device sdc, logical block 5
Dec 8 23:39:54 Tower kernel: Buffer I/O error on device sdc, logical block 6
Dec 8 23:39:54 Tower kernel: Buffer I/O error on device sdc, logical block 7
Dec 8 23:39:54 Tower kernel: Buffer I/O error on device sdc, logical block 8
Dec 8 23:39:54 Tower kernel: Buffer I/O error on device sdc, logical block 9
Dec 8 23:39:54 Tower kernel: sd 2:0:0:0: [sdc] Unhandled error code
Dec 8 23:39:54 Tower kernel: sd 2:0:0:0: [sdc]
Dec 8 23:39:54 Tower kernel: Result: hostbyte=0x04 driverbyte=0x00
Dec 8 23:39:54 Tower kernel: sd 2:0:0:0: [sdc] CDB:
Dec 8 23:39:54 Tower kernel: cdb[0]=0x88: 88 00 00 00 00 00 00 00 00 00 00 00 00 08 00 00
Dec 8 23:39:54 Tower kernel: end_request: I/O error, dev sdc, sector 0
Dec 8 23:40:10 Tower kernel: program smartctl is using a deprecated SCSI ioctl, please convert it to SG_IO
Dec 8 23:41:45 Tower last message repeated 7 times

The last bit there is me trying to probe the drive with smartctl, but by then it was already "dead" and returning I/O errors. The syslog repeats the write errors shown above for a bit longer. I suspect reseating the drive will make it show up again, as usual.
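
For anyone reading the syslog above, here is a rough sketch of how the Result and CDB lines can be decoded. It assumes the standard 16-byte READ(16)/WRITE(16) layout (opcode in byte 0, big-endian LBA in bytes 2-9, transfer length in bytes 10-13) and the host byte values from the Linux kernel SCSI headers. The names `HOST_BYTES`, `OPCODES` and `decode_cdb16` are made up for this example.

```python
# Host byte values as defined in the Linux kernel SCSI headers (subset only).
HOST_BYTES = {
    0x00: "DID_OK",
    0x01: "DID_NO_CONNECT",
    0x02: "DID_BUS_BUSY",
    0x03: "DID_TIME_OUT",
    0x04: "DID_BAD_TARGET",
    0x05: "DID_ABORT",
}

# The two SCSI opcodes that appear in the syslog excerpt.
OPCODES = {0x88: "READ(16)", 0x8A: "WRITE(16)"}


def decode_cdb16(hex_bytes: str) -> dict:
    """Decode a 16-byte READ(16)/WRITE(16) CDB given as space-separated hex."""
    cdb = bytes.fromhex(hex_bytes)  # fromhex ignores the spaces
    if len(cdb) != 16:
        raise ValueError("expected a 16-byte CDB")
    return {
        "op": OPCODES.get(cdb[0], hex(cdb[0])),
        "lba": int.from_bytes(cdb[2:10], "big"),      # starting sector
        "blocks": int.from_bytes(cdb[10:14], "big"),  # sectors requested
    }


if __name__ == "__main__":
    # First failing write from the log.
    print(decode_cdb16("8a 00 00 00 00 00 ad e5 07 10 00 00 00 08 00 00"))
    print(HOST_BYTES[0x04])
```

The first write decodes to an 8-sector WRITE(16) at LBA 2917467920, which matches the end_request sector on the next log line. The md sector for the same error is 64 lower, which would fit a partition starting at sector 64, though that is a guess from the numbers rather than something the log states. hostbyte=0x04 maps to DID_BAD_TARGET, which reads more like the device vanishing from the bus than a media error reported by the drive itself.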
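December 9, 2015

I would suspect a disk problem; a healthy SMART report does not always mean a healthy disk. Swap the disk into a different slot on the Microserver, exchanging it with another disk. If it still fails in the new slot, it's the disk; if the errors stay with the original slot instead, suspect the bay or its cabling.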
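
When swapping slots, the device node (sdb, sdc) may change, so it helps to track the drive by serial number instead. Below is a small sketch that pulls the serial-to-device mapping out of an `ls -l /dev/disk/by-id/` listing like the one in the first post. The function name `serial_to_device` is hypothetical, and the pattern assumes the `ata-<model>_<serial>` naming seen in that listing.

```python
import re

# Matches whole-disk ata-* entries such as
#   ata-ST3000DM001-1CH166_Z1F40YWL -> ../../sdc
# Partition entries (ending in -part1) do not match and are skipped.
_ATA_LINK = re.compile(r"ata-\S+_(\w+) -> \.\./\.\./(\w+)")


def serial_to_device(listing: str) -> dict:
    """Map drive serial numbers to device nodes from a by-id listing."""
    return {serial: dev for serial, dev in _ATA_LINK.findall(listing)}


if __name__ == "__main__":
    sample = """\
lrwxrwxrwx 1 root root 9 2015-12-08 23:52 ata-ST3000DM001-1CH166_Z1F37YED -> ../../sdb
lrwxrwxrwx 1 root root 10 2015-12-08 23:52 ata-ST3000DM001-1CH166_Z1F37YED-part1 -> ../../sdb1
lrwxrwxrwx 1 root root 9 2015-12-08 23:52 ata-ST3000DM001-1CH166_Z1F40YWL -> ../../sdc
lrwxrwxrwx 1 root root 10 2015-12-08 23:52 ata-ST3000DM001-1CH166_Z1F40YWL-part1 -> ../../sdc1
"""
    print(serial_to_device(sample))
```

Run it on the listing before and after the swap: if the write errors follow serial Z1F40YWL to its new slot, that points at the disk, and if they stay with the old bay, that points at the bay.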