February 7, 2017 Hi all, I woke up this morning to find that one of the disks in my Unraid server (running 6.3.0) has been disabled. I hope someone can tell me what exactly to look for (failing disk, cable issue, something else?). Here is an extract of my syslog, which hopefully helps sort this out. Thanks for your time. *The data on the disk is accessible; I am currently copying it to another disk via shell.

Feb 7 03:54:06 LochNAS kernel: sas: Enter sas_scsi_recover_host busy: 1 failed: 1
Feb 7 03:54:06 LochNAS kernel: sas: trying to find task 0xffff8801ee3c6900
Feb 7 03:54:06 LochNAS kernel: sas: sas_scsi_find_task: aborting task 0xffff8801ee3c6900
Feb 7 03:54:06 LochNAS kernel: sas: sas_scsi_find_task: task 0xffff8801ee3c6900 is aborted
Feb 7 03:54:06 LochNAS kernel: sas: sas_eh_handle_sas_errors: task 0xffff8801ee3c6900 is aborted
Feb 7 03:54:06 LochNAS kernel: sas: ata14: end_device-1:5: cmd error handler
Feb 7 03:54:06 LochNAS kernel: sas: ata9: end_device-1:0: dev error handler
Feb 7 03:54:06 LochNAS kernel: sas: ata10: end_device-1:1: dev error handler
Feb 7 03:54:06 LochNAS kernel: sas: ata11: end_device-1:2: dev error handler
Feb 7 03:54:06 LochNAS kernel: sas: ata12: end_device-1:3: dev error handler
Feb 7 03:54:06 LochNAS kernel: sas: ata13: end_device-1:4: dev error handler
Feb 7 03:54:06 LochNAS kernel: sas: ata14: end_device-1:5: dev error handler
Feb 7 03:54:06 LochNAS kernel: sas: ata15: end_device-1:6: dev error handler
Feb 7 03:54:06 LochNAS kernel: ata14.00: exception Emask 0x0 SAct 0x10000000 SErr 0x0 action 0x6 frozen
Feb 7 03:54:06 LochNAS kernel: ata14.00: failed command: READ FPDMA QUEUED
Feb 7 03:54:06 LochNAS kernel: sas: ata16: end_device-1:7: dev error handler
Feb 7 03:54:06 LochNAS kernel: ata14.00: cmd 60/40:00:80:70:3b/00:00:39:00:00/40 tag 28 ncq dma 32768 in
Feb 7 03:54:06 LochNAS kernel: res 40/00:00:00:00:00/00:00:00:00:00/00 Emask 0x4 (timeout)
Feb 7 03:54:06 LochNAS kernel: ata14.00: status: { DRDY }
Feb 7 03:54:06 LochNAS kernel: ata14: hard resetting link
Feb 7 03:54:07 LochNAS kernel: sas: sas_form_port: phy1 belongs to port5 already(1)!
Feb 7 03:54:08 LochNAS kernel: drivers/scsi/mvsas/mv_sas.c 1435:mvs_I_T_nexus_reset for device[1]:rc= 0
Feb 7 03:54:14 LochNAS kernel: ata14.00: qc timeout (cmd 0xec)
Feb 7 03:54:14 LochNAS kernel: ata14.00: failed to IDENTIFY (I/O error, err_mask=0x4)
Feb 7 03:54:14 LochNAS kernel: ata14.00: revalidation failed (errno=-5)
Feb 7 03:54:14 LochNAS kernel: ata14: hard resetting link
Feb 7 03:54:14 LochNAS kernel: sas: sas_form_port: phy1 belongs to port5 already(1)!
Feb 7 03:54:16 LochNAS kernel: drivers/scsi/mvsas/mv_sas.c 1435:mvs_I_T_nexus_reset for device[1]:rc= 0
Feb 7 03:54:22 LochNAS kernel: ata14.00: qc timeout (cmd 0x27)
Feb 7 03:54:22 LochNAS kernel: ata14.00: failed to read native max address (err_mask=0x4)
Feb 7 03:54:22 LochNAS kernel: ata14.00: HPA support seems broken, skipping HPA handling
Feb 7 03:54:22 LochNAS kernel: ata14.00: revalidation failed (errno=-5)
Feb 7 03:54:22 LochNAS kernel: ata14: hard resetting link
Feb 7 03:54:22 LochNAS kernel: sas: sas_form_port: phy1 belongs to port5 already(1)!
Feb 7 03:54:24 LochNAS kernel: drivers/scsi/mvsas/mv_sas.c 1435:mvs_I_T_nexus_reset for device[1]:rc= 0
Feb 7 03:54:39 LochNAS kernel: ata14.00: qc timeout (cmd 0xef)
Feb 7 03:54:39 LochNAS kernel: ata14.00: failed to set xfermode (err_mask=0x4)
Feb 7 03:54:39 LochNAS kernel: ata14.00: disabled
Feb 7 03:54:39 LochNAS kernel: ata14: hard resetting link
Feb 7 03:54:39 LochNAS kernel: sas: sas_form_port: phy1 belongs to port5 already(1)!
Feb 7 03:54:41 LochNAS kernel: drivers/scsi/mvsas/mv_sas.c 1435:mvs_I_T_nexus_reset for device[1]:rc= 0
Feb 7 03:54:42 LochNAS kernel: ata14: EH complete
Feb 7 03:54:42 LochNAS kernel: sas: --- Exit sas_scsi_recover_host: busy: 0 failed: 1 tries: 1
Feb 7 03:54:42 LochNAS kernel: sd 1:0:5:0: [sdg] tag#2 UNKNOWN(0x2003) Result: hostbyte=0x04 driverbyte=0x00
Feb 7 03:54:42 LochNAS kernel: sd 1:0:5:0: [sdg] tag#2 CDB: opcode=0x88 88 00 00 00 00 00 39 3b 70 80 00 00 00 40 00 00
Feb 7 03:54:42 LochNAS kernel: blk_update_request: I/O error, dev sdg, sector 960196736
Feb 7 03:54:42 LochNAS kernel: md: disk11 read error, sector=960196672
Feb 7 03:54:42 LochNAS kernel: sd 1:0:5:0: [sdg] tag#4 UNKNOWN(0x2003) Result: hostbyte=0x04 driverbyte=0x00
Feb 7 03:54:42 LochNAS kernel: sd 1:0:5:0: [sdg] tag#4 CDB: opcode=0x35 35 00 00 00 00 00 00 00 00 00
Feb 7 03:54:42 LochNAS kernel: blk_update_request: I/O error, dev sdg, sector 0
Feb 7 03:54:42 LochNAS kernel: md: disk11 read error, sector=960196680
Feb 7 03:54:42 LochNAS kernel: md: disk11 read error, sector=960196688
Feb 7 03:54:42 LochNAS kernel: md: disk11 read error, sector=960196696
Feb 7 03:54:42 LochNAS kernel: md: disk11 read error, sector=960196704
Feb 7 03:54:42 LochNAS kernel: md: disk11 read error, sector=960196712
Feb 7 03:54:42 LochNAS kernel: md: disk11 read error, sector=960196720
Feb 7 03:54:42 LochNAS kernel: md: disk11 read error, sector=960196728
Feb 7 03:54:42 LochNAS kernel: sd 1:0:5:0: [sdg] tag#2 UNKNOWN(0x2003) Result: hostbyte=0x04 driverbyte=0x00
Feb 7 03:54:42 LochNAS kernel: sd 1:0:5:0: [sdg] tag#6 UNKNOWN(0x2003) Result: hostbyte=0x04 driverbyte=0x00
Feb 7 03:54:42 LochNAS kernel: sd 1:0:5:0: [sdg] tag#2 CDB: opcode=0x88 88 00 00 00 00 00 39 3b 70 c0 00 00 04 00 00 00
Feb 7 03:54:42 LochNAS kernel: sd 1:0:5:0: [sdg] tag#6 CDB: opcode=0x88 88 00 00 00 00 00 39 3b 7c c0 00 00 04 00 00 00
Feb 7 03:54:42 LochNAS kernel: blk_update_request: I/O error, dev sdg, sector 960199872
Feb 7 03:54:42 LochNAS kernel: blk_update_request: I/O error, dev sdg, sector 960196800
Feb 7 03:54:42 LochNAS kernel: md: disk11 read error, sector=960196736
Feb 7 03:54:42 LochNAS kernel: md: disk11 read error, sector=960199808
Feb 7 03:54:42 LochNAS kernel: md: disk11 read error, sector=960196744
Feb 7 03:54:42 LochNAS kernel: md: disk11 read error, sector=960199816
Feb 7 03:54:42 LochNAS kernel: md: disk11 read error, sector=960196752
Feb 7 03:54:42 LochNAS kernel: md: disk11 read error, sector=960199824
Feb 7 03:54:42 LochNAS kernel: md: disk11 read error, sector=960196760
Feb 7 03:54:42 LochNAS kernel: md: disk11 read error, sector=960199832
Feb 7 03:54:42 LochNAS kernel: md: disk11 read error, sector=960196768
Feb 7 03:54:42 LochNAS kernel: sd 1:0:5:0: [sdg] tag#10 UNKNOWN(0x2003) Result: hostbyte=0x04 driverbyte=0x00
Feb 7 03:54:42 LochNAS kernel: md: disk11 read error, sector=960199840
Feb 7 03:54:42 LochNAS kernel: sd 1:0:5:0: [sdg] tag#10 CDB: opcode=0x88 88 00 00 00 00 00 39 3b 84 c0 00 00 04 00 00 00
Feb 7 03:54:42 LochNAS kernel: blk_update_request: I/O error, dev sdg, sector 960201920
Feb 7 03:54:42 LochNAS kernel: sd 1:0:5:0: [sdg] tag#3 UNKNOWN(0x2003) Result: hostbyte=0x04 driverbyte=0x00
Feb 7 03:54:42 LochNAS kernel: sd 1:0:5:0: [sdg] tag#3 CDB: opcode=0x88 88 00 00 00 00 00 39 3b 74 c0 00 00 04 00 00 00
Feb 7 03:54:42 LochNAS kernel: blk_update_request: I/O error, dev sdg, sector 960197824
Feb 7 03:54:42 LochNAS kernel: md: disk11 read error, sector=960196776
Feb 7 03:54:42 LochNAS kernel: md: disk11 read error, sector=960199848
Feb 7 03:54:42 LochNAS kernel: md: disk11 read error, sector=960196784
Feb 7 03:54:42 LochNAS kernel: md: disk11 read error, sector=960199856
Feb 7 03:54:42 LochNAS kernel: md: disk11 read error, sector=960196792
Feb 7 03:54:42 LochNAS kernel: md: disk11 read error, sector=960199864
Feb 7 03:54:42 LochNAS kernel: md: disk11 read error, sector=960196800
Feb 7 03:54:42 LochNAS kernel: md: disk11 read error, sector=960199872
Feb 7 03:54:42 LochNAS kernel: md: disk11 read error, sector=960196808
Feb 7 03:54:42 LochNAS kernel: md: disk11 read error, sector=960199880
Feb 7 03:54:42 LochNAS kernel: md: disk11 read error, sector=960196816
Feb 7 03:54:42 LochNAS kernel: sd 1:0:5:0: [sdg] tag#15 UNKNOWN(0x2003) Result: hostbyte=0x04 driverbyte=0x00
Feb 7 03:54:42 LochNAS kernel: sd 1:0:5:0: [sdg] tag#15 CDB: opcode=0x88 88 00 00 00 00 00 39 3b 8c c0 00 00 04 00 00 00
Feb 7 03:54:42 LochNAS kernel: blk_update_request: I/O error, dev sdg, sector 960203968
Feb 7 03:54:42 LochNAS kernel: md: disk11 read error, sector=960199888
Feb 7 03:54:42 LochNAS kernel: sd 1:0:5:0: [sdg] tag#5 UNKNOWN(0x2003) Result: hostbyte=0x04 driverbyte=0x00
Feb 7 03:54:42 LochNAS kernel: sd 1:0:5:0: [sdg] tag#5 CDB: opcode=0x88 88 00 00 00 00 00 39 3b 78 c0 00 00 04 00 00 00
Feb 7 03:54:42 LochNAS kernel: md: disk11 read error, sector=960196824
Feb 7 03:54:42 LochNAS kernel: blk_update_request: I/O error, dev sdg, sector 960198848
Feb 7 03:54:42 LochNAS kernel: md: disk11 read error, sector=960199896
Feb 7 03:54:42 LochNAS kernel: md: disk11 read error, sector=960196832
Feb 7 03:54:42 LochNAS kernel: md: disk11 read error, sector=960199904
Feb 7 03:54:42 LochNAS kernel: md: disk11 read error, sector=960196840
Feb 7 03:54:42 LochNAS kernel: md: disk11 read error, sector=960199912
Feb 7 03:54:42 LochNAS kernel: md: disk11 read error, sector=960196848
Feb 7 03:54:42 LochNAS kernel: md: disk11 read error, sector=960199920
Feb 7 03:54:42 LochNAS kernel: md: disk11 read error, sector=960196856
Feb 7 03:54:42 LochNAS kernel: sd 1:0:5:0: [sdg] tag#17 UNKNOWN(0x2003) Result: hostbyte=0x04 driverbyte=0x00
Feb 7 03:54:42 LochNAS kernel: md: disk11 read error, sector=960199928
Feb 7 03:54:42 LochNAS kernel: sd 1:0:5:0: [sdg] tag#17 CDB: opcode=0x88 88 00 00 00 00 00 39 3b 94 c0 00 00 03 c0 00 00
Feb 7 03:54:42 LochNAS kernel: blk_update_request: I/O error, dev sdg, sector 960206016
Feb 7 03:54:42 LochNAS kernel: md: disk11 read error, sector=960196864
Feb 7 03:54:42 LochNAS kernel: md: disk11 read error, sector=960199936
Feb 7 03:54:42 LochNAS kernel: sd 1:0:5:0: [sdg] tag#8 UNKNOWN(0x2003) Result: hostbyte=0x04 driverbyte=0x00
Feb 7 03:54:42 LochNAS kernel: sd 1:0:5:0: [sdg] tag#8 CDB: opcode=0x88 88 00 00 00 00 00 39 3b 80 c0 00 00 04 00 00 00
Feb 7 03:54:42 LochNAS kernel: blk_update_request: I/O error, dev sdg, sector 960200896
Feb 7 03:54:42 LochNAS kernel: md: disk11 read error, sector=960196872
Feb 7 03:54:42 LochNAS kernel: md: disk11 read error, sector=960199944
Feb 7 03:54:42 LochNAS kernel: md: disk11 read error, sector=960196880
Feb 7 03:54:42 LochNAS kernel: md: disk11 read error, sector=960199952
Feb 7 03:54:42 LochNAS kernel: md: disk11 read error, sector=960196888
Feb 7 03:54:42 LochNAS kernel: md: disk11 read error, sector=960199960
Feb 7 03:54:42 LochNAS kernel: md: disk11 read error, sector=960196896
Feb 7 03:54:42 LochNAS kernel: sd 1:0:5:0: [sdg] Read Capacity(16) failed: Result: hostbyte=0x04 driverbyte=0x00
Feb 7 03:54:42 LochNAS kernel: sd 1:0:5:0: [sdg] Sense not available.
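
For anyone trying to line up the `CDB:` lines with the `blk_update_request` sector numbers in the extract above, here is a minimal Python sketch that decodes a READ(16) command block (opcode 0x88). The function name `decode_read16` is made up for illustration, and the 512-byte sector size is an assumption, though it agrees with the `ncq dma 32768` figure logged for the 0x40-sector read.

```python
from typing import Tuple

SECTOR_BYTES = 512  # assumption: 512-byte logical sectors


def decode_read16(cdb_hex: str) -> Tuple[int, int]:
    """Return (starting LBA, sector count) for a READ(16) CDB.

    cdb_hex is the space-separated hex string the kernel prints
    after "CDB: opcode=0x88".
    """
    cdb = bytes.fromhex(cdb_hex)
    if len(cdb) != 16 or cdb[0] != 0x88:
        raise ValueError("not a 16-byte READ(16) CDB")
    lba = int.from_bytes(cdb[2:10], "big")     # bytes 2..9: logical block address
    count = int.from_bytes(cdb[10:14], "big")  # bytes 10..13: transfer length
    return lba, count


# First failed read from the log extract (tag#2).
lba, count = decode_read16("88 00 00 00 00 00 39 3b 70 80 00 00 00 40 00 00")
print(lba, count, count * SECTOR_BYTES)  # 960196736 64 32768
```

The decoded LBA matches the first `I/O error, dev sdg, sector 960196736` line. The first `md: disk11 read error` is reported 64 sectors lower (960196672), which would be consistent with the md layer counting from a partition that starts at sector 64, though that is an inference from the numbers rather than something the log states.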
February 7, 2017 Author Hey, thanks for your reply. Attaching diagnostics.zip now. lochnas-diagnostics-20170207-1910.zip
February 7, 2017 Yes, unfortunately, because of the bug in 6.3.0 there are no SMART reports. You have a SAS2LP, and the issue was most likely caused by the controller; it's a rather common error lately. Reboot and check that the SMART report for disk11 looks good (or upload it here if you're not sure). If SMART looks good, disable VT-d if you don't use it, check for a board BIOS update, maybe try the controller in another slot if one is available, and then rebuild to the same disk.
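
One rough way to see whether errors are confined to a single disk or spread across several ports on the controller is to tally them per device from the syslog. The following is only a sketch: the function name `tally_errors` and the three patterns are illustrative choices based on the messages quoted in this thread, and a tally like this cannot by itself tell a controller fault from a disk or cable fault.

```python
import re
from collections import Counter
from typing import Iterable

# Patterns modeled on the kernel messages quoted in this thread.
PATTERNS = (
    re.compile(r"(ata\d+)\.\d+: qc timeout"),
    re.compile(r"blk_update_request: I/O error, dev (\w+),"),
    re.compile(r"md: (disk\d+) read error"),
)


def tally_errors(lines: Iterable[str]) -> Counter:
    """Count error messages per ATA port, block device, and array disk."""
    counts = Counter()
    for line in lines:
        for pattern in PATTERNS:
            match = pattern.search(line)
            if match:
                counts[match.group(1)] += 1
                break  # each log line matches at most one pattern
    return counts


# A few lines copied from the syslog extract in the first post.
sample = [
    "Feb 7 03:54:14 LochNAS kernel: ata14.00: qc timeout (cmd 0xec)",
    "Feb 7 03:54:22 LochNAS kernel: ata14.00: qc timeout (cmd 0x27)",
    "Feb 7 03:54:42 LochNAS kernel: blk_update_request: I/O error, dev sdg, sector 960196736",
    "Feb 7 03:54:42 LochNAS kernel: md: disk11 read error, sector=960196672",
    "Feb 7 03:54:42 LochNAS kernel: ata14: EH complete",
]
for device, hits in sorted(tally_errors(sample).items()):
    print(device, hits)
```

To run it over a full log, pass an open file object instead of `sample`; the location of the syslog file depends on the system and is not given in this thread.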
February 9, 2017 Author Hi again, I finally got around to following your suggestions. I flashed the latest firmware I could find on the Supermicro website, disabled VT-d and updated to Unraid 6.3.1. When I restarted the server, the drive was present and (obviously) still marked with a red X. I then tried to start the short S.M.A.R.T. self-test, but got an error stating that a mandatory command failed. When I checked the "Main" tab again, the drive was suddenly marked as missing. I rebooted again; the drive was present once more and I could see the S.M.A.R.T. information for it. I'm attaching it now. Thanks for your help. lochnas-smart-20170209-1801.zip
February 9, 2017 Author Okay. I'll see if I have another slot available for the controller. This issue is very concerning.