September 27, 2012

Hi All,

Woke up this morning and my cache drive had stopped. I don't use the mover; the drive is only on there for my torrents.

Sep 28 03:37:06 Tower kernel: sd 8:0:0:0: [sdo] Result: hostbyte=0x04 driverbyte=0x00 (System)
Sep 28 03:37:06 Tower kernel: sd 8:0:0:0: [sdo] CDB: cdb[0]=0x28: 28 00 39 dd b4 10 00 00 18 00 (Drive related)
Sep 28 03:37:06 Tower kernel: end_request: I/O error, dev sdo, sector 970830864 (Errors)
Sep 28 03:37:06 Tower kernel: sd 8:0:0:0: [sdo] Unhandled error code (Errors)
Sep 28 03:37:06 Tower kernel: sd 8:0:0:0: [sdo] Result: hostbyte=0x04 driverbyte=0x00 (System)
Sep 28 03:37:06 Tower kernel: sd 8:0:0:0: [sdo] CDB: cdb[0]=0x28: 28 00 39 dd b4 10 00 00 18 00 (Drive related)
Sep 28 03:37:06 Tower kernel: end_request: I/O error, dev sdo, sector 970830864 (Errors)
Sep 28 03:37:06 Tower kernel: sd 8:0:0:0: [sdo] Unhandled error code (Errors)
Sep 28 03:37:06 Tower kernel: sd 8:0:0:0: [sdo] Result: hostbyte=0x04 driverbyte=0x00 (System)
Sep 28 03:37:06 Tower kernel: sd dled error code (Errors)
Sep 28 03:37:06 Tower kernel: sd 8:0:0:0: [sdo] Result: hostbyte=0x04 driverbyte=0x00 (System)
Sep 28 03:37:06 Tower kernel: sd 8:0:0:0: [sdo] CDB: cdb[0]=0x28: 28 00 39 dd b4 10 00 00 18 00 (Drive related)
Sep 28 03:37:06 Tower kernel: end_request: I/O error, dev sdo, sector 970830864 (Errors)
Sep 28 03:37:06 Tower kernel: sd 8:0:0:0: [sdo] Unhandled error code (Errors)
Sep 28 03:37:06 Tower kernel: sd 8:0:0:0: [sdo] Result: hostbyte=0x04 driverbyte=0x00 (System)
Sep 28 03:37:06 Tower kernel: sd 8:0:0:0: [sdo] CDB: cdb[0]=0x28: 28 00 39 dd b4 10 00 00 18 00 (Drive related)
Sep 28 03:37:06 Tower kernel: end_request: I/O error, dev sdo, sector 970830864 (Errors)
Sep 28 03:37:06 Tower kernel: sd 8:0:0:0: [sdo] Unhandled error code (Errors)
Sep 28 03:37:06 Tower kernel: sd 8:0:0:0: [sdo] Result: hostbyte=0x04 driverbyte=0x00 (System)
Sep 28 03:37:06 Tower kernel: sd 8:0:0:0: [sdo] CDB: cdb[0]=0x28: 28 00 39 dd b4 10 00 00 18 00 (Drive related)
Sep 28 03:37:06 Tower kernel: end_request: I/O error, dev sdo, sector 970830864 (Errors)
Sep 28 03:37:06 Tower kernel: sd 8:0:0:0: [sdo] Unhandled error code (Errors)
Sep 28 03:37:06 Tower kernel: sd 8:0:0:0: [sdo] Result: hostbyte=0x04 driverbyte=0x00 (System)
Sep 28 03:37:06 Tower kernel: sd 8:0:0:0: [sdo] CDB: cdb[0]=0x28: 28 00 39 dd b4 10 00 00 18 00 (Drive related)
Sep 28 03:37:06 Tower kernel: end_request: I/O error, dev sdo, sector 970830864 (Errors)
Sep 28 03:37:06 Tower kernel: sd 8:0:0:0: [sdo] Unhandled error code (Errors)
Sep 28 03:37:06 Tower kernel: sd 8:0:0:0: [sdo] Result: hostbyte=0x04 driverbyte=0x00 (System)
Sep 28 03:37:06 Tower kernel: sd 8:0:0:0: [sdo] CDB: cdb[0]=0x28: 28 00 39 dd b4 10 00 00 18 00 (Drive related)
Sep 28 03:37:06 Tower kernel: end_request: I/O error, dev sdo, sector 970830864 (Errors)
Sep 28 03:37:06 Tower kernel: sd 8:0:0:0: [sdo] Unhandled error code (Errors)
Sep 28 03:37:06 Tower kernel: sd 8:0:0:0: [sdo] Result: hostbyte=0x04 driverbyte=0x00 (System)
Sep 28 03:37:06 Tower kernel: sd 8:0:0:0: [sdo] CDB: cdb[0]=0x28: 28 00 39 dd b4 10 00 00 18 00 (Drive related)

You can see that it kept repeating itself. I couldn't get the full syslog, as I could access the drive but not copy anything off it. I've restarted the server and the drive seems to have come up all right, but I just wanted to check that it wasn't a drive failing, and what it might be.

Thanks,
Josh
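As a side note on reading those lines: opcode 0x28 is the SCSI READ(10) command, whose bytes 2-5 hold the starting block address and bytes 7-8 the transfer length. A minimal sketch of decoding it follows; the function name `decode_read10` is made up for illustration. It only shows that every repeated entry refers to the same read request, and says nothing by itself about whether the drive is failing.

```python
def decode_read10(cdb_hex: str) -> dict:
    """Decode a 10-byte SCSI READ(10) CDB given as space-separated hex bytes."""
    cdb = bytes.fromhex(cdb_hex)
    if len(cdb) != 10 or cdb[0] != 0x28:
        raise ValueError("not a READ(10) CDB")
    return {
        "opcode": cdb[0],
        # Bytes 2..5: logical block address, big-endian.
        "lba": int.from_bytes(cdb[2:6], "big"),
        # Bytes 7..8: number of blocks to transfer, big-endian.
        "blocks": int.from_bytes(cdb[7:9], "big"),
    }


info = decode_read10("28 00 39 dd b4 10 00 00 18 00")
print(info["lba"], info["blocks"])  # 970830864 24
```

The decoded address, 970830864, is the same sector number reported in the `end_request: I/O error` lines.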
September 27, 2012

Unfortunately, nothing can be determined from the segment above (except which drive was affected, of course). Those particular errors are part of the aftermath and occur long after the real errors. It's the real errors we need to see, especially the very first one. You cannot even point to a likely suspect here, because a resulting I/O error can be caused by true drive issues, by non-drive issues (power problems, loose or bad cables, controller or port faults, etc.), or even by a random transient event such as a power spike. Monitor the drive closely, and try to capture as complete a syslog as possible if you detect another issue with the drive.
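If a fuller syslog can be captured next time, something like the following could help pull out the earliest line for the affected device and collapse the repeats. It is only a sketch: the function name `summarize_device_lines` is invented here, and it assumes the classic "Sep 28 03:37:06 host ..." timestamp prefix seen in the excerpt.

```python
import re
from collections import Counter

# Matches a leading "Sep 28 03:37:06 " style timestamp so repeats compare equal.
TIMESTAMP = re.compile(r"^[A-Z][a-z]{2}\s+\d+\s+\d\d:\d\d:\d\d\s+")


def summarize_device_lines(lines, device):
    """Return (first_line, counts) for syslog lines that mention the device.

    first_line is the earliest matching line (None if there is none);
    counts maps each distinct message, timestamp removed, to its frequency.
    """
    mention = re.compile(rf"\b{re.escape(device)}\b")
    first = None
    counts = Counter()
    for line in lines:
        line = line.rstrip("\n")
        if not mention.search(line):
            continue
        if first is None:
            first = line
        counts[TIMESTAMP.sub("", line)] += 1
    return first, counts


sample = [
    "Sep 28 03:30:00 Tower kernel: unrelated line",
    "Sep 28 03:37:06 Tower kernel: sd 8:0:0:0: [sdo] Unhandled error code",
    "Sep 28 03:37:06 Tower kernel: end_request: I/O error, dev sdo, sector 970830864",
    "Sep 28 03:37:07 Tower kernel: sd 8:0:0:0: [sdo] Unhandled error code",
]
first, counts = summarize_device_lines(sample, "sdo")
print(first)
for message, n in counts.most_common():
    print(n, message)
```

The same function should work on the lines of a saved syslog file; the first line it returns is the one worth posting, together with the lines just before it.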
September 27, 2012 Author

Thanks, I'll have to keep an eye on it. It's only running as a cache drive with data that can be lost. I couldn't get any more of the syslog: there was nothing in /var/log for the current one, just lots of older ones, and I had trouble downloading the syslog. I also got this email afterwards, but I don't know if it means much either.

Subject: unRaid Status WARNING - Array Not Started

Status update for unRAID Tower
- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
Status: WARNING - unRAID Array NOT STARTED
Server Name: Tower
Server IP: 192.168.80.2
Date: Fri Sep 28 03:47:05 EST 2012

Output of /proc/mdcmd:
- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
sbName=/boot/config/super.dat sbVersion=0.95.4 sbCreated=1284792054 sbUpdated=1348767753 sbEvents=335 sbState=1 sbNumDisks=21 sbSynced=1346450223 sbSyncErrs=0
mdVersion=1.1.1 mdState=STOPPED mdNumProtected=14 mdNumDisabled=0 mdDisabledDisk=0 mdNumInvalid=0 mdInvalidDisk=0 mdNumMissing=0 mdMissingDisk=0 mdNumNew=0 mdResync=0

diskNumber.0=0 diskName.0= diskSize.0=1953514552 diskState.0=7 diskModel.0=WDC WD20EARS-00M diskSerial.0=WD-WMAZA3557461 diskId.0=WDC_WD20EARS-00M_WD-WMAZA3557461
rdevNumber.0=0 rdevStatus.0=DISK_OK rdevName.0=sdk rdevSize.0=1953514552 rdevModel.0=WDC WD20EARS-00M rdevSerial.0=WD-WMAZA3557461 rdevId.0=WDC_WD20EARS-00M_WD-WMAZA3557461 rdevNumErrors.0=0 rdevLastIO.0=1348767753 rdevSpinupGroup.0=64

diskNumber.1=1 diskName.1=md1 diskSize.1=1465138552 diskState.1=7 diskModel.1=SAMSUNG HD154UI diskSerial.1=S1XWJDWZ600768 diskId.1=SAMSUNG_HD154UI_S1XWJDWZ600768
rdevNumber.1=1 rdevStatus.1=DISK_OK rdevName.1=sdp rdevSize.1=1465138552 rdevModel.1=SAMSUNG HD154UI rdevSerial.1=S1XWJDWZ600768 rdevId.1=SAMSUNG_HD154UI_S1XWJDWZ600768 rdevNumErrors.1=0 rdevLastIO.1=1348767718 rdevSpinupGroup.1=0

diskNumber.2=2 diskName.2=md2 diskSize.2=1465138552 diskState.2=7 diskModel.2=SAMSUNG HD154UI diskSerial.2=S1XWJDWZ801337 diskId.2=SAMSUNG_HD154UI_S1XWJDWZ801337
rdevNumber.2=2 rdevStatus.2=DISK_OK rdevName.2=sdm rdevSize.2=1465138552 rdevModel.2=SAMSUNG HD154UI rdevSerial.2=S1XWJDWZ801337 rdevId.2=SAMSUNG_HD154UI_S1XWJDWZ801337 rdevNumErrors.2=0 rdevLastIO.2=1348767729 rdevSpinupGroup.2=0

diskNumber.3=3 diskName.3=md3 diskSize.3=1465138552 diskState.3=7 diskModel.3=SAMSUNG HD154UI diskSerial.3=S1XWJ9CZ700336 diskId.3=SAMSUNG_HD154UI_S1XWJ9CZ700336
rdevNumber.3=3 rdevStatus.3=DISK_OK rdevName.3=sdl rdevSize.3=1465138552 rdevModel.3=SAMSUNG HD154UI rdevSerial.3=S1XWJ9CZ700336 rdevId.3=SAMSUNG_HD154UI_S1XWJ9CZ700336 rdevNumErrors.3=0 rdevLastIO.3=1348767730 rdevSpinupGroup.3=0

diskNumber.4=4 diskName.4=md4 diskSize.4=1465138552 diskState.4=7 diskModel.4=SAMSUNG HD154UI diskSerial.4=S1XWJDWZ602362 diskId.4=SAMSUNG_HD154UI_S1XWJDWZ602362
rdevNumber.4=4 rdevStatus.4=DISK_OK rdevName.4=sdb rdevSize.4=1465138552 rdevModel.4=SAMSUNG HD154UI rdevSerial.4=S1XWJDWZ602362 rdevId.4=SAMSUNG_HD154UI_S1XWJDWZ602362 rdevNumErrors.4=0 rdevLastIO.4=1348767740 rdevSpinupGroup.4=0

diskNumber.5=5 diskName.5=md5 diskSize.5=1465138552 diskState.5=7 diskModel.5=SAMSUNG HD154UI diskSerial.5=S1XWJDWZ602350 diskId.5=SAMSUNG_HD154UI_S1XWJDWZ602350
rdevNumber.5=5 rdevStatus.5=DISK_OK rdevName.5=sdd rdevSize.5=1465138552 rdevModel.5=SAMSUNG HD154UI rdevSerial.5=S1XWJDWZ602350 rdevId.5=SAMSUNG_HD154UI_S1XWJDWZ602350 rdevNumErrors.5=0 rdevLastIO.5=1348767741 rdevSpinupGroup.5=1052544

diskNumber.6=6 diskName.6=md6 diskSize.6=1465138552 diskState.6=7 diskModel.6=SAMSUNG HD154UI diskSerial.6=S1XWJDWZ600765 diskId.6=SAMSUNG_HD154UI_S1XWJDWZ600765
rdevNumber.6=6 rdevStatus.6=DISK_OK rdevName.6=sdn rdevSize.6=1465138552 rdevModel.6=SAMSUNG HD154UI rdevSerial.6=S1XWJDWZ600765 rdevId.6=SAMSUNG_HD154UI_S1XWJDWZ600765 rdevNumErrors.6=0 rdevLastIO.6=1348767752 rdevSpinupGroup.6=1

diskNumber.7=7 diskName.7=md7 diskSize.7=1465138552 diskState.7=7 diskModel.7=SAMSUNG HD154UI diskSerial.7=S1XWJ9BZA01291 diskId.7=SAMSUNG_HD154UI_S1XWJ9BZA01291
rdevNumber.7=7 rdevStatus.7=DISK_OK rdevName.7=sdc rdevSize.7=1465138552 rdevModel.7=SAMSUNG HD154UI rdevSerial.7=S1XWJ9BZA01291 rdevId.7=SAMSUNG_HD154UI_S1XWJ9BZA01291 rdevNumErrors.7=0 rdevLastIO.7=1348767752 rdevSpinupGroup.7=1052448

diskNumber.8=8 diskName.8=md8 diskSize.8=1953514552 diskState.8=7 diskModel.8=WDC WD20EARS-00M diskSerial.8=WD-WCAZA5834918 diskId.8=WDC_WD20EARS-00M_WD-WCAZA5834918
rdevNumber.8=8 rdevStatus.8=DISK_OK rdevName.8=sdg rdevSize.8=1953514552 rdevModel.8=WDC WD20EARS-00M rdevSerial.8=WD-WCAZA5834918 rdevId.8=WDC_WD20EARS-00M_WD-WCAZA5834918 rdevNumErrors.8=0 rdevLastIO.8=1348767753 rdevSpinupGroup.8=1052320

diskNumber.9=9 diskName.9=md9 diskSize.9=1953514552 diskState.9=7 diskModel.9=WDC WD20EARS-00M diskSerial.9=WD-WCAZA3920129 diskId.9=WDC_WD20EARS-00M_WD-WCAZA3920129
rdevNumber.9=9 rdevStatus.9=DISK_OK rdevName.9=sdh rdevSize.9=1953514552 rdevModel.9=WDC WD20EARS-00M rdevSerial.9=WD-WCAZA3920129 rdevId.9=WDC_WD20EARS-00M_WD-WCAZA3920129 rdevNumErrors.9=0 rdevLastIO.9=1348767753 rdevSpinupGroup.9=1052064

diskNumber.10=10 diskName.10=md10 diskSize.10=1953514552 diskState.10=7 diskModel.10=Hitachi HDS5C302 diskSerial.10=ML4220F31BANMR diskId.10=Hitachi_HDS5C302_ML4220F31BANMR
rdevNumber.10=10 rdevStatus.10=DISK_OK rdevName.10=sdj rdevSize.10=1953514552 rdevModel.10=Hitachi HDS5C302 rdevSerial.10=ML4220F31BANMR rdevId.10=Hitachi_HDS5C302_ML4220F31BANMR rdevNumErrors.10=0 rdevLastIO.10=1348767718 rdevSpinupGroup.10=1051552

diskNumber.11=11 diskName.11=md11 diskSize.11=1953514552 diskState.11=7 diskModel.11=Hitachi HDS5C302 diskSerial.11=ML4220F31BAR1R diskId.11=Hitachi_HDS5C302_ML4220F31BAR1R
rdevNumber.11=11 rdevStatus.11=DISK_OK rdevName.11=sdf rdevSize.11=1953514552 rdevModel.11=Hitachi HDS5C302 rdevSerial.11=ML4220F31BAR1R rdevId.11=Hitachi_HDS5C302_ML4220F31BAR1R rdevNumErrors.11=0 rdevLastIO.11=1348767718 rdevSpinupGroup.11=1050528

diskNumber.12=12 diskName.12=md12 diskSize.12=976762552 diskState.12=7 diskModel.12=WDC WD10EAVS-00D diskSerial.12=WD-WCAU49872251 diskId.12=WDC_WD10EAVS-00D_WD-WCAU49872251
rdevNumber.12=12 rdevStatus.12=DISK_OK rdevName.12=sdi rdevSize.12=976762552 rdevModel.12=WDC WD10EAVS-00D rdevSerial.12=WD-WCAU49872251 rdevId.12=WDC_WD10EAVS-00D_WD-WCAU49872251 rdevNumErrors.12=0 rdevLastIO.12=1348767719 rdevSpinupGroup.12=0

diskNumber.13=13 diskName.13= diskSize.13=0 diskState.13=0 diskModel.13= diskSerial.13= diskId.13=
rdevNumber.13=13 rdevStatus.13=DISK_NP rdevName.13= rdevSize.13=0 rdevModel.13= rdevSerial.13= rdevId.13= rdevNumErrors.13=0 rdevLastIO.13=1346468744 rdevSpinupGroup.13=0

diskNumber.14=14 diskName.14= diskSize.14=0 diskState.14=0 diskModel.14= diskSerial.14= diskId.14=
rdevNumber.14=14 rdevStatus.14=DISK_NP rdevName.14= rdevSize.14=0 rdevModel.14= rdevSerial.14= rdevId.14= rdevNumErrors.14=0 rdevLastIO.14=0 rdevSpinupGroup.14=0

diskNumber.15=15 diskName.15= diskSize.15=0 diskState.15=0 diskModel.15= diskSerial.15= diskId.15=
rdevNumber.15=15 rdevStatus.15=DISK_NP rdevName.15= rdevSize.15=0 rdevModel.15= rdevSerial.15= rdevId.15= rdevNumErrors.15=0 rdevLastIO.15=0 rdevSpinupGroup.15=0

diskNumber.16=16 diskName.16= diskSize.16=0 diskState.16=0 diskModel.16= diskSerial.16= diskId.16=
rdevNumber.16=16 rdevStatus.16=DISK_NP rdevName.16= rdevSize.16=0 rdevModel.16= rdevSerial.16= rdevId.16= rdevNumErrors.16=0 rdevLastIO.16=0 rdevSpinupGroup.16=0

diskNumber.17=17 diskName.17= diskSize.17=0 diskState.17=0 diskModel.17= diskSerial.17= diskId.17=
rdevNumber.17=17 rdevStatus.17=DISK_NP rdevName.17= rdevSize.17=0 rdevModel.17= rdevSerial.17= rdevId.17= rdevNumErrors.17=0 rdevLastIO.17=0 rdevSpinupGroup.17=0

diskNumber.18=18 diskName.18= diskSize.18=0 diskState.18=0 diskModel.18= diskSerial.18= diskId.18=
rdevNumber.18=18 rdevStatus.18=DISK_NP rdevName.18= rdevSize.18=0 rdevModel.18= rdevSerial.18= rdevId.18= rdevNumErrors.18=0 rdevLastIO.18=0 rdevSpinupGroup.18=0

diskNumber.19=19 diskName.19= diskSize.19=0 diskState.19=0 diskModel.19= diskSerial.19= diskId.19=
rdevNumber.19=19 rdevStatus.19=DISK_NP rdevName.19= rdevSize.19=0 rdevModel.19= rdevSerial.19= rdevId.19= rdevNumErrors.19=0 rdevLastIO.19=0 rdevSpinupGroup.19=0

diskNumber.20=20 diskName.20=md20 diskSize.20=29313112 diskState.20=7 diskModel.20=KINGSTON SSDNOW diskSerial.20=60UA5067K04K diskId.20=KINGSTON_SSDNOW_60UA5067K04K
rdevNumber.20=20 rdevStatus.20=DISK_OK rdevName.20=sde rdevSize.20=29313112 rdevModel.20=KINGSTON SSDNOW rdevSerial.20=60UA5067K04K rdevId.20=KINGSTON_SSDNOW_60UA5067K04K rdevNumErrors.20=0 rdevLastIO.20=1348767729 rdevSpinupGroup.20=4000

Thanks,
Josh
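For anyone who wants to check a dump like the one in that email without reading it by eye, here is a rough sketch that groups the key=value pairs by slot and lists any slot whose status is neither DISK_OK nor DISK_NP or whose error count is non-zero. The names `parse_mdcmd` and `problem_slots` are invented for this example, and treating DISK_NP as "nothing to report" is an assumption based only on those slots being empty in the output above. The regular expression is written to cope with values that contain spaces (for example "WDC WD20EARS-00M"), whether the pairs arrive one per line or run together.

```python
import re
from collections import defaultdict

# key, optional ".N" slot suffix, then a value that runs up to the next key.
PAIR = re.compile(
    r"([A-Za-z]+)(?:\.(\d+))?=(.*?)(?=\s+[A-Za-z]+(?:\.\d+)?=|\s*\Z)", re.S
)


def parse_mdcmd(text):
    """Split an mdcmd-style dump into (general settings, per-slot settings)."""
    general = {}
    disks = defaultdict(dict)
    for name, slot, value in PAIR.findall(text):
        if slot:
            disks[int(slot)][name] = value
        else:
            general[name] = value
    return general, dict(disks)


def problem_slots(disks):
    """Slots with an unexpected status or a non-zero error counter."""
    bad = []
    for slot, fields in sorted(disks.items()):
        status = fields.get("rdevStatus", "")
        errors = fields.get("rdevNumErrors", "0")
        if status not in ("DISK_OK", "DISK_NP") or errors != "0":
            bad.append(slot)
    return bad


sample = (
    "mdState=STOPPED diskModel.0=WDC WD20EARS-00M rdevStatus.0=DISK_OK "
    "rdevNumErrors.0=0 diskName.13= rdevStatus.13=DISK_NP rdevNumErrors.13=0"
)
general, disks = parse_mdcmd(sample)
print(general["mdState"], problem_slots(disks))  # STOPPED []
```

In the email itself every populated slot shows DISK_OK with rdevNumErrors=0, and none of the rdevName entries is sdo, so this listing does not appear to say anything about the drive that logged the I/O errors.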