Nism0 Posted January 29, 2018 Posted January 29, 2018 Hi all, Since a few weeks one disk after another is having problems. Now three disks have problems.. I am clueless about what is causing this. I pulled all disks and changed some of the cabling but still the same disks come up with these problems. Does anybody know what to do next? tower-diagnostics-20180129-1130 2.zip
Nism0 Posted January 29, 2018 Author Posted January 29, 2018 I think all disks are fine, should i do a new config?
JorgeB Posted January 29, 2018 Posted January 29, 2018 You have 2 invalid disks with single parity, so both can't be emulated, disk13 has filesystem issues that should be fixed by running xfs_repair. As for the two invalid disks, any idea if any of them was written to after it was disabled? (disk12 will have some damage since it was rebuilding until you stopped it). As for the reason for the issues, you rebooted meanwhile and the syslog doesn't cover what caused this, but disk2 has pending sectors, disk13 shows some recent reported uncorrected errors, you're also using a SAS2LP, they have known issues with unRAID v6 and all disks with issues are connected there. Your best bet is probably doing a new config, then run an extended SMART test on both disk2 and disk13 and try to fix the filesystem on disk12 (and 13 after the extended test). Any more errors grab the diags before rebooting.
JonathanM Posted January 29, 2018 Posted January 29, 2018 Interesting coincidence that 2 of the drives with issues are https://en.wikipedia.org/wiki/ST3000DM001
Nism0 Posted January 29, 2018 Author Posted January 29, 2018 Thanks for the reply guys. Ok wait maybe i am not totally clueless. The ST3000DM001 are bad drives and they keep crashing (mainly during heavy load) and i keep rebuilding them. I thought i was not going to get myself into trouble but here i am... I will do a new config and report the outcome. Thanks!
Nism0 Posted January 29, 2018 Author Posted January 29, 2018 Ok, now we have this. Disk12 is ok and all data seems to be there.... Now do a xfs_repair on disk13?
Nism0 Posted January 29, 2018 Author Posted January 29, 2018 Ok were getting there... Repair was successful it seems. Had to do a -L...
JorgeB Posted January 29, 2018 Posted January 29, 2018 Disk2 is disable again, with errors, so this appears to confirm the pending sectors and that it should be replaced.
Nism0 Posted January 29, 2018 Author Posted January 29, 2018 Yep, tried some other stuff but i can't get it online. I can't even do a smart test from the web interface. I will replace it asap. If this results in not losing data then this is the most idiot proof system out there.
Recommended Posts
Archived
This topic is now archived and is closed to further replies.