enlinks Posted December 26, 2022 (edited)
Good afternoon everyone,
I have left the array in the broken state for now and collected the diagnostics report. I'm not sure whether I should proceed with a reload at this time, or whether there is anything else you'd like me to collect before that.
The impacted disk is WDC_WD40EZAZ-00SF3B0_WD-WX12D412T4UJ (sdc), currently mounted at (sdn).
To add: about a month or two ago I had an issue where all 8 disks in my array generated read errors. I wasn't sure what caused it, and it never happened again. I have the diagnostics from that as well if you think they would be helpful.
Attachment: legion-diagnostics-20221226-1405.zip
Edited December 26, 2022 by enlinks: forgot to mention impacted disk
JorgeB Posted December 27, 2022 (Solution)
The disk dropped and then reconnected, and SMART looks OK. This is usually a power/cable problem.
enlinks Posted December 27, 2022 (Author)
7 hours ago, JorgeB said: "Disk dropped and reconnected again and SMART looks OK, this is usually a power/cable problem."
OK great, thank you for confirming that. Excuse my ignorance, but are there other things I can check to troubleshoot besides swapping cables and so on? I only ask because I don't want to swap parts and then wait to see whether that was the problem. The fault is in my HBA, the QSFP cables, or my disk shelf. I know tools like megacli and storcli can report on the controller, but I'm not sure they are usable here, as I don't see those packages in the Nerd Pack.
JorgeB Posted December 27, 2022
You can run an extended SMART test, though the result is only conclusive if it fails.
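For anyone following along from a shell rather than the web UI, the extended test can be started with `smartctl -t long /dev/sdX` and checked with `smartctl -c /dev/sdX` (current progress) and `smartctl -l selftest /dev/sdX` (finished tests). Below is a minimal Python sketch for reading those two outputs. The function names are hypothetical, the regular expressions assume the usual ATA text layout printed by smartmontools, and `/dev/sdX` is a placeholder, so treat it as a starting point rather than a finished tool. Note that smartctl reports the percentage of the test *remaining*, not the percentage completed.

```python
import re
import subprocess

# Matches the progress line in `smartctl -c` output, e.g. "90% of test remaining."
REMAINING_RE = re.compile(r"(\d+)% of test remaining")

# Matches one row of the `smartctl -l selftest` table (assumed ATA layout):
# "# 1  Extended offline    Completed without error       00%     12345         -"
LOG_ROW_RE = re.compile(
    r"^#\s*(\d+)\s+(.+?)\s{2,}(.+?)\s{2,}(\d+)%\s+(\d+)\s+(\S+)\s*$"
)


def percent_remaining(capabilities_text):
    """Return the percentage of a running self-test still remaining, or None."""
    match = REMAINING_RE.search(capabilities_text)
    return int(match.group(1)) if match else None


def parse_selftest_log(log_text):
    """Return one dict per self-test log row found in the given text."""
    rows = []
    for line in log_text.splitlines():
        match = LOG_ROW_RE.match(line.strip())
        if match:
            rows.append({
                "num": int(match.group(1)),
                "test": match.group(2),
                "status": match.group(3),
                "remaining_pct": int(match.group(4)),
                "lifetime_hours": int(match.group(5)),
                "lba_of_first_error": match.group(6),
            })
    return rows


def run_smartctl(*args):
    """Run smartctl with the given arguments and return its stdout (needs root)."""
    result = subprocess.run(
        ["smartctl", *args], capture_output=True, text=True, check=False
    )
    return result.stdout
```

Typical use would be `percent_remaining(run_smartctl("-c", "/dev/sdX"))` while a test is running, and `parse_selftest_log(run_smartctl("-l", "selftest", "/dev/sdX"))` afterwards. A status other than "Completed without error" in the newest row is the conclusive failure case mentioned above.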
enlinks Posted January 3, 2023 (Author, edited)
Thanks @JorgeB. I ran a short test with no errors, but I had to restart the extended test (it had been running for a few days) because I lost power the other day. My UPS isn't the best, so a shutdown was triggered by the UPS time remaining. I'm running the extended test again to see if I can get it to complete this time.
Edited January 3, 2023 by enlinks: typo
enlinks Posted January 16, 2023 (Author)
@JorgeB just a final update on this. There seems to be something wrong with the WD Blues completing the extended test. The test hangs at 90% on both drives I've tried: Disk1 has been running since the 3rd, and I started Disk2 about 3 days ago. On my WD Golds, however, the extended test finished in about a day or two (I wasn't exactly watching the time). I'm going to try swapping out cables and troubleshoot from that angle. Thanks again!