kr4sv Posted April 1, 2020

Hi there,

Been using Unraid (Pro) since 2014, with some WD Greens (!!) from that time still running. Ironically, my newest (1 year old) WD Red now seems to be dying on me, and I'm not entirely sure how to proceed.

What happened so far:
- Scheduled parity check froze at 16% with > 2 million sync errors corrected
- Looking at the UI, all drives were spun up and marked as green, but disk4 did not report a temperature value
- Disk share for disk4 could not be accessed
- Connecting via SSH showed /mnt/disk4 being empty
- Cancelled the parity check after 10 hours without progress and rebooted the server (both via UI)
- Disk4 seemed to be back online, with data accessible both through the disk share and SSH
- Ran a SMART test on disk4 -> no errors
- Started a second parity check, which froze at 21% with the same disk4 being unresponsive
- I started manually backing up files from the disk share, but the disk randomly freezes and requires a reboot

Any hints on what to do next? I'm assuming my parity is invalid now, due to the 2 failed checks, correct?

tower-diagnostics-20200401-0913.zip
JorgeB Posted April 1, 2020

The diags were taken after the reboot, so we can't see what happened; I assume the disk never got disabled. Disk4 looks fine, but there are recent UNC @ LBA errors, so it's best to run an extended SMART test. If that's OK, run a non-correcting parity check and grab diags if it fails again. It could be the HBA, since you're using a SAS2LP and those have had known issues for a long time.
trurl Posted April 1, 2020

3 hours ago, kr4sv said: "assuming my parity is invalid now, due to the 2 failed checks"

If they were non-correcting checks then parity would not be changed.
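To illustrate the point about correcting versus non-correcting checks, here is a minimal sketch of a single-parity array modelled as a plain XOR across data blocks. This is a simplified toy model, not Unraid's actual implementation; the function names `compute_parity` and `parity_check` and the sample block values are hypothetical. It shows that when a disk returns bad data, a non-correcting check only reports the mismatch, while a correcting check rewrites parity to match the bad read.

```python
from functools import reduce
from operator import xor


def compute_parity(data_blocks):
    """XOR all data blocks together (toy single-parity model)."""
    return reduce(xor, data_blocks, 0)


def parity_check(data_blocks, parity, correcting):
    """Compare stored parity with parity recomputed from what the disks return.

    Returns (sync_error_found, parity_after_check).
    """
    expected = compute_parity(data_blocks)
    if expected == parity:
        return False, parity
    # Mismatch: a correcting check overwrites parity with the recomputed
    # value; a non-correcting check only reports and leaves parity alone.
    return True, (expected if correcting else parity)


# Hypothetical block contents on three data disks, and the matching parity.
good_reads = [0b1010, 0b0110, 0b1111]
parity = compute_parity(good_reads)

# Assumption for illustration: the second disk misbehaves and returns zeros.
bad_reads = [0b1010, 0b0000, 0b1111]

found_nc, parity_nc = parity_check(bad_reads, parity, correcting=False)
found_c, parity_c = parity_check(bad_reads, parity, correcting=True)

# After the non-correcting check, parity still matches the real data.
# After the correcting check, parity matches the bad read instead.
print(found_nc, parity_nc == compute_parity(good_reads))
print(found_c, parity_c == compute_parity(good_reads))
```

In this model the first scheduled check, which reported sync errors as "corrected", would correspond to the `correcting=True` case, which is why a non-correcting check is the safer choice while the cause of the bad reads is still unknown.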
kr4sv (Author) Posted April 1, 2020

The extended SMART scan is still running, but after checking the threads regarding the SAS2LP card I think it might be a good idea to replace it. I'm fairly certain this card was recommended hardware when I installed it 4-5 years ago. I never had problems, so I didn't verify compatibility when upgrading Unraid. I've got two Adaptec cards (6805 & 5805) lying around in my server graveyard; gonna check if those are supported.
kr4sv (Author) Posted April 1, 2020

The extended SMART test completed without errors, but the following parity check froze at 4%. Diagnostics (taken before reboot) attached.

tower-diagnostics-20200401-2240.zip
WDC_WD40EFRX-68N32N0_WD-WCC7K6RRJ110-20200401-2323.txt

Edited April 1, 2020 by kr4sv
JorgeB Posted April 2, 2020

Possibly a problem with the SAS2LP. Do you have another controller you could use? The SASLP/SAS2LP have not been recommended for a long time.
kr4sv (Author) Posted April 2, 2020

I ordered an LSI 9201-8i 6Gbps SAS HBA as a replacement, as the spare Adaptec cards I've got lying around seem to cause trouble based on other threads. Just in case the disk is the problem, I'm currently using the unBALANCE plugin to move the remaining data off it. Once the disk is empty, I might try building parity with a replacement disk on the same controller port while waiting for the new LSI card.