oh-tomo Posted June 18, 2022 Share Posted June 18, 2022 I'm going to shut down and replace the disk. Attached are the diagnostics. Wish me luck. server2018-diagnostics-20220618-0010.zip Quote Link to comment
JorgeB Posted June 18, 2022 Share Posted June 18, 2022 Doesn't look like a disk problem, suggest running and extended SMART test before replacing it. Quote Link to comment
oh-tomo Posted June 18, 2022 Author Share Posted June 18, 2022 8 hours ago, JorgeB said: Doesn't look like a disk problem, suggest running and extended SMART test before replacing it. What kind of problem does it look like? 14 hours left on new disk data rebuild. Should I abort and put the old disk back in for an extended SMART test? My impulse was that the array was vulnerable with a disabled disk exhibiting read/write errors so I wanted to get a functioning disk in there ASAP. Old disk warranty expired in February. Quote Link to comment
trurl Posted June 18, 2022 Share Posted June 18, 2022 Rebuilding to a new disk is always a little safer than rebuilding on top of the original. That way you have the original in case there are problems with rebuild. But of course you need a spare disk and they aren't free. Let rebuild complete. Quote Link to comment
oh-tomo Posted June 18, 2022 Author Share Posted June 18, 2022 Here's an updated diagnostics while it rebuilds. Took a while to generate with the rebuild going on. server2018-diagnostics-20220618-1202.zip Quote Link to comment
trurl Posted June 18, 2022 Share Posted June 18, 2022 That looks fine. I didn't check SMART for all other disks. Do any have SMART warnings on the Dashboard page? Quote Link to comment
oh-tomo Posted June 18, 2022 Author Share Posted June 18, 2022 55 minutes ago, trurl said: That looks fine. I didn't check SMART for all other disks. Do any have SMART warnings on the Dashboard page? Disk 4 has error (UDMA CRC error count 1), the rest are healthy on dashboard. Quote Link to comment
trurl Posted June 18, 2022 Share Posted June 18, 2022 33 minutes ago, oh-tomo said: Disk 4 has error (UDMA CRC error count 1), the rest are healthy on dashboard. That's OK. These are just logged by the drive when there is some inconsistency in the data it received. The data is resent. Click on that warning and acknowledge it, it will warn again if it increases. If you get a lot of these you should try to figure out where the problem is, connection or such. Quote Link to comment
oh-tomo Posted June 19, 2022 Author Share Posted June 19, 2022 I paused the rebuild and ten minutes later got an alert that Disk 1 (the new disk) is in error state. So maybe JorgeB is onto something. Running an extended SMART test now. How long do they take? Will the test reveal an actionable solution to the cause of the errors? Quote Link to comment
oh-tomo Posted June 19, 2022 Author Share Posted June 19, 2022 (edited) 33 minutes ago, oh-tomo said: I paused the rebuild and ten minutes later got an alert that Disk 1 (the new disk) is in error state. So maybe JorgeB is onto something. Running an extended SMART test now. How long do they take? Will the test reveal an actionable solution to the cause of the errors? Extended SMART test didn't finish -- "Interrupted (host reset)" I'll try again. Edited June 19, 2022 by oh-tomo Quote Link to comment
oh-tomo Posted June 19, 2022 Author Share Posted June 19, 2022 1 hour ago, oh-tomo said: Extended SMART test didn't finish -- "Interrupted (host reset)" I'll try again. Extended SMART test (second attempt) has been stuck at "self-test in progress, 10% complete" all this time. Quote Link to comment
JorgeB Posted June 19, 2022 Share Posted June 19, 2022 Extended SMART test will take several hours, also make sure spin down is disabled for that disk. Quote Link to comment
itimpi Posted June 19, 2022 Share Posted June 19, 2022 5 hours ago, oh-tomo said: Extended SMART test (second attempt) has been stuck at "self-test in progress, 10% complete" all this time. The test should take something like 1-2 hours per TB. Progress only gets updated every 10%. Quote Link to comment
oh-tomo Posted June 20, 2022 Author Share Posted June 20, 2022 (edited) "Completed without error" Now what? ST16000NE000-2RW103_ZL2P6V5T-20220619-2307 redacted.txt Edited June 20, 2022 by oh-tomo Quote Link to comment
oh-tomo Posted June 20, 2022 Author Share Posted June 20, 2022 I guess I could resume the parity check. Quote Link to comment
JorgeB Posted June 20, 2022 Share Posted June 20, 2022 3 hours ago, oh-tomo said: "Completed without error" Now what? It confirms the disk wasn't the problem, check/replace cables and if contents on the emulate disk1 look correct you can rebuild on top: https://wiki.unraid.net/Manual/Storage_Management#Rebuilding_a_drive_onto_itself Quote Link to comment
oh-tomo Posted June 25, 2022 Author Share Posted June 25, 2022 I couldn't see any problem with the cables. Maybe they were arcing too high towards the upper case fan. I tucked them a bit to the side away from the fan. I guess I just wait and see if the problem returns. Quote Link to comment
Recommended Posts
Join the conversation
You can post now and register later. If you have an account, sign in now to post with your account.
Note: Your post will require moderator approval before it will be visible.