aethl Posted May 3, 2022 Share Posted May 3, 2022 Hello All, had my first disk error this morning and am looking for some advice on how to proceed. Looks like one of my disks has 1024 read errors and has been disabled. Good thing is that the data on the array still seems to be intact. I am purchasing a new drive now to replace this with, but would like some assistance investigating what exactly happened. Any help is appreciated! unraid-diagnostics-20220503-0622.zip Quote Link to comment
JorgeB Posted May 3, 2022 Share Posted May 3, 2022 Disk dropped offline so there's no SMART report, but the log shows issues with multiple disks, so most likely a power/connection problem, power/cycling the server should bring the disk back online, if it does post new diags. Quote Link to comment
aethl Posted May 3, 2022 Author Share Posted May 3, 2022 Got it, doing that now. Thanks! Quote Link to comment
aethl Posted May 3, 2022 Author Share Posted May 3, 2022 Rebooted and I think the disk might be down for good. It's not getting picked up at all, even by my HBA card at bootup. Posting new diags. Would you mind expanding on the errors you observed with other disks? I ran short smart tests on all the drives and got new diags. No errors observed on the other drives, but going to run the extended test as well. unraid-diagnostics-20220503-0741.zip Quote Link to comment
JorgeB Posted May 3, 2022 Share Posted May 3, 2022 55 minutes ago, aethl said: Rebooted That's usually not enough, hence why I mentioned power cycling the server, you can also check connections. Quote Link to comment
aethl Posted May 3, 2022 Author Share Posted May 3, 2022 Sorry I should have been more specific, I did a full shutdown on the server (pulled power cables) and then powered on. Quote Link to comment
JorgeB Posted May 3, 2022 Share Posted May 3, 2022 OK, in that case disk is probably dead, you can swap cables with another one to be sure if you want. Quote Link to comment
aethl Posted May 3, 2022 Author Share Posted May 3, 2022 Yeah I will try moving to a different slot, but I think its dead. I really appreciate the help, thank you! Quote Link to comment
aethl Posted May 4, 2022 Author Share Posted May 4, 2022 Ok I was wrong, its not dead. Moved it to another slot and it picked back up again. Now its showing as a new disk. From what I understand, unraid would have dropped the disk from the array once the errors were detected. If I add it back to the array as a new disk, the parity should just rebuild it and I should be good to go right? Quote Link to comment
JorgeB Posted May 4, 2022 Share Posted May 4, 2022 4 minutes ago, aethl said: If I add it back to the array as a new disk, the parity should just rebuild it and I should be good to go right? Correct, make sure the emulated disk is mounting and contents look correct, whatever is there is what's going to be rebuilt, also good idea to check SMART since we couldn't before. Quote Link to comment
aethl Posted May 4, 2022 Author Share Posted May 4, 2022 Got it, SMART tests dont show any errors. I have the array in a stopped state right now, when the array was started all the data looked good and nothing was missing. Quote Link to comment
JorgeB Posted May 4, 2022 Share Posted May 4, 2022 If all looks good you can rebuild on top: https://wiki.unraid.net/Manual/Storage_Management#Rebuilding_a_drive_onto_itself Quote Link to comment
aethl Posted May 4, 2022 Author Share Posted May 4, 2022 Ah ok cool. So I think steps 1-5 are sort of already done here since when I power-cycled the server the disk was showing as missing at the HBA level so unraid already marked it as missing/unassigned. Thanks! Quote Link to comment
Recommended Posts
Join the conversation
You can post now and register later. If you have an account, sign in now to post with your account.
Note: Your post will require moderator approval before it will be visible.