Henning Posted July 2, 2021 Share Posted July 2, 2021 (edited) Hi there, I got a an unraid server with 13 disks and 2 parity drives ranging from 8-14TB. A few days ago I couln't open a few files so I checked the web interface and to my horror unraid reported two disks with errors. One data disk and the first parity drive. I shut down the server and now I want to start the trouble shooting and need your help. Before I shut it down I saw that both drives had an error count of about 2K. I don't suspect a drive failure since both drives failed at the same time. You can't see it but these mails came all in the same minute: My drives are connected either to the mainboard (Gigabyte X470 Aorus Ultra Gaming) or to one of two SAS cards. One Dell PERC H310 and one H200. Here some infos about the failed drives. * Disk 7 is connected to the H310 on the A Port along with three other drives. The B port doesn't have any drives. A SMART short self test completed without errors. Here are the downloaded SMART results: Disk 7 SMART.txt * Parity 1 is connected to the H200 on the A Port along with two other drives. The B port also has three drives connected. A SMART short self test completed without errors. Here are the downloaded SMART results: Parity 1 SMART.txt My first guess was that maybe one of the 4xSata SAS cables may gotten loose or one of the cards is faulty but the drives are on different cables on different cards What are my options now? What can I do to find the cause for the errors? If possible I want to prevent buying new drives and replace them if possible since the current prices are insane. I am pretty lost so any help is appreciated 🙂 Cheers! Edited July 2, 2021 by Henning Quote Link to comment
JorgeB Posted July 2, 2021 Share Posted July 2, 2021 4 minutes ago, Henning said: A few days ago I couln't open a few files so I checked the web interface and to my horror unraid reported two disks with errors This should never happen, you should have system notifications enable to be notified immediately when there's a problem, or it might be too late. 6 minutes ago, Henning said: I shut down the server and now I want to start the trouble shooting and need your help. Next time before doing this grab the diags or the syslog will be lost after powering back on. Start the array, grab the diagnostics and post the here. Quote Link to comment
Henning Posted July 2, 2021 Author Share Posted July 2, 2021 (edited) Hopefully there is no next time time but I will be wiser than. I grabbed the syslog in maintainance mode, I hope thats okay. Edited July 2, 2021 by Henning Quote Link to comment
JorgeB Posted July 2, 2021 Share Posted July 2, 2021 Just now, Henning said: in maintainance mode, I hope thats okay. It's not, and please post the complete diags: Tools -> Diagnostics Quote Link to comment
Henning Posted July 2, 2021 Author Share Posted July 2, 2021 unraid-diagnostics-20210702-1724.zip Quote Link to comment
JorgeB Posted July 2, 2021 Share Posted July 2, 2021 Emulated disk is mounting correctly, that why maintenance mode wasn't enough, disks look fine, you can rebuild on top and re-sync parity at the same time, since we can't see what happened it might be a good idea to replace/swap cables before doing it to rule that out if it happens again. https://wiki.unraid.net/Manual/Storage_Management#Rebuilding_a_drive_onto_itself 1 Quote Link to comment
Recommended Posts
Join the conversation
You can post now and register later. If you have an account, sign in now to post with your account.
Note: Your post will require moderator approval before it will be visible.