Bitbass Posted October 14 Share Posted October 14 Unraid started finding errors on Disk 4 last night. I didn't notice it until this morning, at which time I rebooted Unraid to see if it would clear it up. Instead, it's now showing Disk 2 as being unmountable. I'm suspecting I have a SATA controller problem, but the array is now trying to rebuild Disk 4. It is going VERY slowly. Maybe I screwed up on starting the array and having it kick into a rebuild. How do I navigate my way out of this now? Do I need to confirm it's a SATA controller problem? If I need to, I can transfer the system into a different motherboard and sata controller. I've done it before. I just need to understand what the risk is to stopping the rebuild, and if that's the right approach. unraid-diagnostics-20241014-1450.zip Quote Link to comment
Bitbass Posted October 14 Author Share Posted October 14 I have spare drives to swap in. I've learned my lesson from the past and I won't rush things now (despite this mistake I probably already made). The rebuild is jumping between 8 days and 30+ days. So, I have time before I do anything else! Quote Link to comment
JorgeB Posted October 15 Share Posted October 15 Cancel the rebuild, check/replace cables for disk2 and post new diags after array start. Quote Link to comment
Bitbass Posted October 15 Author Share Posted October 15 Replaced the SATA cable for Disk2. No change that I can tell. Unraid started in disk selection mode. I simply started the array. unraid-diagnostics-20241015-0804.zip Quote Link to comment
JorgeB Posted October 15 Share Posted October 15 Did you also replaced/swapped the power cable? If yes the disk could be failing. Quote Link to comment
Bitbass Posted October 15 Author Share Posted October 15 No, didn't try that. I can give that a shot, but the current setup is a chain of power connectors and this one is second in line. Wouldn't be crazy if that's gone bad, but unlikely. I saw in another thread that there's a way to do an XFS repair. Should I consider that, and if so, do I need to wait for the rebuild on Disk4 to complete? Quote Link to comment
JorgeB Posted October 15 Share Posted October 15 25 minutes ago, Bitbass said: there's a way to do an XFS repair. Not before resolving the ATA erros. Quote Link to comment
Bitbass Posted October 15 Author Share Posted October 15 Ok, how about I cancel the rebuild again, shut it all down, transfer it into my other motherboard and see what I get out of that? It would be a different SATA controller, obviously. Rules out a failure there. I guess what I'm asking is where am I at for potential data recovery? Is Disk2 a lost cause at this point and I need to consider that data gone? Or do I still have the data but I need to be careful about how I rebuild things? Quote Link to comment
JorgeB Posted October 15 Share Posted October 15 32 minutes ago, Bitbass said: I guess what I'm asking is where am I at for potential data recovery? It will depend on if disk2 is really failing or not, if it is, you may lose it's data and the one from the disabled disk, if disk2 is OK, the filesystem should be repairable with xfs_repair. Quote Link to comment
Bitbass Posted October 15 Author Share Posted October 15 What do you recommend as a next step? Quote Link to comment
JorgeB Posted October 15 Share Posted October 15 Swap both cables from disk2 with a different disk, this will also use a different SATA ports, and see where the ATA errors follow. Quote Link to comment
Bitbass Posted October 16 Author Share Posted October 16 Swapped cables around a couple of times. The unmountable was jumping around and not following a cable. So, I swapped motherboards. The good news is it's more stable now. No more unmountable situation, yet. The bad news is, the repair is moving very slowly. Looking at the Syslog, I think I've traced the ATA1 errors to the sdb drive. This is a newish drive, but it is a surveillance drive. It hasn't thrown any errors yet, aside from the ATA errors. When I try to run a Short SMART on Disk15 it fails with a Host Reset. All the other disks are fine with the SMART tests. Best I can figure is that Disk15 might have been failing, or maybe not, but Disk4 had write errors, went offline, and then when I rebooted Disk15 silently broke with the ATA errors that I didn't see at first. I'm attaching the current diag file. Hopefully someone can tell me this is plausible. So, my question is, what's the best path for me to recover with minimal data loss. I have drives I can swap in. I could probably add another drive to the array now, if there's a way for me to migrate the content or rebuild onto that without data loss. Current state is that I have a new Disk4 that doesn't have content on it. I have the old Disk4 that I can mount in another system (or in Unraid if that's the right thing to do) that might have content on it, but is throwing errors. I have Disk15 with ATA errors. I have spare 8TB drives that I can swap in. unraid-diagnostics-20241016-0746.zip Quote Link to comment
JorgeB Posted October 16 Share Posted October 16 Check/replace cables for disk15 and post new diags after array start. Quote Link to comment
Bitbass Posted October 16 Author Share Posted October 16 Ok, had an all morning power outage. What else can go wrong. Upon booting up Unraid with different cables for Disk15 I now have Disk15 as unmountable. Diags attached. And, I see data loss now. I know it might still be recoverable, but it's getting less likely. unraid-diagnostics-20241016-1317.zip Quote Link to comment
Solution JorgeB Posted October 16 Solution Share Posted October 16 There are now issues with three disks, it may be something other than the cables, like PSU, or if you are using splitters that's another possibility. Quote Link to comment
Bitbass Posted October 16 Author Share Posted October 16 Son of a... the culprit is attached. Appreciate you hanging in there @JorgeB. Everything is running much more responsively now and the rebuild on the drive I swapped out is blazing along at normal speeds. Quote Link to comment
JorgeB Posted October 17 Share Posted October 17 You should never use that kind of splitters, at most split on SATA plugin into two. Quote Link to comment
Recommended Posts
Join the conversation
You can post now and register later. If you have an account, sign in now to post with your account.
Note: Your post will require moderator approval before it will be visible.