Jump to content

Drive errors followed by Unmountable for a second drive after reboot


Go to solution Solved by JorgeB,

Recommended Posts

Unraid started finding errors on Disk 4 last night. I didn't notice it until this morning, at which time I rebooted Unraid to see if it would clear it up. Instead, it's now showing Disk 2 as being unmountable. I'm suspecting I have a SATA controller problem, but the array is now trying to rebuild Disk 4. It is going VERY slowly. Maybe I screwed up on starting the array and having it kick into a rebuild. How do I navigate my way out of this now? Do I need to confirm it's a SATA controller problem? If I need to, I can transfer the system into a different motherboard and sata controller. I've done it before. I just need to understand what the risk is to stopping the rebuild, and if that's the right approach.

unraid-diagnostics-20241014-1450.zip

Link to comment

I have spare drives to swap in. I've learned my lesson from the past and I won't rush things now (despite this mistake I probably already made). The rebuild is jumping between 8 days and 30+ days. So, I have time before I do anything else! :) 

Link to comment

No, didn't try that. I can give that a shot, but the current setup is a chain of power connectors and this one is second in line. Wouldn't be crazy if that's gone bad, but unlikely. 

 

I saw in another thread that there's a way to do an XFS repair. Should I consider that, and if so, do I need to wait for the rebuild on Disk4 to complete?

Link to comment

Ok, how about I cancel the rebuild again, shut it all down, transfer it into my other motherboard and see what I get out of that? It would be a different SATA controller, obviously. Rules out a failure there.

 

I guess what I'm asking is where am I at for potential data recovery? Is Disk2 a lost cause at this point and I need to consider that data gone? Or do I still have the data but I need to be careful about how I rebuild things?

Link to comment
32 minutes ago, Bitbass said:

I guess what I'm asking is where am I at for potential data recovery?

It will depend on if disk2 is really failing or not, if it is, you may lose it's data and the one from the disabled disk, if disk2 is OK, the filesystem should be repairable with xfs_repair.

Link to comment

Swapped cables around a couple of times. The unmountable was jumping around and not following a cable. So, I swapped motherboards. The good news is it's more stable now. No more unmountable situation, yet. The bad news is, the repair is moving very slowly. Looking at the Syslog, I think I've traced the ATA1 errors to the sdb drive. This is a newish drive, but it is a surveillance drive. It hasn't thrown any errors yet, aside from the ATA errors. When I try to run a Short SMART on Disk15 it fails with a Host Reset. All the other disks are fine with the SMART tests. 

 

Best I can figure is that Disk15 might have been failing, or maybe not, but Disk4 had write errors, went offline, and then when I rebooted Disk15 silently broke with the ATA errors that I didn't see at first. I'm attaching the current diag file. Hopefully someone can tell me this is plausible.

 

So, my question is, what's the best path for me to recover with minimal data loss. I have drives I can swap in. I could probably add another drive to the array now, if there's a way for me to migrate the content or rebuild onto that without data loss.

 

Current state is that I have a new Disk4 that doesn't have content on it. I have the old Disk4 that I can mount in another system (or in Unraid if that's the right thing to do) that might have content on it, but is throwing errors. I have Disk15 with ATA errors. I have spare 8TB drives that I can swap in.

unraid-diagnostics-20241016-0746.zip

Link to comment

Join the conversation

You can post now and register later. If you have an account, sign in now to post with your account.
Note: Your post will require moderator approval before it will be visible.

Guest
Reply to this topic...

×   Pasted as rich text.   Restore formatting

  Only 75 emoji are allowed.

×   Your link has been automatically embedded.   Display as a link instead

×   Your previous content has been restored.   Clear editor

×   You cannot paste images directly. Upload or insert images from URL.

×
×
  • Create New...