unmountable with large number of xfs_repair issues


sts


I wish I had started by posting here, but instead I tried to figure it out myself. I'll do my best to explain what has happened and what I've done over the past couple of days. To add to the headache, the server is built on an ASRock C2550D4I, so I'm guessing a combination of faulty cabling and/or bad Marvell controllers is at the root of my issues.

 

It started weeks ago with disk5 turning up as Unmountable: No file system. I assumed this was a bad drive and, since the system had no trouble emulating it with parity, I shut everything down and ordered a new drive. When it arrived I started rebuilding, but the process was slow and various disks were showing Hard Resetting Link errors.

 

I stopped the rebuild, shut the system down, replaced all the SATA cables and reorganized the SATA power cabling. That seemed to help, except disk4 was now also showing as Unmountable: No file system. After looking at the forums, I ran xfs_repair -v -L on disk4, making it mountable again.

 

I began rebuilding disk5 again and disk4 started producing errors. Then I noticed both disk4 and disk3 have SMART errors. I allowed the rebuild of disk5 to finish, only to discover it is, of course, also unmountable.

 

With the array in maintenance mode I ran xfs_repair -n on all disks and discovered disk3, disk5 and disk7 have issues. I ran xfs_repair -v on disk3 and disk7, which appeared to work, but the xfs_repair -n of disk5 (emulated, assigned or unassigned) reports a lot of "out-of-order bno btree" and "data fork in ino" messages and skips phases 5, 6 and 7.
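
For readers following along, the read-only pass described above can be sketched as a small script. It only prints the commands and does not run them. The /dev/mdN device naming is an assumption (array slots were commonly exposed that way in maintenance mode on Unraid 6 of this era; newer releases may name them differently), and xfs_check_cmd is a hypothetical helper name. Pointing xfs_repair at the array device, not the raw sdX disk, is the usual advice so that parity stays in sync.

```shell
#!/bin/sh
# Build the read-only check command for one array slot.
# ASSUMPTION: array disk N appears as /dev/mdN while the array is in
# maintenance mode. Confirm the device name on your own system first.
xfs_check_cmd() {
    # -n = no-modify mode: report problems, change nothing on disk
    printf 'xfs_repair -n /dev/md%s\n' "$1"
}

# Print (not execute) the check command for the slots named in the post.
for slot in 3 5 7; do
    xfs_check_cmd "$slot"
done
```

Reviewing the printed commands before running any of them makes it harder to point a repair at the wrong device.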

 

I've included as much of my idiotic fumbling as I can remember above in case it's helpful?

 

So I'm looking for advice: is there a way to make this drive mountable again, or is the file structure too damaged? Should I even attempt to run xfs_repair -v or -L on disk5? Unfortunately the system has been powered down and rebooted numerous times through this process, so the Diagnostics file is what it is; sorry about that. Is there a best method moving forward for rebuilding and mitigating data loss? I do have space on a second server for copying.

antron-diagnostics-20181113-1252.zip

virtual disk5 xfs_repair status -nv.txt


Thanks for having a look Johnnie. I had a spare M1015 / LSI SAS9220-8i which I've installed, so no Marvell controllers are being used. In maintenance mode now, with Disk5 emulated I ran another xfs_repair -n which produced what appears to be the same output as before. How should I proceed? Is there a repair I should attempt, or can I offer more information?

 


 

I just noticed you've said in another thread with a similar issue:

 

Quote

XFS_repair is searching the disk for a backup superblock, I remember it can take a while in big disks, unless there is something wrong with the disk, like pending sectors, just let it run.

 

So I'll just let it run overnight and cross my fingers it doesn't end in an error.

 

I'll write again when I see a result.

3 hours ago, sts said:

ah, I guess when I didn't have the replacement disk installed the assignments were different?

Unraid identifies disks by their serial number when it assigns them to slots, because the disk letters (sdX) are not guaranteed to stay the same between boots. If you need to use a disk letter for anything, always confirm which disk it currently refers to first.
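
One way to confirm the current letter for a given serial is to read the symlinks under /dev/disk/by-id, whose names normally include the model and serial number. The following is a minimal sketch, not an Unraid feature: list_disk_ids is a hypothetical helper name, and the exact link names vary by controller and udev version.

```shell
#!/bin/sh
# Print "<by-id name> -> <device node>" for every whole-disk link.
# The optional argument overrides the directory (useful for testing).
list_disk_ids() {
    dir="${1:-/dev/disk/by-id}"
    for link in "$dir"/*; do
        # Skip anything that is not a symlink (including an unmatched glob)
        [ -L "$link" ] || continue
        name=$(basename "$link")
        # Skip partition links such as ata-..._SERIAL-part1
        case "$name" in *-part[0-9]*) continue ;; esac
        printf '%s -> %s\n' "$name" "$(basename "$(readlink "$link")")"
    done
}

list_disk_ids
```

Matching the serial printed on the drive label against this list shows which sdX letter the drive has on the current boot.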


Thanks for the clarification trurl. I really should have known better.

 

Thanks again for the help Johnnie. I'll look into the xfs Mailing List.

 

I do have another question, out of curiosity. I've found that UFS file explorer has no problem browsing and copying off the replacement drive, and I'm assuming it could also read the original failed disk5. Would it be possible to back up the content of the failed drive to another location, then select New Config in Unraid with the existing array plus the replacement drive and rebuild the server with that empty disk (obviously abandoning emulation and the media that was on disk5), and finally re-add the media that was backed up using UFS file explorer? Or am I asking for a whole new set of problems?

 

 

3 hours ago, sts said:

rebuild the server with that empty disk, obviously abandoning emulation and the media that was on disk5, but then re-add the media that was backed up from using UFS file explorer?

You don't need a new config for that, just rebuild the disk as is, format and restore the data from the external disk.
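
After restoring from the external copy, it may be worth checking that every recovered file actually made it back. The sketch below lists files that exist in the backup but not in the restored location. missing_after_restore is a hypothetical helper name and both paths in the usage comment are assumptions; it only compares file names, not contents.

```shell
#!/bin/sh
# Print the relative path of every regular file found under the
# backup directory ($1) that is absent under the restored directory ($2).
missing_after_restore() {
    src="$1"
    dst="$2"
    ( cd "$src" && find . -type f ) | sort | while IFS= read -r rel; do
        # Report the path without the leading "./" if it was not restored
        [ -f "$dst/$rel" ] || printf '%s\n' "${rel#./}"
    done
}

# Usage (paths are placeholders, adjust to your own mounts):
#   missing_after_restore /mnt/disks/backup /mnt/disk5
```

No output means every file name in the backup has a counterpart on the restored disk.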

