Jump to content

3rd Data Disk errors during rebuild


Recommended Posts

So, i had 1 failed drive. I also needed more space, so i removed a failed drive and another drive to do an upgrade. I precleared them and i added them to the array in place of the previous 2 drives. During the rebuild, a 3rd drive has a large number of read errors, it has never had issues before this. Is there anything that can be done to save this?

Link to comment

Problem is that might make recovery not possible, or at least cause some data loss, you can still try but keep old disk11 intact, that way you can also try and use ddrescue on it.

 

To re-add old disk 15 and try to rebuild both disks you can do this:

 

-Tools -> New Config -> Retain current configuration: All -> Apply
-Check all assignments and assign any missing disk(s) if needed, including old disk 15 and new disks 11 and 21, replacement disks should be same size or larger than the old ones
-IMPORTANT - Check both "parity is already valid" and "maintenance mode" and start the array (note that the GUI will still show that data on parity disk(s) will be overwritten, this is normal as it doesn't account for the checkbox, but it won't be as long as it's checked)
-Stop array
-Unassign disks 11 and 21
-Start array (in normal mode now)

 

Grab and post new diags.

 

 

Link to comment

So just for clarity it was drive 21 that was healthy and drive 15 that was bad at first and drive 11 is the one that just had read errors. I substituted your instructions based on me giving you the wrong info in the first place. So i did the following:

 

-Tools -> New Config -> Retain current configuration: All -> Apply
-Check all assignments and assign any missing disk(s) if needed, including old disk 21 and new disks 11 and 15, replacement disks should be same size or larger than the old ones
-IMPORTANT - Check both "parity is already valid" and "maintenance mode" and start the array (note that the GUI will still show that data on parity disk(s) will be overwritten, this is normal as it doesn't account for the checkbox, but it won't be as long as it's checked)
-Stop array
-Unassign disks 11 and 15
-Start array (in normal mode now)

 

Just to verify i understood. Anyway diagnostic is attached. Array is only with 11 and 15 not installed.

tvserver01-diagnostics-20221121-1130.zip

Link to comment
47 minutes ago, scuppasteve said:

So just for clarity it was drive 21 that was healthy and drive 15 that was bad

That makes sense, like I mentioned disk 15 was logged as a disk problem.

 

Filesystem was not detected on disks 11 and 15, not a very good sign, but click on both disks with the array stopped and change the filesystem from auto to xfs, then post new diags after array start.

 

 

 

 

Link to comment
Nov 22 07:07:32 TVServer01 kernel: md: disk18 read error, sector=41446120
Nov 22 07:07:32 TVServer01 kernel: md: disk18 read error, sector=41446128
Nov 22 07:07:32 TVServer01 kernel: md: disk18 read error, sector=41446136
Nov 22 07:07:32 TVServer01 kernel: md: disk18 read error, sector=41446144
Nov 22 07:07:32 TVServer01 kernel: md: disk18 read error, sector=41446152
.....
Nov 22 07:14:12 TVServer01 kernel: md: disk10 read error, sector=8
Nov 22 07:14:12 TVServer01 kernel: md: disk10 read error, sector=16
Nov 22 07:14:12 TVServer01 kernel: md: disk10 read error, sector=24
Nov 22 07:14:12 TVServer01 kernel: md: disk10 read error, sector=32
Nov 22 07:14:12 TVServer01 kernel: md: disk10 read error, sector=40
Nov 22 07:14:12 TVServer01 kernel: md: disk10 read error, sector=48
Nov 22 07:14:12 TVServer01 kernel: md: disk10 read error, sector=56
Nov 22 07:14:12 TVServer01 kernel: md: disk10 read error, sector=64

 

See what's going with those disks, looks like they dropped offline and reconnected, this is usually a power/connection problem, since there are two emulated disks all other disks must work for Unraid to be able to emulate them.

Link to comment

Its more drive failures, its fucked at this point. 10 i was able to reseat, but 18 is bad, i pulled it and tried to test it on another machine. Can i start the array at this point with the data that is on the functioning drives. If i understand correctly, nothing is affected on the existing data drives. This is due to a series of fan failures and drives overheated. Not much that can be done now i think.

Link to comment

Join the conversation

You can post now and register later. If you have an account, sign in now to post with your account.
Note: Your post will require moderator approval before it will be visible.

Guest
Reply to this topic...

×   Pasted as rich text.   Restore formatting

  Only 75 emoji are allowed.

×   Your link has been automatically embedded.   Display as a link instead

×   Your previous content has been restored.   Clear editor

×   You cannot paste images directly. Upload or insert images from URL.

×
×
  • Create New...