2 Drives disabled with red X. Appear uninstalled when array is started


Solved by JorgeB

Just to let you know the background of how the parity drives got unassigned: I recovered from an April 30th backup. In August or September my parity disks failed and I replaced them successfully. I thought I had later backups, but I can't find them. I do use Connect (unraid.net) to do weekly backups, but I guess I haven't been copying them off the Unraid server for a while. I know that it is too late now, but I believe that is what happened. I should have thought twice before using that backup, but that is water under the bridge now.

Link to comment

Okay, slow down. Up to this point, most of the advice has been either about doing tests or hypothesizing options. Any time you think you're ready to take action, please post your planned steps here for review and approval. Every action you take brings you one step closer to losing data if it is the wrong action.

 

I believe your data is still intact, so don't give up hope.  But slow down and work with the guys here, don't do anything that's not reviewed and approved.

 

22 hours ago, JorgeB said:

That suggests something in your /config is causing the issue. You can back up the current flash drive first, then redo it and restore just the bare minimum: the key, super.dat, and the pools folder for the assignments. Also copy the docker user templates folder. If all works, you can then reconfigure the server, or try restoring a few config files at a time from the backup to see if you can find the culprit.

 

Did you follow this guidance to back up the current flash drive first, before restoring from backup? From a planning perspective, we need to know what options remain.
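For anyone following along, here is a minimal Python sketch of the "restore just the bare minimum" step quoted above. It is only an illustration under assumptions: the item list reflects my understanding of where an Unraid flash drive keeps the key, super.dat, the pools folder, and the Docker user templates inside `config`, and the function name `restore_minimal` is hypothetical. Verify the paths against your own backup before relying on it. It only reads from the backup folder and writes to the new one.

```python
import shutil
from pathlib import Path

# Assumed layout of an Unraid flash "config" folder; check your own backup.
MINIMAL_ITEMS = [
    "*.key",                             # licence key file(s)
    "super.dat",                         # array disk assignments
    "pools",                             # pool assignments
    "plugins/dockerMan/templates-user",  # Docker user templates
]


def restore_minimal(backup_config: Path, new_config: Path) -> list[str]:
    """Copy only the minimal items from a backup config folder.

    Returns the relative paths that were found and copied, so missing
    items are easy to spot. The backup folder is never modified.
    """
    copied = []
    for pattern in MINIMAL_ITEMS:
        for src in sorted(backup_config.glob(pattern)):
            rel = src.relative_to(backup_config)
            dst = new_config / rel
            dst.parent.mkdir(parents=True, exist_ok=True)
            if src.is_dir():
                shutil.copytree(src, dst, dirs_exist_ok=True)
            else:
                shutil.copy2(src, dst)
            copied.append(rel.as_posix())
    return copied
```

Calling something like `restore_minimal(Path("flash_backup/config"), Path("new_flash/config"))` (both paths are placeholders) returns the list of items it copied. If `super.dat` is not in that list, the backup does not contain the array assignments.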

 

Link to comment
1 minute ago, JorgeB said:

Connect the old disks in two known good slots (it can be the ones where you have the new disks 5 and 6) just to see if the old disks are detected. It is unlikely that they are both dead.

 

I would highly recommend at least starting with the original slots where the replacement drives are being detected, remove the replacements and install the originals there.  For now, don't touch any of the other "good" drives, as that could be compounding the problem, especially if you start to lose track of which drives are which.  Keep it simple.

 

Link to comment

That suggests something in your /config is causing the issue. You can back up the current flash drive first, then redo it and restore just the bare minimum: the key, super.dat, and the pools folder for the assignments. Also copy the docker user templates folder. If all works, you can then reconfigure the server, or try restoring a few config files at a time from the backup to see if you can find the culprit.

 

I followed this first but still couldn't assign disks, then went with the backup without thinking.

Link to comment

Are they using the onboard SATA controller? If they show up in the BIOS they should also show up in Unraid. The last diags you posted had 4 disks using the onboard SATA:

 

Jan 24 08:11:11 Tower kernel: ata5: SATA link down (SStatus 4 SControl 300)
Jan 24 08:11:11 Tower kernel: ata2: SATA link up 6.0 Gbps (SStatus 133 SControl 300)
Jan 24 08:11:11 Tower kernel: ata3: SATA link up 6.0 Gbps (SStatus 133 SControl 300)
Jan 24 08:11:11 Tower kernel: ata1: SATA link down (SStatus 4 SControl 300)
Jan 24 08:11:11 Tower kernel: ata4: SATA link up 6.0 Gbps (SStatus 133 SControl 300)
Jan 24 08:11:11 Tower kernel: ata6: SATA link up 6.0 Gbps (SStatus 133 SControl 300)

 

Ports 1 and 5 were down. Which ports are those disks connected to?
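As a rough illustration of how the "ports 1 and 5 were down" reading falls out of the log lines above, here is a small Python sketch. It assumes the syslog lines follow the exact `ataN: SATA link up/down` wording shown in this thread; the function name `link_status` and the regular expression are my own and not part of any Unraid tool.

```python
import re

# Matches e.g. "kernel: ata2: SATA link up 6.0 Gbps" or "kernel: ata5: SATA link down"
LINK_RE = re.compile(r"kernel: ata(\d+): SATA link (up|down)(?: ([\d.]+) Gbps)?")


def link_status(log_text: str) -> dict[int, str]:
    """Map each ata port number to the last link state reported for it."""
    status = {}
    for m in LINK_RE.finditer(log_text):
        port, state, speed = int(m.group(1)), m.group(2), m.group(3)
        status[port] = f"up {speed} Gbps" if state == "up" else "down"
    return status


# The six lines quoted in the post above.
SAMPLE = """\
Jan 24 08:11:11 Tower kernel: ata5: SATA link down (SStatus 4 SControl 300)
Jan 24 08:11:11 Tower kernel: ata2: SATA link up 6.0 Gbps (SStatus 133 SControl 300)
Jan 24 08:11:11 Tower kernel: ata3: SATA link up 6.0 Gbps (SStatus 133 SControl 300)
Jan 24 08:11:11 Tower kernel: ata1: SATA link down (SStatus 4 SControl 300)
Jan 24 08:11:11 Tower kernel: ata4: SATA link up 6.0 Gbps (SStatus 133 SControl 300)
Jan 24 08:11:11 Tower kernel: ata6: SATA link up 6.0 Gbps (SStatus 133 SControl 300)
"""

down_ports = sorted(p for p, s in link_status(SAMPLE).items() if s == "down")
print(down_ports)  # prints [1, 5]
```

A "link down" port only means nothing negotiated a link there. It can be an empty port, a bad cable, or a dead disk, which is why knowing which physical ports the disks are plugged into matters.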

Link to comment

The first 8 drives use onboard SATA. I then have 2 LSI PCIe SAS cards. Each LSI card has 8 ports. You should see 8 SATA drives, including the 2 original drives that I just swapped out. When Unraid disables a drive, does/can it set a flag stating that the drive is unusable? Grasping at straws here.

Link to comment
9 minutes ago, jkwaterman said:

The MB actually has 10 ports.   2 of the first 6 are shared so 2 are not used

Thanks, I see that now. There are two additional 2-port ASMedia controllers, and both disks are failing to initialize on one of them:

 

Jan 24 08:11:11 Tower kernel: ata7: SATA link up 6.0 Gbps (SStatus 133 SControl 300)
Jan 24 08:11:11 Tower kernel: ata7.00: failed to IDENTIFY (INIT_DEV_PARAMS failed, err_mask=0x80)
Jan 24 08:11:11 Tower kernel: ata7: limiting SATA link speed to 3.0 Gbps
Jan 24 08:11:11 Tower kernel: ata7: SATA link up 3.0 Gbps (SStatus 123 SControl 320)
Jan 24 08:11:11 Tower kernel: ata7.00: failed to IDENTIFY (INIT_DEV_PARAMS failed, err_mask=0x80)
Jan 24 08:11:11 Tower kernel: ata7: SATA link up 3.0 Gbps (SStatus 123 SControl 320)
Jan 24 08:11:11 Tower kernel: ata8: SATA link up 6.0 Gbps (SStatus 133 SControl 300)
Jan 24 08:11:11 Tower kernel: ata8.00: failed to IDENTIFY (INIT_DEV_PARAMS failed, err_mask=0x80)
Jan 24 08:11:11 Tower kernel: ata8: SATA link up 6.0 Gbps (SStatus 133 SControl 300)
Jan 24 08:11:11 Tower kernel: ata8.00: failed to IDENTIFY (INIT_DEV_PARAMS failed, err_mask=0x80)
Jan 24 08:11:11 Tower kernel: ata8: limiting SATA link speed to 3.0 Gbps
Jan 24 08:11:11 Tower kernel: ata8: SATA link up 3.0 Gbps (SStatus 123 SControl 320)
Jan 24 08:11:11 Tower kernel: ata8.00: failed to IDENTIFY (INIT_DEV_PARAMS failed, err_mask=0x80)
Jan 24 08:11:11 Tower kernel: ata8: SATA link up 3.0 Gbps (SStatus 123 SControl 320)

 

When this happens they can still show up in the BIOS and it can still be a disk issue, but to be 100% certain, please try connecting these two disks to the Intel controller. You can swap them with two others that are using it. If they do the same on the Intel controller, I would say for sure those disks are bad.
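To compare the before and after of a controller swap like the one suggested above, a small Python sketch can count the `failed to IDENTIFY` lines per device in a syslog excerpt. This assumes the kernel message wording shown in this thread; the function name `identify_failures` is hypothetical, and the sample is a subset of the lines quoted above.

```python
import re
from collections import Counter

# Matches e.g. "kernel: ata7.00: failed to IDENTIFY (...)"
IDENT_RE = re.compile(r"kernel: (ata\d+\.\d+): failed to IDENTIFY")


def identify_failures(log_text: str) -> Counter:
    """Count 'failed to IDENTIFY' messages per ata device."""
    return Counter(IDENT_RE.findall(log_text))


SAMPLE = """\
Jan 24 08:11:11 Tower kernel: ata7: SATA link up 6.0 Gbps (SStatus 133 SControl 300)
Jan 24 08:11:11 Tower kernel: ata7.00: failed to IDENTIFY (INIT_DEV_PARAMS failed, err_mask=0x80)
Jan 24 08:11:11 Tower kernel: ata7: limiting SATA link speed to 3.0 Gbps
Jan 24 08:11:11 Tower kernel: ata8.00: failed to IDENTIFY (INIT_DEV_PARAMS failed, err_mask=0x80)
Jan 24 08:11:11 Tower kernel: ata8.00: failed to IDENTIFY (INIT_DEV_PARAMS failed, err_mask=0x80)
"""

for device, count in sorted(identify_failures(SAMPLE).items()):
    print(device, count)
```

If the same disks produce these failures on a different controller, the counts simply move to the new `ataN` numbers, which points at the disks rather than the controller.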

 

 

 

 

Link to comment

I have been going through this thread while waiting for confirmation that the original disks are bad.  I noticed that I made a statement that may cause others to think that I did not back up the config before going down this path.  I did back up the config from the original flash drive before doing anything. I do have a super.dat file in the config folder backup.

Link to comment
Jan 16 16:12:36 Tower kernel: ata6.00: failed to IDENTIFY (INIT_DEV_PARAMS failed, err_mask=0x80)
Jan 16 16:12:36 Tower kernel: ata4: link is slow to respond, please be patient (ready=0)
Jan 16 16:12:36 Tower kernel: ata6: SATA link up 3.0 Gbps (SStatus 123 SControl 320)
Jan 16 16:12:36 Tower kernel: ata4: SATA link up 6.0 Gbps (SStatus 133 SControl 300)
Jan 16 16:12:36 Tower kernel: ata4.00: failed to IDENTIFY (INIT_DEV_PARAMS failed, err_mask=0x80)
Jan 16 16:12:36 Tower kernel: ata4: link is slow to respond, please be patient (ready=0)
Jan 16 16:12:36 Tower kernel: ata4: COMRESET failed (errno=-16)
Jan 16 16:12:36 Tower kernel: ata4: link is slow to respond, please be patient (ready=0)

 

Same issue. I think we can conclude that the old disks really failed, though it is kind of strange for both to fail the same way at the same time.

 

10 hours ago, jkwaterman said:

I did back up the config from the original flash drive before doing anything. I do have a super.dat file in the config folder backup.

This is from when the disks were already disabled? With the array like this:

 

image.png

 

 

Link to comment
On 1/23/2024 at 1:29 PM, JorgeB said:

That suggests something in your /config is causing the issue. You can back up the current flash drive first, then redo it and restore just the bare minimum: the key, super.dat, and the pools folder for the assignments. Also copy the docker user templates folder. If all works, you can then reconfigure the server, or try restoring a few config files at a time from the backup to see if you can find the culprit.

I backed up the flash drive config at this point, before I mucked with the April backup from when we know I had parity. Yes, if I understand you, the two drives were disabled.

Link to comment
