"Disk in parity slot is not biggest." after a power failure


Go to solution Solved by JorgeB,

Recommended Posts

I have 5 identical WD REDs, each 4TB. Three of them are data disks and two are parity.
Everything was working fine.

Then I installed Unassigned devices and attached a 5 TB USB drive with old photos and phone videos.
Everything was still working fine ... until we lost power.

When the system re-booted, the USB drive was still attached - and still larger than any of the parity drives.

This resulted in the "Wrong" error. After some research on this forum, I unplugged the USB drive and used Tools\New config.

This resolved the "Wrong" error, but the array remains stopped. The START button is no clickable and next to the "Stopped" is a green dot and the message that "Disk in parity slot is not biggest."

 

I've read in previous ports that this might be caused by the controller. In my case, all five drives are connected to SATA ports on the motherboard and it has worked fine for a couple of weeks.

Link to comment
  • Solution

OK, you are likely running into a bug, even if we fixed the parity issues all disks would show up unmountable, type this for all 3 data array disks:

 

sgdisk -o -a 8 -n 1:1M:0 /dev/sdX

 

Replace X with correct disk identifiers, as of last diags they were /dev/sdc, d and f, but they can change with a reboot, so double check.

 

Reboot, all 3 array data disks will show as wrong again, do another new config and check "parity is already valid" before starting the array, start the array and everything should be back to normal, if it's not post new diags.

 

 

 

 

Link to comment

I executed the command for all three drives.

Rebooted.

Like you predicted, the array remains stopped but the START button is clickable.

However, I'm unable to perform the New config. No matter which combination I select in the drop-down, the APPLY button stays inactive. Next to it is the message "Array has been Reset (please configure)".

 

Should I go ahead and START the array with "Parity is already valid." checked?

I've attached new diag. logs.

batcave-diagnostics-20240212-2025.zip

Link to comment

That seems to have fixed it. Thank you @JorgeB! Now, a few questions.

 

What can be learned from this? What did I do wrong? Was it a mistake to boot the system with the USB drive still attached?

And, most importantly: what did the magic spell command actually do?

Link to comment
11 hours ago, mappo said:

What did I do wrong?

You didn't do anything wrong, it was caused by a recently discovered bug, though the initial new config you did wasn't going to help in this case, and it prevented us from being 100% sure it was this bug, so just recommend asking for help before trying solutions found on the forum, because sometimes the symptoms can be similar but have a different root cause. 

  • Thanks 1
Link to comment

Join the conversation

You can post now and register later. If you have an account, sign in now to post with your account.
Note: Your post will require moderator approval before it will be visible.

Guest
Reply to this topic...

×   Pasted as rich text.   Restore formatting

  Only 75 emoji are allowed.

×   Your link has been automatically embedded.   Display as a link instead

×   Your previous content has been restored.   Clear editor

×   You cannot paste images directly. Upload or insert images from URL.