Jump to content
The Unraid Annual Cyber Weekend Sale is here 🔥 ×

Possible bad disks, parity rebuild stuck and unable to cancel


Go to solution Solved by formerly,

Recommended Posts

Posted

Hello, not sure what diags or info is required here, will grab whatever is needed.

 

Here's the scenario, I downsized rigs and moved before I could test the drives/config in the new rig. 6 disks, single parity. New box is up and running, config and settings in general seem intact.

 

- Immediately got an error on disk 3 (13TB WD white label shuck). "DEVICE IS DISABLED, CONTENTS EMULATED". Would not even get through an extended SMART test. Swapped bays, the error followed the drive.

- I took disk 5 (14TB WD white label shuck), did a new config and put it in disk 3's spot.

- Started array, parity rebuild starts, first time got to ~20% after ~18 hours. Slowed to double digit KB/sec.

- 5 CPU threads pegged at 100%

- Unable to pause/cancel rebuild

- Force rebooted machine

- Started array, parity rebuild starts again, got to 4.2%, now "running" at 604 KB/sec. 5 CPU threads pegged at 100% (including threads 16 and 18... but there are only threads 0-15 available (14700k). Unable to pause/cancel parity check again.

Posted

You should post your system's diagnostics zip file in your next post in this thread to get more informed feedback.   It is always a good idea to post this if your question might involve us seeing how you have things set up or to look at recent logs.

Posted
12 hours ago, itimpi said:

You should post your system's diagnostics zip file in your next post in this thread to get more informed feedback.   It is always a good idea to post this if your question might involve us seeing how you have things set up or to look at recent logs.

Diags here. 

 

I was able to cancel the parity sync, it just took many hours for the command to go through. Extended SMART test did not finish but I don't see a record of it failing. Trying it again now that the CPU isn't pegged.

unraidbr-diagnostics-20240601-1356.zip

Posted

@JorgeBcorrect. Z690 mobo, 14700k, 64gb ram. Jonsbo n 3 case. Standard consumer parts. 
 

Only outlier there is an m.2 to sata adapter card, but I have not heard of that causing driver/crashing issues. 

Posted

Looks like RAM is not the problem, unlikely that there would be two bad sticks, I would recommend downgrading to 6.11.5 and re-test, although rare, there are some cases where this issue can be caused by a kernel compatibility problem with the board, if it's the same with v6.11, then board or CPU would be the next suspects.

Posted

@JorgeB new diags.

 

I rolled it back to 6.11.3 because that was readily available in the GUI... It ran at 200MB/sec+ for nearly 4 hours before crashing to a much higher floor of 1.3 MB/sec...

 

I'm going to make sure BIOS is updated. 

  • 1 month later...
  • Solution
Posted

I ended up buying an identical motherboard and CPU to test. The old CPU was bad (and the new motherboard was bad). Not sure exactly what the issue was, but it was CPU related.

  • Like 1

Join the conversation

You can post now and register later. If you have an account, sign in now to post with your account.
Note: Your post will require moderator approval before it will be visible.

Guest
Reply to this topic...

×   Pasted as rich text.   Restore formatting

  Only 75 emoji are allowed.

×   Your link has been automatically embedded.   Display as a link instead

×   Your previous content has been restored.   Clear editor

×   You cannot paste images directly. Upload or insert images from URL.

×
×
  • Create New...