April 30, 2024: I just ran into an issue while rebuilding a drive. I have an array with dual parity disks, and 2 disks showed an error. This has happened before, and I usually rebuild one disk and then, once that completes, rebuild the other. This time, with the disk only partially rebuilt, I noticed some array data was missing. I stopped the rebuild, rebooted the server, and saw that 3 disks were unmountable. Something is causing my disks to drop out of the array, and now I'm in a state that I don't know how to recover from. I've attached diagnostics in case they help diagnose the issue. nasvm-diagnostics-20240429-1948.zip
April 30, 2024, Community Expert: Diags are from after the reboot, so we can't see what happened, but check the filesystem on those 3 disks: run it without -n, and if it asks for it, use -L.
May 7, 2024, Author: Thanks for the guidance. I was able to get things running again, although with data loss. I'm now trying to figure out why Drive 5 keeps dropping out with errors after the rebuild. SMART tests pass, so it must be something else. I've attached a current diagnostics dump. This is probably what contributed to whatever happened before. nasvm-diagnostics-20240507-0724.zip
May 7, 2024, Community Expert: Try uploading the diags again; the file is downloading as a corrupt 1KB file.
May 8, 2024, Community Expert: Lots of these for disk5: "May 5 05:19:45 nasvm kernel: sd 1:0:8:0: Power-on or device reset occurred". They usually indicate a power/connection problem, so check or replace the cables, or swap slots for that disk. You also need to check the filesystem on disk 1.
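For anyone wanting to see how often a disk is logging these resets, here is a minimal Python sketch that counts the "Power-on or device reset occurred" lines per SCSI address in a saved syslog. The file name `syslog.txt` and the helper name `count_resets` are assumptions for illustration only; the pattern is based solely on the one log line quoted above, so other log formats may need a different regex.

```python
import re
import sys
from collections import Counter

# Matches kernel lines shaped like the one quoted in the thread:
#   May 5 05:19:45 nasvm kernel: sd 1:0:8:0: Power-on or device reset occurred
RESET_RE = re.compile(
    r"kernel: sd (\d+:\d+:\d+:\d+): Power-on or device reset occurred"
)


def count_resets(lines):
    """Return a Counter mapping SCSI address (e.g. '1:0:8:0') to reset count."""
    counts = Counter()
    for line in lines:
        match = RESET_RE.search(line)
        if match:
            counts[match.group(1)] += 1
    return counts


if __name__ == "__main__":
    # Hypothetical default path; pass the real syslog location as an argument.
    path = sys.argv[1] if len(sys.argv) > 1 else "syslog.txt"
    with open(path, errors="replace") as fh:
        for address, total in count_resets(fh).most_common():
            print(f"{address}: {total} reset(s)")
```

A count that keeps climbing for one address after reseating or replacing cables would point back at that slot or its power feed; this only tallies log lines and does not diagnose the cause.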
May 14, 2024, Author: Things are still rebuilding, but so far no changes and no drops. Not sure what exactly was going on. I will continue to monitor. Thank you for the help.