May 9 I woke this morning to find that 2 disks have failed in my array, Disk 7 and Disk 8. I'm running 7.2.4. What is my best course of action? When 1 disk has failed before, I've restarted the array without it, rebuilt the array, then added it back in and rebuilt the array again. I guess I could do this one disk at a time, but I am worried as I'll have no parity protection. See attached syslog: tower-syslog-20260509-0831.zip Edited May 9 by clevoir (typo)
May 9 Author I've bitten the bullet and bought 2 new 8TB hard drives, as I could do with some more capacity. I see that the hard drives failed after a power failure. I've seen this before and bought a UPS with AVR afterwards, so I'm disappointed that a power failure has caused drives to fail.
May 9 Community Expert 1 hour ago, clevoir said: so disappointed a power failure has caused drives to fail
Are you sure that the disks have failed and not simply been disabled due to a write error caused by an external factor (such as the power failure)? Running the extended SMART test on a drive gives a good indication of the drive's health.
May 9 Author The array won't let me run any self-tests. If I click on either self-test, nothing happens?
May 9 Community Expert Probably worth posting a diagnostics zip file created after a failed attempt to run the self-test, so we can see why it failed.
May 9 Community Expert Good news is both emulated disks are mountable, so the rebuild should be OK after the hardware problems are resolved. Both disks disconnected and reconnected as Unassigned Devices. Do the extended tests on the Unassigned Devices.
May 9 Community Expert SMART for those Unassigned Devices looks OK, but both disks were logging critical medium errors before they disconnected. So definitely do the extended test on both.
May 9 Author Thanks for the reply. I've got some new 8TB hard drives being delivered today. I'll swap them in for the failed ones, then do some testing on the old ones.
May 9 Author I am doing the extended test now, and I intend to preclear the new disks before use, which is going to take some hours. If the failed disks test OK, I guess I could rebuild the array with them, or at least one of them, and if the rebuild is OK I can replace them with the new disks later at leisure? I am just concerned about the amount of time it's going to take to preclear the new disks and then rebuild the array, and I worry about having another fault, as I will have no parity protection. I guess it's going to be safer to rebuild the array one disk at a time rather than 2?
May 9 Community Expert 8 hours ago, clevoir said: safer to rebuild the array with one disk at a time rather than 2?
Since you have dual parity, even if the other disk has problems while you rebuild one, the rebuild should be OK because you still have enough redundancy. On the other hand, it will take longer to do them one at a time, so it will be longer until everything is good again. Might be a good idea to avoid anything writing to the array (users, dockers, VMs, scripts) until you get them both rebuilt.
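The redundancy argument in the post above can be sketched as simple arithmetic. This is only a simplified model, assuming every unavailable disk counts equally and ignoring read errors on the remaining disks; the function name `redundancy_margin` is hypothetical and not part of Unraid.

```python
def redundancy_margin(parity_disks: int, unavailable_disks: int) -> int:
    """Return how many further disk failures could still be absorbed.

    Simplified model (an assumption, not Unraid code): an array with P
    parity disks can reconstruct data while at most P disks are
    unavailable. A negative result means reconstruction is not possible.
    """
    return parity_disks - unavailable_disks


PARITY = 2  # dual parity, as stated in the thread

# Disk 7 and disk 8 both emulated: still reconstructable, but no margin left.
both_emulated = redundancy_margin(PARITY, 2)

# After one of the two disks has been rebuilt: one further failure is tolerable.
one_emulated = redundancy_margin(PARITY, 1)

print(both_emulated, one_emulated)
```

In this model the rebuild works either way, but the array has no spare redundancy until at least one disk is rebuilt, which is why limiting writes in the meantime is suggested.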
May 10 Author Both existing disks tested OK with a long SMART test. However, I have stopped the array and tried to add one of them. It's listed, but when I select it as an array disk, it's immediately changed to not assigned. I have tried with the other disk and this has the same effect? I have cleared both disks, and both are available to preclear. I am thinking about preclearing Disk 8 (4TB) to see if it will then let me add it to the array. In the meantime I am preclearing the 2 new 8TB drives, which looks like it's going to take a total of 50 hours. Please find attached diagnostics: tower-diagnostics-20260510-1052.zip
May 10 Community Expert 11 hours ago, trurl said: Might be a good idea to avoid anything writing to the array (users, dockers, VMs, scripts) until you get them both rebuilt.
I forgot to mention Mover, so it was busy moving things to the array while those disks were emulated. Shouldn't be a problem, that's what emulation is for. But it does mean the physical disks are more out-of-sync with the array than before. Maybe I overlooked it, but I didn't see in syslog where you actually reassigned the disks.
24 minutes ago, clevoir said: tried to add one of them.
I never use the word "add" when I mean "replace". "Add" means assigning a disk to a new slot in the array, not to the same slot that you are replacing. Did you try to "replace", or did you try to "add"?
May 10 Community Expert I do see this though:
May 10 11:27:28 Tower kernel: mdcmd (9): import 7 sdn 64 5860522532 0 WDC_WD60EFPX-68C5ZN0_WD-WX42D8445KLT
May 10 11:27:28 Tower kernel: md: import disk7: lock_bdev error: -2
May 10 11:27:28 Tower kernel: md: import_slot: 7 empty
May 10 11:27:28 Tower kernel: mdcmd (10): import 8 sdd 64 3907018532 0 WDC_WD40EFAX-68JH4N1_WD-WX22DB05D97H
May 10 11:27:28 Tower kernel: md: import disk8: lock_bdev error: -2
May 10 11:27:28 Tower kernel: md: import_slot: 8 empty
Not sure what that's about. A few search results from a few years ago. I'll take a closer look.
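For anyone scanning a long syslog for the same pattern, here is a minimal Python sketch that pulls the failed imports out of the lines quoted in the post above. The function name `find_import_errors` and the regular expressions are hypothetical helpers written for this excerpt only. Reading `-2` as a negated Linux errno (`ENOENT`, "No such file or directory") follows the usual kernel convention and is an assumption here, not something the thread confirms about the md driver.

```python
import errno
import re

# Lines copied from the syslog excerpt quoted in the thread.
LOG = """\
May 10 11:27:28 Tower kernel: mdcmd (9): import 7 sdn 64 5860522532 0 WDC_WD60EFPX-68C5ZN0_WD-WX42D8445KLT
May 10 11:27:28 Tower kernel: md: import disk7: lock_bdev error: -2
May 10 11:27:28 Tower kernel: md: import_slot: 7 empty
May 10 11:27:28 Tower kernel: mdcmd (10): import 8 sdd 64 3907018532 0 WDC_WD40EFAX-68JH4N1_WD-WX22DB05D97H
May 10 11:27:28 Tower kernel: md: import disk8: lock_bdev error: -2
May 10 11:27:28 Tower kernel: md: import_slot: 8 empty
"""

# Hypothetical patterns, matched against the excerpt above only.
IMPORT_CMD = re.compile(r"mdcmd \(\d+\): import (\d+) (\S+) ")
IMPORT_ERR = re.compile(r"md: import disk(\d+): lock_bdev error: (-?\d+)")


def find_import_errors(text):
    """Return (slot, device, code, errno_name) for each failed import."""
    devices = {}  # slot number -> device name from the mdcmd import line
    errors = []
    for line in text.splitlines():
        cmd = IMPORT_CMD.search(line)
        if cmd:
            devices[int(cmd.group(1))] = cmd.group(2)
            continue
        err = IMPORT_ERR.search(line)
        if err:
            slot, code = int(err.group(1)), int(err.group(2))
            # Assumption: the code is a negated errno value.
            name = errno.errorcode.get(abs(code), "unknown")
            errors.append((slot, devices.get(slot), code, name))
    return errors


for slot, device, code, name in find_import_errors(LOG):
    print(f"disk{slot} ({device}): lock_bdev error {code} ({name})")
```

If the errno reading is right, both slots failed because the block device could not be found when the import ran, but that interpretation would need confirming against the diagnostics.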
May 10 Author Sorry, perhaps the wrong choice of words. I stopped the array and tried to reassign the previously failed disks 7 and 8 back into their original slots. They were selectable, but as soon as I selected them, the array reported the disk slot as unassigned. If the original disks are going to cause a problem, I may just hold out until the new disks have finished their preclear. I was just trying to seek some reassurance by having one or both of the original disks working again before they were replaced by the new disks. Edited May 10 by clevoir (added)
May 10 Community Expert The problem disks are on different controllers, but none of the other disks on either controller are having problems. Do you have a backplane, or are there separate cables going to each drive?
May 10 Author All disks are in a number of 5-in-3 IcyDock backplanes. The new disks have just started the Preclear post-read, which is going to take 10 hours.
May 10 Author The failed disks are shown as unassigned devices and are available to preclear. I can't understand why I couldn't reassign them back to the array? I've been running Unraid since V3 and have not seen this behavior before. I am looking to replace them with the new drives once the preclear has passed, and carry out testing on the failed ones. Nothing has changed on the backplanes, and the disks and cabling haven't been touched. The failed disks are showing that they have power. Edited May 10 by clevoir (typo)
May 11 Author I have now swapped out Disk 8 and started an array rebuild. However, the rebuild paused after 12 mins, see attached diagnostics? I've just resumed the rebuild. I have a spare controller card that I could swap with the one that serves this disk. tower-diagnostics-20260511-0717.zip
May 11 Community Expert It's logged as a disk error, but looks more to me like a controller or power issue.
May 11 Author The rebuild is thankfully still ongoing. If it fails again, I'll check the power and data wiring and move the disk to another controller. Should it rebuild OK, I guess I can do the same. I thought that I would add the new disks one at a time in case of problems.
May 11 Author After restarting the rebuild I noticed that it has logged 54 faults. I thought that I'd resumed the rebuild after the faults were logged, but I have noticed it is doing a read check instead? Shall I let this complete, or power down, check the data and power leads, and move the disk to another controller?
May 11 Community Expert A read check implies the drive was disabled (red 'x'), and if so there is not a lot of point in continuing the check. Can't do more than guess without diagnostics.
May 11 Author These are the latest diagnostics. Shall I power down, check the cabling, move the disk to a new controller, and then try to start the rebuild again? tower-diagnostics-20260511-1305.zip
May 11 Community Expert 27 minutes ago, clevoir said: shall I power down / check cabling / move disk to new controller and then try and start the rebuild again?
Worth a try, and post new diagnostics if it fails again.