ERRORS: Emulating two drives during a rebuild! :-(


Go to solution Solved by JorgeB,

Recommended Posts

!diagnostics-20230421-1305.zipHi All

 

GHOST IN THE MACHINE?

I really need some help to fix my Unraid server, seeing many new errors and not sure what's the cause of it?

So far Unraid have ben pretty stable.

 

Most of this just started during a rebuild of a new replacement drive nr. 18 I got this error:

image.thumb.png.3e7dd09bf84b0bd81ed8843fab0be564.png

 

And log from Disk 1:

image.thumb.png.71c284231866bb89118cea277c85b736.png

 

 

But I guess it best to wait for the drive 18 to finish rebuild before trying to do a reboot and rebuild drive 1?

Since it looks like I am emulating 2 drives! dISK 1 and disk 18

 

image.thumb.png.161abdffca0051dfcccdd066e99b29cf.png

 

Drive 1 now shows up under Unassigned drives?

 

On top of this I am getting some strange other strange errors and behaviors:

image.thumb.png.bded15185145ed098263308639c8c2f6.png

 

Run out of memory message:

image.thumb.png.673669334089a104cfcf8375ce6ce575.png

 

image.thumb.png.54c9b42217b968209d26e827e31fa70c.png

 

Diagnostic attached

 

diagnostics-20230420-1529.zip

 

New Diag disk 1 was removed?

!diagnostics-20230421-1305.zip

 

 

 

Edited by casperse
Link to comment
  • casperse changed the title to ERRORS: Emulating two drives during a rebuild! :-(
  1. I stopped the array removed disabled drive 1 and started array without VM & Docker
  2. I then added drive 1 and started the array again and it started a rebuild of drive 18 and emulated drive 1
  3. BUT then it stopped and when I looked in the log file I got this:

image.thumb.png.f3fc171bdde30889c4f33643ee6545aa.png

 

Now its writing error on drive 2 & 6

I have shutdown the server and now I dont know how to proceed?

 

New diagnostic files attached here:

 

 

!!diagnostics-20230421-1352.zip

Link to comment

Okay @JorgeBI got a brand new 1000W Corsair PSU and I just booted the server now I get a new error message:

image.thumb.png.0e35dfa99118042026794069f1f19259.png

I am pretty sure I have a backup on my Unraid account?

BUT I can see the drive 18 is started to rebuild! and the logs doesn't show any errors! so far so good!

 

image.thumb.png.48164fcdc75ca7b59d32d46fc1c7e33c.png

 

Getting new errors again, but its still rebuilding (ETA 4 days!)

image.thumb.png.35d66ec1a879786e577f890bfdc68251.png

So what should I do now? wait for two drives to rebuild?

Do I need a new USB for Unraid?

 

Link to comment

It only appeared at start up and I have not seen it since. I can see that it it did a backup of the USB to "My server" so that also worked!
That's a good thing right.

 

Like this:

image.thumb.png.6da8f51a431104aeee6862789799396a.png

So the error is not related to drive failures but nvme drive errors (because of the SMART transfer warnings)

 

Again thanks for helping me out! I would never have guessed that my Platinum Corsair AX860i power supply would cause problems, actually think the have a very long warranty have to check that. 

Link to comment
4 hours ago, JorgeB said:

You can try a different PCIe slot for the HBA, if it's in a CPU slot try a PCH one, or vice versa.

 

Thanks I will try that after rebuild is done. I think the HBA is in one of the x8 slots

image.png.05e8d5a8582729c373db1d86ac4ea3a9.png

Unfortunately look like it is going to take (4-5 days) a very long time for the 18TB + 12TB drives to rebuild.

 

Is there any tweaks I can use to do this faster? (Thinking of the disk settings, most is larger and faster drives)

So far I think its very standard values (I have tried to search the forum but haven't found any newer post about this subjetc
image.thumb.png.433609154838092f8c5f833b4cbccffc.png

Link to comment

Hi JorgeB
 

Speed is at highest 120MB/sec and now pretty low 8.6 MB/sec most of the time:

image.thumb.png.d37ee83ae219c7ff71a3dde4ea24265f.png

 

lspci -d 1000: -vv
image.thumb.png.32c426cff4a761201a0aa8224d266c13.png

 

 

Quote

You can try a different PCIe slot for the HBA, if it's in a CPU slot try a PCH one, or vice versa.


My current PCie slot and placement of HW:

PCIe slot 1: x8 NVIDIA Quadro P2000

PCIe slot 2: x4 NVIDIA GeForce RTX 3060

PCIe slot 3: x8 LSI Logic SAS 9305-24i Host Bus Adapter

PCIe slot 4: x4 M.2. NVMe ICY BOX: IB-PCI215M2-HSL adapter

 

New placement?

PCIe slot 1: x8 LSI Logic SAS 9305-24i Host Bus Adapter

PCIe slot 2: x4 NVIDIA Quadro P2000

PCIe slot 3: x8 NVIDIA GeForce RTX 3060

PCIe slot 4: x4 M.2. NVMe ICY BOX: IB-PCI215M2-HSL adapter

Link to comment

Hi Jorge B.

It just finished it must have speeded up in the end....and it says that the array is ok for both drives now?

When it was running it only stated it was rebuilding the 18TB drive and not Drive 1 (12TB) but according to Unraid my array is fine now?

Anyway I am now looking into building another Unraid server as a Backup server (This was kind of a wakeup call)

Again thanks for all your help so happy to be up and running again with parity drives on all drives!

Link to comment

Join the conversation

You can post now and register later. If you have an account, sign in now to post with your account.
Note: Your post will require moderator approval before it will be visible.

Guest
Reply to this topic...

×   Pasted as rich text.   Restore formatting

  Only 75 emoji are allowed.

×   Your link has been automatically embedded.   Display as a link instead

×   Your previous content has been restored.   Clear editor

×   You cannot paste images directly. Upload or insert images from URL.