unRAID 6 Problems after PSU Failure


Recommended Posts

Can you get the webUI to work at all? It would be a lot better if you could give us the complete diagnostics by going to Tools - Diagnostics and posting the complete diagnostics zip.

 

Or you can just type "diagnostics" at the command line to save the diagnostics zip to flash and post it.

Link to comment

I replaced 3 drives and in slot 9, drive is out waiting for replacement drive to arrive. I have slot 17, 18, 19 and 24 free. The server hangs after a period of time and I have to reset power via IPMI because reboot command via cli hangs.

Link to comment

Both the parity disk and the SSD on the Highpoint controller are having issues:
 

Jun  7 07:43:44 unRAID kernel: ata5: limiting SATA link speed to 1.5 Gbps
Jun  7 07:43:44 unRAID kernel: ata5: exception Emask 0x10 SAct 0x0 SErr 0x180000 action 0x6 frozen
Jun  7 07:43:44 unRAID kernel: ata5: edma_err_cause=00000020 pp_flags=00000001, SError=00180000
Jun  7 07:43:44 unRAID kernel: ata5: SError: { 10B8B Dispar }
Jun  7 07:43:44 unRAID kernel: ata5: hard resetting link
Jun  7 07:43:44 unRAID kernel: ata6: limiting SATA link speed to 1.5 Gbps
Jun  7 07:43:44 unRAID kernel: ata6: exception Emask 0x10 SAct 0x0 SErr 0x180000 action 0x6 frozen
Jun  7 07:43:44 unRAID kernel: ata6: edma_err_cause=00000020 pp_flags=00000000, SError=00180000
Jun  7 07:43:44 unRAID kernel: ata6: SError: { 10B8B Dispar }
Jun  7 07:43:44 unRAID kernel: ata6: hard resetting link
Jun  7 07:43:44 unRAID kernel: ata6: SATA link up 1.5 Gbps (SStatus 113 SControl 310)
Jun  7 07:43:44 unRAID kernel: ata5: SATA link up 1.5 Gbps (SStatus 113 SControl 310)

If possible connect both on a different controller.

 

Parity is also showing some recent SMART errors, so you may want to run a extended test on it.
 

Quote


Device Model:     WDC WD80EFZX-68UW8N0
Serial Number:    VKGNLLMX
  5 Reallocated_Sector_Ct   0x0033   100   100   005    Pre-fail  Always       -       14
  9 Power_On_Hours          0x0012   099   099   000    Old_age   Always       -       8719
196 Reallocated_Event_Count 0x0032   100   100   000    Old_age   Always       -       14

 

Error 4 occurred at disk power-on lifetime: 8610 hours (358 days + 18 hours)
  When the command that caused the error occurred, the device was active or idle.

  After command completion occurred, registers were:
  ER ST SC SN CL CH DH
  -- -- -- -- -- -- --
  84 53 01 c6 4b 17 40  Error: ICRC, ABRT 1 sectors at LBA = 0x00174bc6 = 1526726

 

 

Link to comment

That controller was installed after the psu failure in order to move Parity from slot 1 and cache from slot 24 to inside the norco case, the highpoint was bought used, so maybe that is the problem. I will check. 

 

Thanks a lot for your help

Link to comment
2 hours ago, johnnie.black said:

It uses a Marvell controller, more than half the people I help with various disk issues are because of a Marvell controller, for current unRAID recommend using LSI controllers.

 

Since I am in Venezuela and it's hard to test cards an return them to the US if they don't work, do you have any suggestion in particular?

 

Norco RPC-4224 case, Supermicro X9SCM-iiF board, Xeon® CPU E3-1230 V2 @ 3.30GHz, 2 x 8 GB ECC Kingston Memory, 2 Serveraid M1015 cards, 2 Highpoint 620 cards and 1 Highpoint 2300LF card.

 

Thanks for help

Link to comment

Join the conversation

You can post now and register later. If you have an account, sign in now to post with your account.
Note: Your post will require moderator approval before it will be visible.

Guest
Reply to this topic...

×   Pasted as rich text.   Restore formatting

  Only 75 emoji are allowed.

×   Your link has been automatically embedded.   Display as a link instead

×   Your previous content has been restored.   Clear editor

×   You cannot paste images directly. Upload or insert images from URL.