2 Disks got disabled


Recommended Posts

Actually 3 disks! 

 

First it started with Parity 1 with (UDMA CRC error count)

I changed the SATA cable and connected it to another SATA port in the motherboard, and started rebuilding.

While rebuilding, Disk 2 got disabled with +2000 IO errors. Rebuilding was not interrupted, I think because I have 2 Parities.

Rebuilding completed, Parity 1 disk became green again. Disk 2 has red ball.

After few hours, Disk 3 got disabled.

 

Disk 2 - ST8000DM004-2CX188_WCT0M8D4 (sdm) (errors 2)
Disk 3 - ST8000DM004-2CX188_WG8011AG (sdl) (errors 2048)

 

Parity 1 is connected like this: motherborad -> 5.25 to 3.5 converter -> HDD

Disk 2 and Disk 3 are connected to LSI SAS HBA through the same port.

 

What is the best way to proceed? I believe losing another disk would cause data lose.

I have a spare pre-cleared disk but it has (60 Reported uncorrect), not sure how it got pre-cleared without issues.

 

Please let me know if need more info. I know I did a mistake by shutting down the server before getting the logs, I wanted to check the connections and was too scared to lose another disk.

 

Thanks.

 

 

Unraid Version: 6.7.2

CPU: AMD Ryzen 7 2700X

Mobo: MSI X470 GAMING PLUS (MS-7B79)

Memory: 16 GiB DDR4

 

image.thumb.png.3a30391f8979c1879f48693d1a999ac5.png

 

image.png.4333bdb6ce7c0981ea6e62893704f163.png

 

tower-diagnostics-20200528-1303.zip

Edited by HAMANY
Link to comment
31 minutes ago, johnnie.black said:

You rebooted since the errors so we can't see what happened, for now I would recommend unassigning both disabled disks and starting the array, check that both emulated disk mount correctly and contents look OK.

 

Thanks for the quick response. Will try this now.

Edit: Done, the files looks fine.

 

Is there anyway to prevent the logs from being removed with rebooting, and keep them for few days?

Edited by HAMANY
Link to comment
Just now, johnnie.black said:

You can set up a syslog server, though that's used more for troubleshooting when the server keeps crashing or similar, for this kind of problem you just need to download the diagnostics before rebooting.

Noted.

 

The server is up and my files looks fine.

Should I assign the 2 disks again and start rebuilding?

Link to comment
  • 1 month later...
On 6/1/2020 at 1:18 PM, johnnie.black said:

That looks more like a power/connection issue, swap/replace cables and try again.

Thank you Johnnie!

The drives are now more stable after changing one of the power cables (4pins to SATA).

 

However, I''m getting read and SMART errors in one of the 8TB Seagate drives (Disk 10 - sdi)

I did a full scan using SeaTools but it didn't report any errors! Also it passes the SMART extended self-test.

Could it be a connection issue also?

 

The diagnostics file is attached. 

Appreciate your support.

 

Thank you.

tower-diagnostics-20200706-0948.zip

Edited by HAMANY
Link to comment

This does not look too good:

Quote

  5 Reallocated_Sector_Ct   PO--CK   099   099   010    -    2352

While reallocated sectors are not necessarily a problem as long as the numbers stays constant anything other that a small number is often a good indication that the drive's health may be suspect.  You might want to run an extended SMART test on the drive to see how that goes.

  • Thanks 1
Link to comment
4 hours ago, itimpi said:

This does not look too good:

While reallocated sectors are not necessarily a problem as long as the numbers stays constant anything other that a small number is often a good indication that the drive's health may be suspect.  You might want to run an extended SMART test on the drive to see how that goes.

 

3 hours ago, johnnie.black said:

It passed the short test, you should run a long one, but if Seatools passed it should also pass, it did fail a long test before, so there were issues before.

 

Thank you both for your reponse.

I will re-run the extended SMART and SeaTools again just to make sure.

 

The drive is still under warranty, but I can't create an RMA without the SeaTools log showing the problem.

Link to comment

For ZCT1BN14 disk, a weird thing happen, SMART clear previous error. I haven't look deep in previous diagnostics. But you got trouble on 3 Seagate disk.

Many years ago, 3 of 4 ( same lot ) Seagate 3TB disk got problem less in 2 yrs, after that, I never buy Seagate again, mainly because other brand have good quality and low price source.

 

Error 12 [11] log entry is empty
Error 11 [10] log entry is empty
Error 10 [9] log entry is empty
Error 9 [8] log entry is empty
Error 8 [7] log entry is empty
Error 7 [6] log entry is empty
Error 6 [5] log entry is empty
Error 5 [4] log entry is empty

Edited by Benson
  • Thanks 1
Link to comment

Join the conversation

You can post now and register later. If you have an account, sign in now to post with your account.
Note: Your post will require moderator approval before it will be visible.

Guest
Reply to this topic...

×   Pasted as rich text.   Restore formatting

  Only 75 emoji are allowed.

×   Your link has been automatically embedded.   Display as a link instead

×   Your previous content has been restored.   Clear editor

×   You cannot paste images directly. Upload or insert images from URL.