Brand new HDD went into "Error State" during a Unraid Data-Rebuild


Recommended Posts

  • Replies 78
  • Created
  • Last Reply

Top Posters In This Topic

Top Posters In This Topic

Posted Images

Ah nevermind the moment I hit the last step you gave it started the data-rebuild automatically =D

I'm surprised it didn't require a Pre-clear first is that a 6.12 thing? In the previous version it required it.

Edited by Nanuk_
Link to comment

Ah! Thank you for clearing that up! Starting the Extended test now =D

Though my future goal is to retire this HBA and move the cache to two onboard nvmes since this HBA has bricked 3 SSDs and possibly a 10TB. But for some reason these 4 old 1TB are safe. That what I get for buying from the local amazon ripoff here in the Philippines. I've learned my lesson. 

Though I plan a replacement in the future from "The Art of the Server" and just ship it here. At least it'll be form a trusted brand.

 

image.png.d48dd4e31aa0b3140f4f230232f00700.png

Edited by Nanuk_
Link to comment

Yes, replaced I the cable. I've actually replaced them 3 times. But still the crc errors persist so I'm gonna assume it's the HBA which I plan to eventual replace. But since the only reputable HBA vendor I know is in the US and I'm in the PH I'll have to be patient and buy 2 NVMes as a replacement cache when I can afford it and retire this HBA.

Also the HDDs for the cache are really old 1TBs, only used them because the SDDs got bricked. They're pretty much on their last legs.

Edited by Nanuk_
Link to comment
  • 3 weeks later...

Yes, I'm trying to follow the steps your all taught me
1.) first I replaced the SATA cable. (DONE)
2.) Afterwards I disabled spindown, (DONE)
3.) placed it into maintenance mode and (DONE)
4.) ran and extended smart test. (DONE)
5.) Then I ran an xfs repair (DONE)
6.) and now I'm currently rebuilding now. (IN PROGRESS)

 

trojancarabao-smart-20230814-1328.ziptrojancarabao-diagnostics-20230814-1719.zip

Edited by Nanuk_
Link to comment
  • 1 month later...

Hi so I bought 2 new HDDs I pre-cleared both, but and after a day of use one of them got an error. I have a feeling it might be because that drive is hooked up to my HBA which I think might be faulty. I set the server to maintenance mode and I'm running an extended test. I'll run XFS Repair after and send both the test and diag here when I'm done.


image.png.871ff277cac9430845b62e2c46305123.png
image.thumb.png.b895076df9941e84730ed200077bd68d.png

Link to comment

ATA Error Count: 44315 (device log contains only the most recent five errors)
    CR = Command Register [HEX]
    FR = Features Register [HEX]
    SC = Sector Count Register [HEX]
    SN = Sector Number Register [HEX]
    CL = Cylinder Low Register [HEX]
    CH = Cylinder High Register [HEX]
    DH = Device/Head Register [HEX]
    DC = Device Command Register [HEX]
    ER = Error register [HEX]
    ST = Status register [HEX]
Powered_Up_Time is measured from power on, and printed as
DDd+hh:mm:SS.sss where DD=days, hh=hours, mm=minutes,
SS=sec, and sss=millisec. It "wraps" after 49.710 days.

Error 44315 occurred at disk power-on lifetime: 30 hours (1 days + 6 hours)
  When the command that caused the error occurred, the device was active or idle.

  After command completion occurred, registers were:
  ER ST SC SN CL CH DH
  -- -- -- -- -- -- --
  84 41 00 00 00 00 00  Error: ICRC, ABRT at LBA = 0x00000000 = 0

  Commands leading to the command that caused the error were:
  CR FR SC SN CL CH DH DC   Powered_Up_Time  Command/Feature_Name
  -- -- -- -- -- -- -- --  ----------------  --------------------
  60 e0 08 a8 bc b9 40 00   1d+06:57:46.041  READ FPDMA QUEUED
  2f 00 01 10 00 00 00 00   1d+06:57:46.041  READ LOG EXT
  60 00 50 b8 c9 b9 40 00   1d+06:57:45.929  READ FPDMA QUEUED
  60 40 70 78 c7 b9 40 00   1d+06:57:45.928  READ FPDMA QUEUED
  60 18 68 60 c7 b9 40 00   1d+06:57:45.928  READ FPDMA QUEUED

Error 44314 occurred at disk power-on lifetime: 30 hours (1 days + 6 hours)
  When the command that caused the error occurred, the device was active or idle.

  After command completion occurred, registers were:
  ER ST SC SN CL CH DH
  -- -- -- -- -- -- --
  84 41 00 00 00 00 00  Error: ICRC, ABRT at LBA = 0x00000000 = 0

  Commands leading to the command that caused the error were:
  CR FR SC SN CL CH DH DC   Powered_Up_Time  Command/Feature_Name
  -- -- -- -- -- -- -- --  ----------------  --------------------
  60 80 18 c8 bb b9 40 00   1d+06:57:45.898  READ FPDMA QUEUED
  2f 00 01 10 00 00 00 00   1d+06:57:45.898  READ LOG EXT
  61 00 a0 50 ae b9 40 00   1d+06:57:45.892  WRITE FPDMA QUEUED
  61 c0 00 90 ad b9 40 00   1d+06:57:45.891  WRITE FPDMA QUEUED
  60 c0 a0 90 ad b9 40 00   1d+06:57:45.883  READ FPDMA QUEUED

Error 44313 occurred at disk power-on lifetime: 30 hours (1 days + 6 hours)
  When the command that caused the error occurred, the device was active or idle.

  After command completion occurred, registers were:
  ER ST SC SN CL CH DH
  -- -- -- -- -- -- --
  84 41 00 00 00 00 00  Error: ICRC, ABRT at LBA = 0x00000000 = 0

  Commands leading to the command that caused the error were:
  CR FR SC SN CL CH DH DC   Powered_Up_Time  Command/Feature_Name
  -- -- -- -- -- -- -- --  ----------------  --------------------
  60 c0 38 90 ad b9 40 00   1d+06:57:45.882  READ FPDMA QUEUED
  2f 00 01 10 00 00 00 00   1d+06:57:45.882  READ LOG EXT
  60 c0 b0 d0 bd b9 40 00   1d+06:57:45.865  READ FPDMA QUEUED
  60 48 a8 88 bd b9 40 00   1d+06:57:45.864  READ FPDMA QUEUED
  60 e0 b8 a8 bc b9 40 00   1d+06:57:45.862  READ FPDMA QUEUED

Error 44312 occurred at disk power-on lifetime: 30 hours (1 days + 6 hours)
  When the command that caused the error occurred, the device was active or idle.

  After command completion occurred, registers were:
  ER ST SC SN CL CH DH
  -- -- -- -- -- -- --
  84 41 00 00 00 00 00  Error: ICRC, ABRT at LBA = 0x00000000 = 0

  Commands leading to the command that caused the error were:
  CR FR SC SN CL CH DH DC   Powered_Up_Time  Command/Feature_Name
  -- -- -- -- -- -- -- --  ----------------  --------------------
  60 28 68 c0 ac b9 40 00   1d+06:57:45.856  READ FPDMA QUEUED
  2f 00 01 10 00 00 00 00   1d+06:57:45.856  READ LOG EXT
  61 38 78 a0 ab b9 40 00   1d+06:57:45.847  WRITE FPDMA QUEUED
  60 08 c8 c0 bb b9 40 00   1d+06:57:45.841  READ FPDMA QUEUED
  60 08 c8 98 4c 45 40 00   1d+06:57:45.841  READ FPDMA QUEUED

Error 44311 occurred at disk power-on lifetime: 30 hours (1 days + 6 hours)
  When the command that caused the error occurred, the device was active or idle.

  After command completion occurred, registers were:
  ER ST SC SN CL CH DH
  -- -- -- -- -- -- --
  84 41 00 00 00 00 00  Error: ICRC, ABRT at LBA = 0x00000000 = 0

  Commands leading to the command that caused the error were:
  CR FR SC SN CL CH DH DC   Powered_Up_Time  Command/Feature_Name
  -- -- -- -- -- -- -- --  ----------------  --------------------
  60 38 60 a0 ab b9 40 00   1d+06:57:45.789  READ FPDMA QUEUED
  2f 00 01 10 00 00 00 00   1d+06:57:45.789  READ LOG EXT
  60 08 10 68 b2 b9 40 00   1d+06:57:45.778  READ FPDMA QUEUED
  60 08 10 b0 46 45 40 00   1d+06:57:45.778  READ FPDMA QUEUED
  60 08 10 a0 45 45 40 00   1d+06:57:45.778  READ FPDMA QUEUED
 

Link to comment

Join the conversation

You can post now and register later. If you have an account, sign in now to post with your account.
Note: Your post will require moderator approval before it will be visible.

Guest
Reply to this topic...

×   Pasted as rich text.   Restore formatting

  Only 75 emoji are allowed.

×   Your link has been automatically embedded.   Display as a link instead

×   Your previous content has been restored.   Clear editor

×   You cannot paste images directly. Upload or insert images from URL.