Is my drive dead?


After experiencing very slow performance yesterday, I checked the system log and saw a lot of read errors on my parity disk.

 

Mar  1 08:53:59 LAI_SERVER kernel: md: disk0 read error (Errors)
Mar  1 08:53:59 LAI_SERVER kernel: handle_stripe read error: 3585631368/0, count: 1 (Errors)
Mar  1 08:53:59 LAI_SERVER kernel: md: disk0 read error (Errors)
Mar  1 08:53:59 LAI_SERVER kernel: handle_stripe read error: 3585631376/0, count: 1 (Errors)

(... and so on, repeated many more times)

 

Here is the SMART report for that disk. Is the drive dead?

 

SMART Attributes Data Structure revision number: 16
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
  1 Raw_Read_Error_Rate     0x002f   183   183   051    Pre-fail  Always       -       26831
  3 Spin_Up_Time            0x0027   167   164   021    Pre-fail  Always       -       6650
  4 Start_Stop_Count        0x0032   098   098   000    Old_age   Always       -       2082
  5 Reallocated_Sector_Ct   0x0033   200   200   140    Pre-fail  Always       -       0
  7 Seek_Error_Rate         0x002e   200   200   000    Old_age   Always       -       0
  9 Power_On_Hours          0x0032   083   083   000    Old_age   Always       -       12618
 10 Spin_Retry_Count        0x0032   100   100   000    Old_age   Always       -       0
 11 Calibration_Retry_Count 0x0032   100   100   000    Old_age   Always       -       0
 12 Power_Cycle_Count       0x0032   100   100   000    Old_age   Always       -       450
192 Power-Off_Retract_Count 0x0032   200   200   000    Old_age   Always       -       11
193 Load_Cycle_Count        0x0032   190   190   000    Old_age   Always       -       31616
194 Temperature_Celsius     0x0022   126   095   000    Old_age   Always       -       24
196 Reallocated_Event_Count 0x0032   200   200   000    Old_age   Always       -       0
197 Current_Pending_Sector  0x0032   196   196   000    Old_age   Always       -       1416
198 Offline_Uncorrectable   0x0030   200   199   000    Old_age   Offline      -       39
199 UDMA_CRC_Error_Count    0x0032   200   200   000    Old_age   Always       -       0
200 Multi_Zone_Error_Rate   0x0008   144   001   000    Old_age   Offline      -       15170

SMART Error Log Version: 1
ATA Error Count: 104 (device log contains only the most recent five errors)
CR = Command Register [HEX]
FR = Features Register [HEX]
SC = Sector Count Register [HEX]
SN = Sector Number Register [HEX]
CL = Cylinder Low Register [HEX]
CH = Cylinder High Register [HEX]
DH = Device/Head Register [HEX]
DC = Device Command Register [HEX]
ER = Error register [HEX]
ST = Status register [HEX]
Powered_Up_Time is measured from power on, and printed as
DDd+hh:mm:SS.sss where DD=days, hh=hours, mm=minutes,
SS=sec, and sss=millisec. It "wraps" after 49.710 days.

Error 104 occurred at disk power-on lifetime: 12610 hours (525 days + 10 hours)
  When the command that caused the error occurred, the device was active or idle.

  After command completion occurred, registers were:
  ER ST SC SN CL CH DH
  -- -- -- -- -- -- --
  40 51 00 90 37 ad ee  Error: UNC at LBA = 0x0ead3790 = 246232976

  Commands leading to the command that caused the error were:
  CR FR SC SN CL CH DH DC   Powered_Up_Time  Command/Feature_Name
  -- -- -- -- -- -- -- --  ----------------  --------------------
  c8 00 00 d0 36 ad ee 08  14d+10:56:11.817  READ DMA
  c8 00 00 d0 2d ad ee 08  14d+10:56:11.603  READ DMA

Error 103 occurred at disk power-on lifetime: 12610 hours (525 days + 10 hours)
  When the command that caused the error occurred, the device was active or idle.

  After command completion occurred, registers were:
  ER ST SC SN CL CH DH
  -- -- -- -- -- -- --
  40 51 00 d0 20 ad ee  Error: UNC at LBA = 0x0ead20d0 = 246227152

  Commands leading to the command that caused the error were:
  CR FR SC SN CL CH DH DC   Powered_Up_Time  Command/Feature_Name
  -- -- -- -- -- -- -- --  ----------------  --------------------
  c8 00 00 d0 20 ad ee 08  14d+10:56:07.923  READ DMA
  ef 10 02 00 00 00 a0 08  14d+10:56:07.923  SET FEATURES [Reserved for Serial ATA]
  ec 00 00 00 00 00 a0 08  14d+10:56:07.919  IDENTIFY DEVICE
  ef 03 46 00 00 00 a0 08  14d+10:56:07.919  SET FEATURES [set transfer mode]

Error 102 occurred at disk power-on lifetime: 12610 hours (525 days + 10 hours)
  When the command that caused the error occurred, the device was active or idle.

  After command completion occurred, registers were:
  ER ST SC SN CL CH DH
  -- -- -- -- -- -- --
  40 51 00 d0 20 ad ee  Error: UNC at LBA = 0x0ead20d0 = 246227152

  Commands leading to the command that caused the error were:
  CR FR SC SN CL CH DH DC   Powered_Up_Time  Command/Feature_Name
  -- -- -- -- -- -- -- --  ----------------  --------------------
  c8 00 00 d0 20 ad ee 08  14d+10:56:05.166  READ DMA
  ef 10 02 00 00 00 a0 08  14d+10:56:05.166  SET FEATURES [Reserved for Serial ATA]
  ec 00 00 00 00 00 a0 08  14d+10:56:05.162  IDENTIFY DEVICE
  ef 03 46 00 00 00 a0 08  14d+10:56:05.162  SET FEATURES [set transfer mode]

Error 101 occurred at disk power-on lifetime: 12610 hours (525 days + 10 hours)
  When the command that caused the error occurred, the device was active or idle.

  After command completion occurred, registers were:
  ER ST SC SN CL CH DH
  -- -- -- -- -- -- --
  40 51 00 d0 20 ad ee  Error: UNC at LBA = 0x0ead20d0 = 246227152

  Commands leading to the command that caused the error were:
  CR FR SC SN CL CH DH DC   Powered_Up_Time  Command/Feature_Name
  -- -- -- -- -- -- -- --  ----------------  --------------------
  c8 00 00 d0 20 ad ee 08  14d+10:56:02.409  READ DMA
  ef 10 02 00 00 00 a0 08  14d+10:56:02.409  SET FEATURES [Reserved for Serial ATA]
  ec 00 00 00 00 00 a0 08  14d+10:56:02.405  IDENTIFY DEVICE
  ef 03 46 00 00 00 a0 08  14d+10:56:02.405  SET FEATURES [set transfer mode]

Error 100 occurred at disk power-on lifetime: 12610 hours (525 days + 10 hours)
  When the command that caused the error occurred, the device was active or idle.

  After command completion occurred, registers were:
  ER ST SC SN CL CH DH
  -- -- -- -- -- -- --
  40 51 00 d0 20 ad ee  Error: UNC at LBA = 0x0ead20d0 = 246227152

  Commands leading to the command that caused the error were:
  CR FR SC SN CL CH DH DC   Powered_Up_Time  Command/Feature_Name
  -- -- -- -- -- -- -- --  ----------------  --------------------
  c8 00 00 d0 20 ad ee 08  14d+10:55:59.652  READ DMA
  ef 10 02 00 00 00 a0 08  14d+10:55:59.652  SET FEATURES [Reserved for Serial ATA]
  ec 00 00 00 00 00 a0 08  14d+10:55:59.648  IDENTIFY DEVICE
  ef 03 46 00 00 00 a0 08  14d+10:55:59.648  SET FEATURES [set transfer mode]
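
For reference, the LBA printed in each error entry can be rebuilt from the register bytes shown with it. This is a minimal sketch assuming 28-bit LBA addressing, which is what the READ DMA (c8) command uses: the low nibble of DH supplies the top four bits, followed by CH, CL and SN. The function name is made up for illustration.

```python
def lba28_from_registers(sn, cl, ch, dh):
    """Combine ATA task-file registers into a 28-bit LBA (assumes LBA28 addressing)."""
    # Only the low nibble of the Device/Head register carries LBA bits 27..24.
    return ((dh & 0x0F) << 24) | (ch << 16) | (cl << 8) | sn

# Register bytes copied from Error 104 and Error 103 in the log above.
error_104 = lba28_from_registers(sn=0x90, cl=0x37, ch=0xAD, dh=0xEE)
error_103 = lba28_from_registers(sn=0xD0, cl=0x20, ch=0xAD, dh=0xEE)

print(hex(error_104), error_104)  # 0xead3790 246232976
print(hex(error_103), error_103)  # 0xead20d0 246227152
```

Both results match the "UNC at LBA" values smartctl reports, so the failing reads are clustered in the same region of the disk.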

SMART Self-test log structure revision number 1
Num  Test_Description    Status                  Remaining  LifeTime(hours)  LBA_of_first_error
# 1  Short offline       Completed without error       00%      5025         -
# 2  Extended offline    Aborted by host               90%      5005         -
# 3  Extended offline    Aborted by host               20%      4958         -
# 4  Extended offline    Aborted by host               90%      4954         -
# 5  Extended offline    Aborted by host               90%      4954         -
# 6  Short offline       Completed without error       00%      4942         -
# 7  Short offline       Completed without error       00%      4831         -
# 8  Short offline       Completed without error       00%      4830         -
# 9  Short offline       Aborted by host               10%      4825         -
#10  Short offline       Aborted by host               80%      4824         -
#11  Short offline       Aborted by host               10%      4824         -
#12  Short offline       Completed without error       00%      3237         -
#13  Extended offline    Completed without error       00%        34         -
#14  Short offline       Completed without error       00%         0         -

SMART Selective self-test log data structure revision number 1
SPAN  MIN_LBA  MAX_LBA  CURRENT_TEST_STATUS
    1        0        0  Not_testing
    2        0        0  Not_testing
    3        0        0  Not_testing
    4        0        0  Not_testing
    5        0        0  Not_testing
Selective self-test flags (0x0):
  After scanning selected spans, do NOT read-scan remainder of disk.
If Selective self-test is pending on power-up, resume after 0 minute delay.
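
To make the attribute table easier to scan, here is a rough sketch that parses rows in the format pasted above and lists the sector-health attributes whose raw value is nonzero. The watch list (IDs 5, 197 and 198) and the function names are my own assumptions for illustration, not an official failure threshold, and the sample rows are copied from the report.

```python
import re

# Assumed watch list: attributes where a nonzero raw value is usually worth a look.
WATCH_IDS = {5, 197, 198}

# Columns: ID, NAME, FLAG, VALUE, WORST, THRESH, TYPE, UPDATED, WHEN_FAILED, RAW_VALUE
ROW = re.compile(
    r"^\s*(\d+)\s+(\S+)\s+0x[0-9a-fA-F]{4}\s+(\d+)\s+(\d+)\s+(\d+)"
    r"\s+\S+\s+\S+\s+\S+\s+(\d+)"
)

def parse_attributes(report):
    """Return {id: {"name", "value", "worst", "thresh", "raw"}} for each table row."""
    attrs = {}
    for line in report.splitlines():
        m = ROW.match(line)
        if m:
            attrs[int(m.group(1))] = {
                "name": m.group(2),
                "value": int(m.group(3)),
                "worst": int(m.group(4)),
                "thresh": int(m.group(5)),
                "raw": int(m.group(6)),
            }
    return attrs

def flag_attributes(attrs):
    """List watched attributes whose raw value is greater than zero."""
    return [
        f"{a['name']} raw={a['raw']}"
        for attr_id, a in sorted(attrs.items())
        if attr_id in WATCH_IDS and a["raw"] > 0
    ]

SAMPLE = """\
  5 Reallocated_Sector_Ct   0x0033   200   200   140    Pre-fail  Always       -       0
197 Current_Pending_Sector  0x0032   196   196   000    Old_age   Always       -       1416
198 Offline_Uncorrectable   0x0030   200   199   000    Old_age   Offline      -       39
"""

for warning in flag_attributes(parse_attributes(SAMPLE)):
    print(warning)
```

On the rows above this prints Current_Pending_Sector raw=1416 and Offline_Uncorrectable raw=39, while Reallocated_Sector_Ct stays quiet because its raw value is 0.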
