UnRAID Parity Drive Errors in Main Menu error column



Hi Everyone,

 

My monthly parity check has reported errors for the first time in 4 or 5 years. The Main page in unRAID 5.05 shows 838751 errors in the error column for the parity disk. I have pasted the log file to pastebin here: https://pastebin.com/tyaxrkME

 

Does anyone know what I should be doing now?  The server has basically been set-and-forget for the last 4 or 5 years, so I am totally out of the loop on what to do.

 

Thanks for the help.

 

Q


Hi,

 

Okay, here is the SMART report for the parity drive.

 

Thanks,

Q

 

Statistics for /dev/sda Hitachi_HDS723020BLA642_MN1221F300XA6A

smartctl -a -d ata /dev/sda
smartctl 6.2 2013-07-26 r3841 [i686-linux-3.9.11p-unRAID] (local build)
Copyright (C) 2002-13, Bruce Allen, Christian Franke, www.smartmontools.org

=== START OF INFORMATION SECTION ===
Model Family:     Hitachi Deskstar 7K3000
Device Model:     Hitachi HDS723020BLA642
Serial Number:    MN1221F300XA6A
LU WWN Device Id: 5 000cca 369c06a58
Firmware Version: MN6OA180
User Capacity:    2,000,398,934,016 bytes [2.00 TB]
Sector Size:      512 bytes logical/physical
Rotation Rate:    7200 rpm
Device is:        In smartctl database [for details use: -P show]
ATA Version is:   ATA8-ACS T13/1699-D revision 4
SATA Version is:  SATA 2.6, 6.0 Gb/s (current: 1.5 Gb/s)
Local Time is:    Sun Mar 18 14:57:27 2018 MDT
SMART support is: Available - device has SMART capability.
SMART support is: Enabled

=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: FAILED!
Drive failure expected in less than 24 hours. SAVE ALL DATA.
See vendor-specific Attribute list for failed Attributes.

General SMART Values:
Offline data collection status:  (0x85)	Offline data collection activity
					was aborted by an interrupting command from host.
					Auto Offline Data Collection: Enabled.
Self-test execution status:      (   0)	The previous self-test routine completed
					without error or no self-test has ever 
					been run.
Total time to complete Offline 
data collection: 		(19523) seconds.
Offline data collection
capabilities: 			 (0x5b) SMART execute Offline immediate.
					Auto Offline data collection on/off support.
					Suspend Offline collection upon new
					command.
					Offline surface scan supported.
					Self-test supported.
					No Conveyance Self-test supported.
					Selective Self-test supported.
SMART capabilities:            (0x0003)	Saves SMART data before entering
					power-saving mode.
					Supports SMART auto save timer.
Error logging capability:        (0x01)	Error logging supported.
					General Purpose Logging supported.
Short self-test routine 
recommended polling time: 	 (   1) minutes.
Extended self-test routine
recommended polling time: 	 ( 326) minutes.
SCT capabilities: 	       (0x003d)	SCT Status supported.
					SCT Error Recovery Control supported.
					SCT Feature Control supported.
					SCT Data Table supported.

SMART Attributes Data Structure revision number: 16
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
  1 Raw_Read_Error_Rate     0x000b   100   100   016    Pre-fail  Always       -       65536
  2 Throughput_Performance  0x0005   135   135   054    Pre-fail  Offline      -       85
  3 Spin_Up_Time            0x0007   130   130   024    Pre-fail  Always       -       439 (Average 440)
  4 Start_Stop_Count        0x0012   100   100   000    Old_age   Always       -       1253
  5 Reallocated_Sector_Ct   0x0033   001   001   005    Pre-fail  Always   FAILING_NOW 1905
  7 Seek_Error_Rate         0x000b   100   100   067    Pre-fail  Always       -       0
  8 Seek_Time_Performance   0x0005   133   133   020    Pre-fail  Offline      -       27
  9 Power_On_Hours          0x0012   092   092   000    Old_age   Always       -       59485
 10 Spin_Retry_Count        0x0013   100   100   060    Pre-fail  Always       -       0
 12 Power_Cycle_Count       0x0032   100   100   000    Old_age   Always       -       35
192 Power-Off_Retract_Count 0x0032   099   099   000    Old_age   Always       -       1557
193 Load_Cycle_Count        0x0012   099   099   000    Old_age   Always       -       1557
194 Temperature_Celsius     0x0002   253   253   000    Old_age   Always       -       22 (Min/Max 13/43)
196 Reallocated_Event_Count 0x0032   001   001   000    Old_age   Always       -       2832
197 Current_Pending_Sector  0x0022   100   100   000    Old_age   Always       -       182
198 Offline_Uncorrectable   0x0008   100   100   000    Old_age   Offline      -       0
199 UDMA_CRC_Error_Count    0x000a   200   200   000    Old_age   Always       -       0

SMART Error Log Version: 1
ATA Error Count: 7055 (device log contains only the most recent five errors)
	CR = Command Register [HEX]
	FR = Features Register [HEX]
	SC = Sector Count Register [HEX]
	SN = Sector Number Register [HEX]
	CL = Cylinder Low Register [HEX]
	CH = Cylinder High Register [HEX]
	DH = Device/Head Register [HEX]
	DC = Device Command Register [HEX]
	ER = Error register [HEX]
	ST = Status register [HEX]
Powered_Up_Time is measured from power on, and printed as
DDd+hh:mm:SS.sss where DD=days, hh=hours, mm=minutes,
SS=sec, and sss=millisec. It "wraps" after 49.710 days.

Error 7055 occurred at disk power-on lifetime: 59092 hours (2462 days + 4 hours)
  When the command that caused the error occurred, the device was active or idle.

  After command completion occurred, registers were:
  ER ST SC SN CL CH DH
  -- -- -- -- -- -- --
  40 51 fa 35 cb 39 0d  Error: UNC 250 sectors at LBA = 0x0d39cb35 = 221891381

  Commands leading to the command that caused the error were:
  CR FR SC SN CL CH DH DC   Powered_Up_Time  Command/Feature_Name
  -- -- -- -- -- -- -- --  ----------------  --------------------
  25 00 00 2f cb 39 e0 00      20:42:36.485  READ DMA EXT
  ef 10 02 00 00 00 a0 00      20:42:36.485  SET FEATURES [Enable SATA feature]
  27 00 00 00 00 00 e0 00      20:42:36.485  READ NATIVE MAX ADDRESS EXT [OBS-ACS-3]
  ec 00 00 00 00 00 a0 00      20:42:36.483  IDENTIFY DEVICE
  ef 03 42 00 00 00 a0 00      20:42:36.483  SET FEATURES [Set transfer mode]

Error 7054 occurred at disk power-on lifetime: 59089 hours (2462 days + 1 hours)
  When the command that caused the error occurred, the device was active or idle.

  After command completion occurred, registers were:
  ER ST SC SN CL CH DH
  -- -- -- -- -- -- --
  40 51 cb a4 ee c9 0c  Error: UNC 203 sectors at LBA = 0x0cc9eea4 = 214560420

  Commands leading to the command that caused the error were:
  CR FR SC SN CL CH DH DC   Powered_Up_Time  Command/Feature_Name
  -- -- -- -- -- -- -- --  ----------------  --------------------
  25 00 00 6f ee c9 e0 00      18:26:54.743  READ DMA EXT
  25 00 00 6f ea c9 e0 00      18:26:39.236  READ DMA EXT
  25 00 00 6f e6 c9 e0 00      18:26:26.463  READ DMA EXT
  25 00 00 6f e2 c9 e0 00      18:26:11.644  READ DMA EXT
  ef 10 02 00 00 00 a0 00      18:26:11.644  SET FEATURES [Enable SATA feature]

Error 7053 occurred at disk power-on lifetime: 59086 hours (2461 days + 22 hours)
  When the command that caused the error occurred, the device was active or idle.

  After command completion occurred, registers were:
  ER ST SC SN CL CH DH
  -- -- -- -- -- -- --
  40 51 4d 72 6a 28 0b  Error: UNC 77 sectors at LBA = 0x0b286a72 = 187198066

  Commands leading to the command that caused the error were:
  CR FR SC SN CL CH DH DC   Powered_Up_Time  Command/Feature_Name
  -- -- -- -- -- -- -- --  ----------------  --------------------
  25 00 50 6f 6a 28 e0 00      14:40:16.903  READ DMA EXT
  35 00 a0 bf 64 28 e0 00      14:40:16.893  WRITE DMA EXT
  e5 00 00 00 00 00 40 00      14:40:16.893  CHECK POWER MODE
  35 00 d8 e7 60 28 e0 00      14:40:16.874  WRITE DMA EXT
  ef 10 02 00 00 00 a0 00      14:40:16.874  SET FEATURES [Enable SATA feature]

Error 7052 occurred at disk power-on lifetime: 59086 hours (2461 days + 22 hours)
  When the command that caused the error occurred, the device was active or idle.

  After command completion occurred, registers were:
  ER ST SC SN CL CH DH
  -- -- -- -- -- -- --
  40 51 08 67 6a 28 0b  Error: UNC 8 sectors at LBA = 0x0b286a67 = 187198055

  Commands leading to the command that caused the error were:
  CR FR SC SN CL CH DH DC   Powered_Up_Time  Command/Feature_Name
  -- -- -- -- -- -- -- --  ----------------  --------------------
  25 00 10 5f 6a 28 e0 00      14:39:55.857  READ DMA EXT
  ef 10 02 00 00 00 a0 00      14:39:55.857  SET FEATURES [Enable SATA feature]
  27 00 00 00 00 00 e0 00      14:39:55.857  READ NATIVE MAX ADDRESS EXT [OBS-ACS-3]
  ec 00 00 00 00 00 a0 00      14:39:55.855  IDENTIFY DEVICE
  ef 03 42 00 00 00 a0 00      14:39:55.854  SET FEATURES [Set transfer mode]

Error 7051 occurred at disk power-on lifetime: 59086 hours (2461 days + 22 hours)
  When the command that caused the error occurred, the device was active or idle.

  After command completion occurred, registers were:
  ER ST SC SN CL CH DH
  -- -- -- -- -- -- --
  40 51 d6 89 66 28 0b  Error: UNC 214 sectors at LBA = 0x0b286689 = 187197065

  Commands leading to the command that caused the error were:
  CR FR SC SN CL CH DH DC   Powered_Up_Time  Command/Feature_Name
  -- -- -- -- -- -- -- --  ----------------  --------------------
  25 00 00 5f 66 28 e0 00      14:39:29.628  READ DMA EXT
  ef 10 02 00 00 00 a0 00      14:39:29.628  SET FEATURES [Enable SATA feature]
  27 00 00 00 00 00 e0 00      14:39:29.628  READ NATIVE MAX ADDRESS EXT [OBS-ACS-3]
  ec 00 00 00 00 00 a0 00      14:39:29.626  IDENTIFY DEVICE
  ef 03 42 00 00 00 a0 00      14:39:29.625  SET FEATURES [Set transfer mode]

SMART Self-test log structure revision number 1
Num  Test_Description    Status                  Remaining  LifeTime(hours)  LBA_of_first_error
# 1  Short offline       Completed without error       00%     59485         -

SMART Selective self-test log data structure revision number 1
 SPAN  MIN_LBA  MAX_LBA  CURRENT_TEST_STATUS
    1        0        0  Not_testing
    2        0        0  Not_testing
    3        0        0  Not_testing
    4        0        0  Not_testing
    5        0        0  Not_testing
Selective self-test flags (0x0):
  After scanning selected spans, do NOT read-scan remainder of disk.
If Selective self-test is pending on power-up, resume after 0 minute delay.
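
For anyone trying to read a report like the one above, here is a rough Python sketch of two checks. It assumes the attribute table keeps the column layout shown in this smartctl 6.2 output, and the names `flag_attributes` and `lba_from_registers` are made up for illustration; they are not part of smartmontools or unRAID. The first function flags any attribute whose WHEN_FAILED column is set, plus a non-zero Current_Pending_Sector count. The second rebuilds the 28-bit LBA from the SN/CL/CH/DH registers in the error log entries.

```python
import re

# Three rows copied from the attribute table above; paste the whole table to check everything.
SMART_ROWS = """\
  5 Reallocated_Sector_Ct   0x0033   001   001   005    Pre-fail  Always   FAILING_NOW 1905
197 Current_Pending_Sector  0x0022   100   100   000    Old_age   Always       -       182
199 UDMA_CRC_Error_Count    0x000a   200   200   000    Old_age   Always       -       0
"""

# Assumed layout: ID NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE
ROW = re.compile(
    r"^\s*(?P<id>\d+)\s+(?P<name>\S+)\s+0x[0-9a-fA-F]+\s+"
    r"(?P<value>\d+)\s+(?P<worst>\d+)\s+(?P<thresh>\d+)\s+"
    r"\S+\s+\S+\s+(?P<when_failed>\S+)\s+(?P<raw>\d+)"
)


def flag_attributes(text):
    """Return (name, reason) pairs for attributes that look unhealthy."""
    flagged = []
    for line in text.splitlines():
        m = ROW.match(line)
        if not m:
            continue  # skip headers and anything that is not an attribute row
        name = m["name"]
        value, thresh, raw = int(m["value"]), int(m["thresh"]), int(m["raw"])
        if m["when_failed"] != "-":
            flagged.append((name, f"{m['when_failed']} (value {value}, threshold {thresh})"))
        elif name == "Current_Pending_Sector" and raw > 0:
            flagged.append((name, f"{raw} sectors pending reallocation"))
    return flagged


def lba_from_registers(sn, cl, ch, dh):
    """Rebuild the 28-bit LBA from the SN, CL, CH and DH register bytes."""
    return ((dh & 0x0F) << 24) | (ch << 16) | (cl << 8) | sn


for name, reason in flag_attributes(SMART_ROWS):
    print(f"{name}: {reason}")

# Registers from error 7055 above: SN=35 CL=cb CH=39 DH=0d
print(lba_from_registers(0x35, 0xCB, 0x39, 0x0D))
```

On the rows above this flags Reallocated_Sector_Ct (normalized value 1, below its threshold of 5) and the 182 pending sectors, and the register bytes from error 7055 decode to the same LBA that smartctl prints on that line.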
