Help ! - High UDMA CRC error count ( UDMA_CRC_Error_Count ) - 106411915


Recommended Posts

Hi Guys,

I have this one HDD that shows particularly high "UDMA CRC error count" - 106411915 and still passes SMART Extended Test.

CA Fix Common Problems Plugin also showed notification once when I plugged the drive first time into the server saying "High UDMA CRC error count".

 

Should I keep the drive for a rainy day or should I chuck it into the Trash Can without doubt?

 

I swapped it for SSD in my old laptop thus the 8979 Power on hours.

Currently the drive is an Unassigned empty spare just in case I need it. Attaching Screenshot and smartctl data below for reference.

 

UDMA.thumb.jpg.a84ca8d9ac6b24c69de82751d876a2f9.jpg

smartctl 7.0 2018-12-30 r4883 [x86_64-linux-4.19.56-Unraid] (local build)
Copyright (C) 2002-18, Bruce Allen, Christian Franke, www.smartmontools.org

=== START OF INFORMATION SECTION ===
Model Family:     Toshiba 2.5" HDD MQ01ABD...
Device Model:     TOSHIBA MQ01ABD100
Serial Number:    XXXXXXXXX
LU WWN Device Id: XXXXXXXXXXXXXXXXXXXXXX
Firmware Version: AX0P2D
User Capacity:    1,000,204,886,016 bytes [1.00 TB]
Sector Sizes:     512 bytes logical, 4096 bytes physical
Rotation Rate:    5400 rpm
Form Factor:      2.5 inches
Device is:        In smartctl database [for details use: -P show]
ATA Version is:   ATA8-ACS (minor revision not indicated)
SATA Version is:  SATA 3.0, 6.0 Gb/s (current: 6.0 Gb/s)
Local Time is:    Fri Dec 13 05:20:16 2019 EST
SMART support is: Available - device has SMART capability.
SMART support is: Enabled

=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED

General SMART Values:
Offline data collection status:  (0x00) Offline data collection activity
                                        was never started.
                                        Auto Offline Data Collection: Disabled.
Self-test execution status:      (   0) The previous self-test routine completed
                                        without error or no self-test has ever 
                                        been run.
Total time to complete Offline 
data collection:                (  120) seconds.
Offline data collection
capabilities:                    (0x5b) SMART execute Offline immediate.
                                        Auto Offline data collection on/off support.
                                        Suspend Offline collection upon new
                                        command.
                                        Offline surface scan supported.
                                        Self-test supported.
                                        No Conveyance Self-test supported.
                                        Selective Self-test supported.
SMART capabilities:            (0x0003) Saves SMART data before entering
                                        power-saving mode.
                                        Supports SMART auto save timer.
Error logging capability:        (0x01) Error logging supported.
                                        General Purpose Logging supported.
Short self-test routine 
recommended polling time:        (   2) minutes.
Extended self-test routine
recommended polling time:        ( 236) minutes.
SCT capabilities:              (0x003d) SCT Status supported.
                                        SCT Error Recovery Control supported.
                                        SCT Feature Control supported.
                                        SCT Data Table supported.

SMART Attributes Data Structure revision number: 128
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
  1 Raw_Read_Error_Rate     0x000b   100   100   050    Pre-fail  Always       -       0
  3 Spin_Up_Time            0x0027   100   100   001    Pre-fail  Always       -       1728
  5 Reallocated_Sector_Ct   0x0033   100   100   050    Pre-fail  Always       -       0
  9 Power_On_Hours          0x0032   078   078   000    Old_age   Always       -       8978
 12 Power_Cycle_Count       0x0032   100   100   000    Old_age   Always       -       1258
191 G-Sense_Error_Rate      0x0032   100   100   000    Old_age   Always       -       339
192 Power-Off_Retract_Count 0x0032   100   100   000    Old_age   Always       -       209
193 Load_Cycle_Count        0x0032   080   080   000    Old_age   Always       -       200083
194 Temperature_Celsius     0x0022   100   100   000    Old_age   Always       -       28 (Min/Max 9/56)
199 UDMA_CRC_Error_Count    0x0032   100   100   000    Old_age   Always       -       106411915
200 Multi_Zone_Error_Rate   0x0032   100   100   000    Old_age   Always       -       277302786
240 Head_Flying_Hours       0x0032   087   087   000    Old_age   Always       -       5418
241 Total_LBAs_Written      0x0032   100   100   000    Old_age   Always       -       60050122750
242 Total_LBAs_Read         0x0032   100   100   000    Old_age   Always       -       53705701370
254 Free_Fall_Sensor        0x0032   100   100   000    Old_age   Always       -       150

SMART Error Log Version: 1
No Errors Logged

SMART Self-test log structure revision number 1
Num  Test_Description    Status                  Remaining  LifeTime(hours)  LBA_of_first_error
# 1  Extended offline    Completed without error       00%      8976         -
# 2  Short offline       Completed without error       00%      8642         -
# 3  Short offline       Completed without error       00%      8608         -
# 4  Short offline       Completed without error       00%      8518         -
# 5  Short offline       Completed without error       00%      8095         -
# 6  Short offline       Completed without error       00%      5752         -
# 7  Short offline       Interrupted (host reset)      30%      4497         -
# 8  Short offline       Completed without error       00%      4233         -
# 9  Short offline       Completed without error       00%      3504         -
#10  Short offline       Completed without error       00%      3406         -
#11  Short offline       Completed without error       00%      3257         -
#12  Short offline       Completed without error       00%      3257         -
#13  Short offline       Completed without error       00%      3026         -
#14  Short offline       Completed without error       00%      2795         -
#15  Short offline       Completed without error       00%      1744         -
#16  Short offline       Completed without error       00%      1363         -
#17  Short offline       Completed without error       00%      1144         -
#18  Short offline       Completed without error       00%       984         -
#19  Short offline       Completed without error       00%       612         -
#20  Short offline       Completed without error       00%       469         -
#21  Extended offline    Interrupted (host reset)      90%       258         -

SMART Selective self-test log data structure revision number 1
 SPAN  MIN_LBA  MAX_LBA  CURRENT_TEST_STATUS
    1        0        0  Not_testing
    2        0        0  Not_testing
    3        0        0  Not_testing
    4        0        0  Not_testing
    5        0        0  Not_testing
Selective self-test flags (0x0):
  After scanning selected spans, do NOT read-scan remainder of disk.
If Selective self-test is pending on power-up, resume after 0 minute delay.

 

Thanks in Advance!

 

-SS

Dell R720xd - 12 x 3.5" + 2 x 2.5" | Lenovo SA120 - 12 x 3.5"

2 x E5 - 2650 V2 | 128Gb DDR3 ECC RAM

IT MODE HP H220 LSI 9207-8i | IT MODE LSI 9207-8e | Dell PERC H710 RAID Controller

Data drives: 4 x 4Tb - WD Gold 3.5" | 2 x 4Tb - Seagate Ironwolf 3.5" | 6 x 8Tb - WD Red 3.5" | 2 x 12Tb - WD Red 3.5"

Parity drives: 2 x 12Tb - WD Red 3.5"

Cache: 1 x 1Tb SSD - Samsung 850 Pro 2.5"

Unassigned: 1 x 1Tb 2.5" - WD Blue 2.5"

Link to comment
1 minute ago, johnnie.black said:

UDMA_CRC errors are a connection problem, not a disk problem, usually a bad SATA cable, but note that attribute doesn't reset, so only if it keeps increasing there's a problem.

Hey johnnie.black,

 

Thanks for chiming in. I actually saw this same reason mentioned online when I was researching this.

I have re-seated the Drive a few times and it keeps on increasing with every smart test / preclear I do on the disk.

It is on my rear 2.5" Drive Slot of R720xd so I feel that is good as well.

 

Examples below. Maybe you can get some more clarity with this.

preclear_report_XXXXXXXXX_2019.09.28_06.39.23.txt
############################################################################################################################
#                                                                                                                          #
#                                        unRAID Server Preclear of disk XXXXXXXXX                                          #
#                                       Cycle 1 of 1, partition start on sector 64.                                        #
#                                                                                                                          #
#                                                                                                                          #
#   Step 1 of 5 - Post-Read verification:                                                   [3:07:27 @ 88 MB/s] SUCCESS    #
#   Step 2 of 5 - Verifying unRAID's Preclear signature:                                                        SUCCESS    #
#   Step 3 of 5 - Writing unRAID's Preclear signature:                                                          SUCCESS    #
#   Step 4 of 5 - Zeroing the disk:                                                         [3:07:42 @ 88 MB/s] SUCCESS    #
#   Step 5 of 5 - Pre-read verification:                                                    [3:06:47 @ 89 MB/s] SUCCESS    #
#                                                                                                                          #
#                                                                                                                          #
#                                                                                                                          #
#                                                                                                                          #
#                                                                                                                          #
#                                                                                                                          #
#                                                                                                                          #
############################################################################################################################
#                               Cycle elapsed time: 9:22:07 | Total elapsed time: 9:22:08                                  #
############################################################################################################################

############################################################################################################################
#                                                                                                                          #
#                                               S.M.A.R.T. Status default                                                  #
#                                                                                                                          #
#                                                                                                                          #
#   ATTRIBUTE                 INITIAL   CYCLE 1   STATUS                                                                   #
#   5-Reallocated_Sector_Ct   0         0         -                                                                        #
#   9-Power_On_Hours          8462      8471      Up 9                                                                     #
#   194-Temperature_Celsius   28        38        Up 10                                                                    #
#   199-UDMA_CRC_Error_Count  83943701  86420220  Up 2476519                                                               #
#                                                                                                                          #
#                                                                                                                          #
#                                                                                                                          #
#                                                                                                                          #
#                                                                                                                          #
#                                                                                                                          #
#                                                                                                                          #
#                                                                                                                          #
############################################################################################################################
#   SMART overall-health self-assessment test result: PASSED                                                               #
############################################################################################################################

--> ATTENTION: Please take a look into the SMART report above for drive health issues.

--> RESULT: Preclear Finished Successfully!.
preclear_report_XXXXXXXXX_2019.11.22_07.38.53.txt

############################################################################################################################
#                                                                                                                          #
#                                        unRAID Server Preclear of disk XXXXXXXXX                                          #
#                                       Cycle 1 of 1, partition start on sector 64.                                        #
#                                                                                                                          #
#                                                                                                                          #
#   Step 1 of 5 - Post-Read verification:                                                   [3:06:18 @ 89 MB/s] SUCCESS    #
#   Step 2 of 5 - Verifying unRAID's Preclear signature:                                                        SUCCESS    #
#   Step 3 of 5 - Writing unRAID's Preclear signature:                                                          SUCCESS    #
#   Step 4 of 5 - Zeroing the disk:                                                         [3:08:20 @ 88 MB/s] SUCCESS    #
#   Step 5 of 5 - Pre-read verification:                                                    [3:08:17 @ 88 MB/s] SUCCESS    #
#                                                                                                                          #
#                                                                                                                          #
#                                                                                                                          #
#                                                                                                                          #
#                                                                                                                          #
#                                                                                                                          #
#                                                                                                                          #
############################################################################################################################
#                               Cycle elapsed time: 9:23:11 | Total elapsed time: 9:23:13                                  #
############################################################################################################################

############################################################################################################################
#                                                                                                                          #
#                                               S.M.A.R.T. Status default                                                  #
#                                                                                                                          #
#                                                                                                                          #
#   ATTRIBUTE                 INITIAL   CYCLE 1   STATUS                                                                   #
#   5-Reallocated_Sector_Ct   0         0         -                                                                        #
#   9-Power_On_Hours          8694      8703      Up 9                                                                     #
#   194-Temperature_Celsius   30        34        Up 4                                                                     #
#   199-UDMA_CRC_Error_Count  87317299  89793763  Up 2476464                                                               #
#                                                                                                                          #
#                                                                                                                          #
#                                                                                                                          #
#                                                                                                                          #
#                                                                                                                          #
#                                                                                                                          #
#                                                                                                                          #
#                                                                                                                          #
############################################################################################################################
#   SMART overall-health self-assessment test result: PASSED                                                               #
############################################################################################################################

--> ATTENTION: Please take a look into the SMART report above for drive health issues.

--> RESULT: Preclear Finished Successfully!.
preclear_report_XXXXXXXXX_2019.11.23_19.28.30.txt

############################################################################################################################
#                                                                                                                          #
#                                        unRAID Server Preclear of disk XXXXXXXXX                                          #
#                                        Cycle 2 of 2, partition start on sector 64.                                       #
#                                                                                                                          #
#                                                                                                                          #
#   Step 1 of 6 - Post-Read verification:                                                   [3:04:53 @ 90 MB/s] SUCCESS    #
#   Step 2 of 6 - Verifying unRAID's Preclear signature:                                                        SUCCESS    #
#   Step 3 of 6 - Writing unRAID's Preclear signature:                                                          SUCCESS    #
#   Step 4 of 6 - Zeroing the disk:                                                         [3:04:22 @ 90 MB/s] SUCCESS    #
#   Step 5 of 6 - Erasing the disk:                                                         [3:05:42 @ 89 MB/s] SUCCESS    #
#   Step 6 of 6 - Pre-read verification:                                                    [3:04:17 @ 90 MB/s] SUCCESS    #
#                                                                                                                          #
#                                                                                                                          #
#                                                                                                                          #
#                                                                                                                          #
#                                                                                                                          #
#                                                                                                                          #
############################################################################################################################
#                              Cycle elapsed time: 12:19:22 | Total elapsed time: 24:45:37                                 #
############################################################################################################################

############################################################################################################################
#                                                                                                                          #
#                                               S.M.A.R.T. Status default                                                  #
#                                                                                                                          #
#                                                                                                                          #
#   ATTRIBUTE                 INITIAL   CYCLE 1   CYCLE 2   STATUS                                                         #
#   5-Reallocated_Sector_Ct   0         0         0         -                                                              #
#   9-Power_On_Hours          8714      8727      8739      Up 25                                                          #
#   194-Temperature_Celsius   30        34        34        Up 4                                                           #
#   199-UDMA_CRC_Error_Count  89799429  93099068  96395714  Up 6596285                                                     #
#                                                                                                                          #
#                                                                                                                          #
#                                                                                                                          #
#                                                                                                                          #
#                                                                                                                          #
#                                                                                                                          #
#                                                                                                                          #
#                                                                                                                          #
############################################################################################################################
#   SMART overall-health self-assessment test result: PASSED                                                               #
############################################################################################################################

--> ATTENTION: Please take a look into the SMART report above for drive health issues.

--> RESULT: Preclear Finished Successfully!.
preclear_report_XXXXXXXXX_2019.11.26_00.43.02.txt

############################################################################################################################
#                                                                                                                          #
#                                        unRAID Server Preclear of disk XXXXXXXXX                                          #
#                                       Cycle 1 of 1, partition start on sector 64.                                        #
#                                                                                                                          #
#                                                                                                                          #
#   Step 1 of 3 - Verifying unRAID's Preclear signature:                                                        SUCCESS    #
#   Step 2 of 3 - Writing unRAID's Preclear signature:                                                          SUCCESS    #
#   Step 3 of 3 - Zeroing the disk:                                                         [3:03:43 @ 90 MB/s] SUCCESS    #
#                                                                                                                          #
#                                                                                                                          #
#                                                                                                                          #
#                                                                                                                          #
#                                                                                                                          #
#                                                                                                                          #
#                                                                                                                          #
#                                                                                                                          #
#                                                                                                                          #
############################################################################################################################
#                               Cycle elapsed time: 3:03:51 | Total elapsed time: 3:03:52                                  #
############################################################################################################################

############################################################################################################################
#                                                                                                                          #
#                                        S.M.A.R.T. Status (device type: default)                                          #
#                                                                                                                          #
#                                                                                                                          #
#   ATTRIBUTE                 INITIAL    CYCLE 1    STATUS                                                                 #
#   5-Reallocated_Sector_Ct   0          0          -                                                                      #
#   9-Power_On_Hours          8789       8792       Up 3                                                                   #
#   194-Temperature_Celsius   28         37         Up 9                                                                   #
#   199-UDMA_CRC_Error_Count  103063650  103886091  Up 822441                                                              #
#                                                                                                                          #
#                                                                                                                          #
#                                                                                                                          #
#                                                                                                                          #
#                                                                                                                          #
#                                                                                                                          #
#                                                                                                                          #
#                                                                                                                          #
############################################################################################################################
#   SMART overall-health self-assessment test result: PASSED                                                               #
############################################################################################################################

--> ATTENTION: Please take a look into the SMART report above for drive health issues.

--> RESULT: Preclear Finished Successfully!.

Thanks,

 

-SS

Link to comment
On 12/13/2019 at 6:47 AM, johnnie.black said:

If it keeps increasing there's still a problem, try a different backplane slot, it's either a cable, backplane or controller problem.

 

On 12/13/2019 at 7:22 AM, Shomil Saini said:

😥🙄 I would be really sad if that was the case.

I will pop it into another slot and update you soon.

 

Cheers!

Hi Johnnie.black,

 

As promised, here is the report from today. Error count has gone up 2469362 again.

This was a completely different computer, drive connected via SATA port on MOBO with fresh Unraid boot drive.

 

I am a little happy that it is not a "cable, back-plane or controller problem" on my server. That would have been expensive lol.

 

Any suggestions about the drive behavior?

 

preclear_report_XXXXXXXXX_2019.12.14_17.00.09.txt 

############################################################################################################################
#                                                                                                                          #
#                                        unRAID Server Preclear of disk XXXXXXXXX                                          #
#                                       Cycle 1 of 1, partition start on sector 64.                                        #
#                                                                                                                          #
#                                                                                                                          #
#   Step 1 of 5 - Pre-read verification:                                                    [3:03:48 @ 90 MB/s] SUCCESS    #
#   Step 2 of 5 - Zeroing the disk:                                                         [3:02:30 @ 91 MB/s] SUCCESS    #
#   Step 3 of 5 - Writing unRAID's Preclear signature:                                                          SUCCESS    #
#   Step 4 of 5 - Verifying unRAID's Preclear signature:                                                        SUCCESS    #
#   Step 5 of 5 - Post-Read verification:                                                   [3:03:30 @ 90 MB/s] SUCCESS    #
#                                                                                                                          #
#                                                                                                                          #
#                                                                                                                          #
#                                                                                                                          #
#                                                                                                                          #
#                                                                                                                          #
#                                                                                                                          #
############################################################################################################################
#                               Cycle elapsed time: 9:09:51 | Total elapsed time: 9:09:51                                  #
############################################################################################################################

############################################################################################################################
#                                                                                                                          #
#                                        S.M.A.R.T. Status (device type: default)                                          #
#                                                                                                                          #
#                                                                                                                          #
#   ATTRIBUTE                 INITIAL    CYCLE 1    STATUS                                                                 #
#   5-Reallocated_Sector_Ct   0          0          -                                                                      #
#   9-Power_On_Hours          9002       9011       Up 9                                                                   #
#   194-Temperature_Celsius   30         42         Up 12                                                                  #
#   199-UDMA_CRC_Error_Count  106412609  108881971  Up 2469362                                                             #
#                                                                                                                          #
#                                                                                                                          #
#                                                                                                                          #
#                                                                                                                          #
#                                                                                                                          #
#                                                                                                                          #
#                                                                                                                        #
#                                                                                                                          #
############################################################################################################################
#   SMART overall-health self-assessment test result: PASSED                                                               #
############################################################################################################################

--> ATTENTION: Please take a look into the SMART report above for drive health issues.

--> RESULT: Preclear Finished Successfully!.

 

Link to comment
9 hours ago, johnnie.black said:

If the error continues to increase in another server it's likely the disk, but it's extremely rare, in fact I don't remember ever seeing it ever before, but of course it's not impossible.

Thanks johnnie.black.

I am tilting towards drive issue myself and would keep a close eye on the drive.

 

Will update here when drive dies.

For a 1Tb Toshiba Laptop drive, It has seen 31.75Tb Total lbas written and 29.5Tb Total lbas read so I am not sad to let it go.

 

Thanks for helping.

 

Cheers!

Link to comment
  • 8 months later...

Join the conversation

You can post now and register later. If you have an account, sign in now to post with your account.
Note: Your post will require moderator approval before it will be visible.

Guest
Reply to this topic...

×   Pasted as rich text.   Restore formatting

  Only 75 emoji are allowed.

×   Your link has been automatically embedded.   Display as a link instead

×   Your previous content has been restored.   Clear editor

×   You cannot paste images directly. Upload or insert images from URL.