Preclear.sh results - Questions about your results? Post them here.

JustinChase · October 17, 2014

I just finished a preclear on a new drive. There are a couple of items showing "near threshold". Should I be concerned?

** Changed attributes in files: /tmp/smart_start_sdi  /tmp/smart_finish_sdi
                ATTRIBUTE   NEW_VAL OLD_VAL FAILURE_THRESHOLD STATUS      RAW_VALUE
      Raw_Read_Error_Rate =   118     100            6        ok          179537912
             Spin_Up_Time =    98      92            0        ok          0
         Spin_Retry_Count =   100     100           97        near_thresh 0
         End-to-End_Error =   100     100           99        near_thresh 0
          High_Fly_Writes =    99     100            0        ok          1
  Airflow_Temperature_Cel =    75      74           45        ok          25
      Temperature_Celsius =    25      26            0        near_thresh 25
No SMART attributes are FAILING_NOW

0 sectors were pending re-allocation before the start of the preclear.
0 sectors were pending re-allocation after pre-read in cycle 1 of 1.
0 sectors were pending re-allocation after zero of disk in cycle 1 of 1.
0 sectors are pending re-allocation at the end of the preclear,
    the number of sectors pending re-allocation did not change.
0 sectors had been re-allocated before the start of the preclear.
0 sectors are re-allocated at the end of the preclear,
    the number of sectors re-allocated did not change.

RobJ · October 17, 2014

I just finished a preclear on a new drive. There are a couple of items showing "near threshold". Should I be concerned?

** Changed attributes in files: /tmp/smart_start_sdi  /tmp/smart_finish_sdi
                ATTRIBUTE   NEW_VAL OLD_VAL FAILURE_THRESHOLD STATUS      RAW_VALUE
      Raw_Read_Error_Rate =   118     100            6        ok          179537912
             Spin_Up_Time =    98      92            0        ok          0
         Spin_Retry_Count =   100     100           97        near_thresh 0
         End-to-End_Error =   100     100           99        near_thresh 0
          High_Fly_Writes =    99     100            0        ok          1
  Airflow_Temperature_Cel =    75      74           45        ok          25
      Temperature_Celsius =    25      26            0        near_thresh 25
No SMART attributes are FAILING_NOW

0 sectors were pending re-allocation before the start of the preclear.
0 sectors were pending re-allocation after pre-read in cycle 1 of 1.
0 sectors were pending re-allocation after zero of disk in cycle 1 of 1.
0 sectors are pending re-allocation at the end of the preclear,
    the number of sectors pending re-allocation did not change.
0 sectors had been re-allocated before the start of the preclear.
0 sectors are re-allocated at the end of the preclear,
    the number of sectors re-allocated did not change.

Nothing to worry about, at all. I've made some suggestions to Joe L about removing these near_thresh's from the reporting, but I suspect he's been too busy lately.

JustinChase · October 17, 2014

I figured as much, but it's good to get confirmation; thanks.

JustinChase · October 18, 2014

so, this is my old Parity drive. it started acting up and I've replaced it with a new drive. I was hoping a preclear would 'fix' this drive, but I'm not sure how to interpret these results.

Is this drive okay to put back into service in the array as a data drive?

** Changed attributes in files: /tmp/smart_start_sdf  /tmp/smart_finish_sdf
                ATTRIBUTE   NEW_VAL OLD_VAL FAILURE_THRESHOLD STATUS      RAW_VALUE
      Raw_Read_Error_Rate =   115     117            6        ok          188984744
           Power_On_Hours =    91      92            0        ok          7887
         Spin_Retry_Count =   100     100           97        near_thresh 0
         End-to-End_Error =   100     100           99        near_thresh 0
       Reported_Uncorrect =    89      91            0        ok          11
          High_Fly_Writes =    88      92            0        ok          12
  Airflow_Temperature_Cel =    69      72           45        near_thresh 31
      Temperature_Celsius =    31      28            0        ok          31
No SMART attributes are FAILING_NOW

8 sectors were pending re-allocation before the start of the preclear.
8 sectors were pending re-allocation after pre-read in cycle 1 of 1.
0 sectors were pending re-allocation after zero of disk in cycle 1 of 1.
0 sectors are pending re-allocation at the end of the preclear,
    a change of -8 in the number of sectors pending re-allocation.
40 sectors had been re-allocated before the start of the preclear.
40 sectors are re-allocated at the end of the preclear,
    the number of sectors re-allocated did not change.

RobJ · October 18, 2014

so, this is my old Parity drive. it started acting up and I've replaced it with a new drive. I was hoping a preclear would 'fix' this drive, but I'm not sure how to interpret these results.

Is this drive okay to put back into service in the array as a data drive?

** Changed attributes in files: /tmp/smart_start_sdf  /tmp/smart_finish_sdf
                ATTRIBUTE   NEW_VAL OLD_VAL FAILURE_THRESHOLD STATUS      RAW_VALUE
      Raw_Read_Error_Rate =   115     117            6        ok          188984744
           Power_On_Hours =    91      92            0        ok          7887
         Spin_Retry_Count =   100     100           97        near_thresh 0
         End-to-End_Error =   100     100           99        near_thresh 0
       Reported_Uncorrect =    89      91            0        ok          11
          High_Fly_Writes =    88      92            0        ok          12
  Airflow_Temperature_Cel =    69      72           45        near_thresh 31
      Temperature_Celsius =    31      28            0        ok          31
No SMART attributes are FAILING_NOW

8 sectors were pending re-allocation before the start of the preclear.
8 sectors were pending re-allocation after pre-read in cycle 1 of 1.
0 sectors were pending re-allocation after zero of disk in cycle 1 of 1.
0 sectors are pending re-allocation at the end of the preclear,
    a change of -8 in the number of sectors pending re-allocation.
40 sectors had been re-allocated before the start of the preclear.
40 sectors are re-allocated at the end of the preclear,
    the number of sectors re-allocated did not change.

It means the drive tested each of the 8 sectors, when written zeroes to, and decided they were fine so put them back online. It doesn't mean those 8 are perfect, although they could be, but for now the drive believes each of them can be trusted. The drive appears to be fine now, but not perfect. The best advice we give users when this happens is to Preclear it one or two more times, and see if anything changes. If no further changes, then the drive should be good. All other numbers look OK.

JustinChase · October 18, 2014

Okay, that makes sense. I'll preclear it 2 more times, then see what I come up with.

Should I do the full preclear, or can/should I use the -W option to save a bit of time? That preclear took about 50 hours. I can live with it taking 100 hours, but if I can cut that down a bit, that might be better.

jowi · October 20, 2014

I remember i noticed this before but i'm not sure what the reason was. I am running preclear on a new WD red, but it keeps all my drives in a spinned up state? If i spin them down, they start right up. Is this a known 'feature'?

jowi · October 20, 2014

I've disabled all plugins, the only thing that is running is preclear. Also there is constant reading from disk1? Is preclear causing this, and why?

garycase · October 20, 2014

I've disabled all plugins, the only thing that is running is preclear. Also there is constant reading from disk1? Is preclear causing this, and why?

Constant reads? ... or is it just spinning? On some controllers, if one drive is active, other drives on the controller will also be active => so if the drive you're pre-clearing and disk1 are both on a controller with these properties, that would explain it. But it shouldn't be causing actual read activity from the drive.

Do you have any plugins that may be using the disk?

jowi · October 20, 2014

No, all plugins are stopped. I can see the read counter from the webgui on disk1 increasing, also the disk light on the case is blinking... so there is definitly something going on...

*edit* i missed one plugin, cache_dirs... stopped it, it looks like all disks are spinned down... still weird, normally cache_dirs does not have that effect...

garycase · October 20, 2014

Cache_Dirs will indeed do a LOT of reading when you first boot the server; but will normally finish after all the directories are cached. In you have a LOT of files on disk1; and if the pre-clear activity is using enough of your memory that all of the directories can't be cached; then Cache_Dirs could be constantly attempting to update the cached information, and never actually finish, since everything isn't fitting in memory.

jowi · October 20, 2014

I guess that is exactly what happened. Indeed my disk1 is a collection of a lot of smaller files.

Maybe preclear could disable cache_dir when it starts, or inform the user that it is better to stop cache_dir while preclearing?

RobJ · October 20, 2014

Okay, that makes sense. I'll preclear it 2 more times, then see what I come up with.

Should I do the full preclear, or can/should I use the -W option to save a bit of time? That preclear took about 50 hours. I can live with it taking 100 hours, but if I can cut that down a bit, that might be better.

You are trying to thoroughly test the drive, so cutting back on the testing seems a little counter-productive, but yes the preread is the least useful part of the testing, so skipping it shouldn't affect the result and would save time.

Joe L. · October 21, 2014

Okay, that makes sense. I'll preclear it 2 more times, then see what I come up with.

Should I do the full preclear, or can/should I use the -W option to save a bit of time? That preclear took about 50 hours. I can live with it taking 100 hours, but if I can cut that down a bit, that might be better.

You are trying to thoroughly test the drive, so cutting back on the testing seems a little counter-productive, but yes the preread is the least useful part of the testing, so skipping it shouldn't affect the result and would save time.

Actually, it is only when "reading" that un-readable sectors can be identified. If you skip the pre-read, you'll only identify marginal sectors on the post-read, and not have any subsequent "write" to attempt to re-allocate those marginal sectors.

When requesting multiple cycles, the post-read of the first/prior cycle is used as the pre-read of the next, so some time is saved and you still get the benefit of the full test.

Joe L.

JustinChase · October 21, 2014

hmmm...

okay, too bad. I'm 48 hours into the 2 cycles without the pre-read, so that's 2 more wasted days so much for saving time.

I guess I'll cancel and start over again.

thanks for the info

Joe L. · October 21, 2014

hmmm...

okay, too bad. I'm 48 hours into the 2 cycles without the pre-read, so that's 2 more wasted days so much for saving time.

I guess I'll cancel and start over again.

thanks for the info

you'll be fine, as the intermediate post-read serves as the pre-read for the subsequent cycle. Let it continue to completion.

grither · October 24, 2014

hi all, wanted to run my preclear results by you. i've highlighted the part in bold that looks weird to me. for some background, this is a drive that flaked out on my one day and i lost ALL the data on it. i don't intend to reuse it, however i ran preclear to see what it would say. it says it has NOT been successfully precleared?

=======================================================================1.13

== invoked as: ./preclear_disk.sh -M 4 /dev/sdp == == Disk /dev/sdp has NOT been successfully precleared == Postread detected un-expected non-zero bytes on disk== == Ran 1 cycle == == Using :Read block size = 8225280 Bytes == Last Cycle's Pre Read Time : 8:07:02 (68 MB/s)

== Last Cycle's Zeroing time : 7:09:46 (77 MB/s)

== Last Cycle's Post Read Time : 15:36:35 (35 MB/s)

== Last Cycle's Total Time : 30:54:24

==

== Total Elapsed Time 30:54:24

==

== Disk Start Temperature: 34C

==

== Current Disk Temperature: 34C,

==

============================================================================

No SMART attributes are FAILING_NOW

0 sectors were pending re-allocation before the start of the preclear.

0 sectors were pending re-allocation after pre-read in cycle 1 of 1.

0 sectors were pending re-allocation after zero of disk in cycle 1 of 1.

0 sectors are pending re-allocation at the end of the preclear,

the number of sectors pending re-allocation did not change.

0 sectors had been re-allocated before the start of the preclear.

0 sectors are re-allocated at the end of the preclear,

the number of sectors re-allocated did not change.

============================================================================

==

== S.M.A.R.T Initial Report for /dev/sdp ==

Disk: /dev/sdp

=== START OF INFORMATION SECTION ===

Model Family: Western Digital Caviar Green family

Device Model: WDC WD20EADS-00R6B0

Serial Number: WD-WCAVY0403098

Firmware Version: 01.00A01

User Capacity: 2,000,398,934,016 bytes

Device is: In smartctl database [for details use: -P show]

ATA Version is: 8

ATA Standard is: Exact ATA specification draft version not indicated

Local Time is: Tue Oct 21 20:21:21 2014 CDT

SMART support is: Available - device has SMART capability.

SMART support is: Enabled

=== START OF READ SMART DATA SECTION === SMART overall-health self-assessment test result: PASSED

General SMART Values:

Offline data collection status: (0x84) Offline data collection activity

was suspended by an interrupting command from host.

Auto Offline Data Collection: Enabled.

Self-test execution status: ( 0) The previous self-test routine completed

without error or no self-test has ever

been run.

Total time to complete Offline

data collection: (43200) seconds.

Offline data collection

capabilities: (0x7b) SMART execute Offline immediate.

Auto Offline data collection on/off support.

Suspend Offline collection upon new

command.

Offline surface scan supported.

Self-test supported.

Conveyance Self-test supported.

Selective Self-test supported.

SMART capabilities: (0x0003) Saves SMART data before entering

power-saving mode.

Supports SMART auto save timer.

Error logging capability: (0x01) Error logging supported.

General Purpose Logging supported.

Short self-test routine

recommended polling time: ( 2) minutes.

Extended self-test routine

recommended polling time: ( 255) minutes.

Conveyance self-test routine

recommended polling time: ( 5) minutes.

SCT capabilities: (0x303f) SCT Status supported.

SCT Error Recovery Control supported.

SCT Feature Control supported.

SCT Data Table supported.

SMART Attributes Data Structure revision number: 16 Vendor Specific SMART Attributes with Thresholds:

ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE

1 Raw_Read_Error_Rate 0x002f 200 200 051 Pre-fail Always - 48

3 Spin_Up_Time 0x0027 158 149 021 Pre-fail Always - 9066

4 Start_Stop_Count 0x0032 087 087 000 Old_age Always - 13980

5 Reallocated_Sector_Ct 0x0033 200 200 140 Pre-fail Always - 0

7 Seek_Error_Rate 0x002e 200 200 000 Old_age Always - 0

9 Power_On_Hours 0x0032 040 040 000 Old_age Always - 43972

10 Spin_Retry_Count 0x0032 100 100 000 Old_age Always - 0

11 Calibration_Retry_Count 0x0032 100 100 000 Old_age Always - 0

12 Power_Cycle_Count 0x0032 100 100 000 Old_age Always - 118

192 Power-Off_Retract_Count 0x0032 200 200 000 Old_age Always - 8

193 Load_Cycle_Count 0x0032 196 196 000 Old_age Always - 13260

194 Temperature_Celsius 0x0022 118 094 000 Old_age Always - 34

196 Reallocated_Event_Count 0x0032 200 200 000 Old_age Always - 0

197 Current_Pending_Sector 0x0032 200 200 000 Old_age Always - 0

198 Offline_Uncorrectable 0x0030 200 200 000 Old_age Offline - 0

199 UDMA_CRC_Error_Count 0x0032 200 200 000 Old_age Always - 0

200 Multi_Zone_Error_Rate 0x0008 200 200 000 Old_age Offline - 0

SMART Error Log Version: 1

No Errors Logged

SMART Self-test log structure revision number 1 No self-tests have been logged. [To run self-tests, use: smartctl -t]

SMART Selective self-test log data structure revision number 1 SPAN MIN_LBA MAX_LBA CURRENT_TEST_STATUS

1 0 0 Not_testing

2 0 0 Not_testing

3 0 0 Not_testing

4 0 0 Not_testing

5 0 0 Not_testing

Selective self-test flags (0x0):

After scanning selected spans, do NOT read-scan remainder of disk.

If Selective self-test is pending on power-up, resume after 0 minute delay.

==

============================================================================

==

== S.M.A.R.T Final Report for /dev/sdp

==

Disk: /dev/sdp

=== START OF INFORMATION SECTION ===

Model Family: Western Digital Caviar Green family

Device Model: WDC WD20EADS-00R6B0

Serial Number: WD-WCAVY0403098

Firmware Version: 01.00A01

User Capacity: 2,000,398,934,016 bytes

Device is: In smartctl database [for details use: -P show]

ATA Version is: 8

ATA Standard is: Exact ATA specification draft version not indicated

Local Time is: Thu Oct 23 03:15:44 2014 CDT

SMART support is: Available - device has SMART capability.

SMART support is: Enabled

=== START OF READ SMART DATA SECTION === SMART overall-health self-assessment test result: PASSED

General SMART Values:

Offline data collection status: (0x84) Offline data collection activity

was suspended by an interrupting command from host.

Auto Offline Data Collection: Enabled.

Self-test execution status: ( 0) The previous self-test routine completed

without error or no self-test has ever

been run.

Total time to complete Offline

data collection: (43200) seconds.

Offline data collection

capabilities: (0x7b) SMART execute Offline immediate.

Auto Offline data collection on/off support.

Suspend Offline collection upon new

command.

Offline surface scan supported.

Self-test supported.

Conveyance Self-test supported.

Selective Self-test supported.

SMART capabilities: (0x0003) Saves SMART data before entering

power-saving mode.

Supports SMART auto save timer.

Error logging capability: (0x01) Error logging supported.

General Purpose Logging supported.

Short self-test routine

recommended polling time: ( 2) minutes.

Extended self-test routine

recommended polling time: ( 255) minutes.

Conveyance self-test routine

recommended polling time: ( 5) minutes.

SCT capabilities: (0x303f) SCT Status supported.

SCT Error Recovery Control supported.

SCT Feature Control supported.

SCT Data Table supported.

SMART Attributes Data Structure revision number: 16 Vendor Specific SMART Attributes with Thresholds:

ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE

1 Raw_Read_Error_Rate 0x002f 200 200 051 Pre-fail Always - 48

3 Spin_Up_Time 0x0027 158 149 021 Pre-fail Always - 9066

4 Start_Stop_Count 0x0032 087 087 000 Old_age Always - 13980

5 Reallocated_Sector_Ct 0x0033 200 200 140 Pre-fail Always - 0

7 Seek_Error_Rate 0x002e 200 200 000 Old_age Always - 0

9 Power_On_Hours 0x0032 040 040 000 Old_age Always - 44002