larson Posted November 18, 2011 Share Posted November 18, 2011 Your syslog is filled with these "media errors" (unreadable sectors) The speed is slowed way down since unRAID is resetting the disk and trying again and again on each failure. ov 18 08:05:56 Tower kernel: ata8.00: exception Emask 0x0 SAct 0x0 SErr 0x0 action 0x0 Nov 18 08:05:56 Tower kernel: ata8.00: BMDMA stat 0x64 Nov 18 08:05:56 Tower kernel: ata8.00: failed command: READ DMA Nov 18 08:05:56 Tower kernel: ata8.00: cmd c8/00:08:b8:b8:dd/00:00:00:00:00/e7 tag 0 dma 4096 in Nov 18 08:05:56 Tower kernel: res 51/40:08:b8:b8:dd/40:00:07:00:00/e7 Emask 0x9 (media error) Nov 18 08:05:56 Tower kernel: ata8.00: status: { DRDY ERR } Nov 18 08:05:56 Tower kernel: ata8.00: error: { UNC } Nov 18 08:05:56 Tower kernel: ata8.00: configured for UDMA/33 Nov 18 08:05:56 Tower kernel: ata8: EH complete And the most probable reason for this would be a bad disk? Or might it be controller or cable? I'll be running it tonight in an internal bay to check it out, more results in about 12 hours. [Edit] Didn't need any 12 hours to get a result. When running in an internal slot the results are similar to the results in the ESATA cradle. Is there anything in the SMART reports that I can use when returning the disk? Latest one is here: Disk: /dev/sdk smartctl 5.40 2010-10-16 r3189 [i486-slackware-linux-gnu] (local build) Copyright (C) 2002-10 by Bruce Allen, http://smartmontools.sourceforge.net === START OF INFORMATION SECTION === Device Model: WDC WD30EZRX-00MMMB0 Serial Number: WD-WCAWZ1377700 Firmware Version: 80.00A80 User Capacity: 3,000,592,982,016 bytes Device is: Not in smartctl database [for details use: -P showall] ATA Version is: 8 ATA Standard is: Exact ATA specification draft version not indicated Local Time is: Fri Nov 18 20:28:14 2011 CET SMART support is: Available - device has SMART capability. SMART support is: Enabled === START OF READ SMART DATA SECTION === SMART overall-health self-assessment test result: PASSED General SMART Values: Offline data collection status: (0x80) Offline data collection activity was never started. Auto Offline Data Collection: Enabled. Self-test execution status: ( 0) The previous self-test routine completed without error or no self-test has ever been run. Total time to complete Offline data collection: (49680) seconds. Offline data collection capabilities: (0x7b) SMART execute Offline immediate. Auto Offline data collection on/off support. Suspend Offline collection upon new command. Offline surface scan supported. Self-test supported. Conveyance Self-test supported. Selective Self-test supported. SMART capabilities: (0x0003) Saves SMART data before entering power-saving mode. Supports SMART auto save timer. Error logging capability: (0x01) Error logging supported. General Purpose Logging supported. Short self-test routine recommended polling time: ( 2) minutes. Extended self-test routine recommended polling time: ( 255) minutes. Conveyance self-test routine recommended polling time: ( 5) minutes. SCT capabilities: (0x3035) SCT Status supported. SCT Feature Control supported. SCT Data Table supported. SMART Attributes Data Structure revision number: 16 Vendor Specific SMART Attributes with Thresholds: ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE 1 Raw_Read_Error_Rate 0x002f 188 188 051 Pre-fail Always - 133 3 Spin_Up_Time 0x0027 192 178 021 Pre-fail Always - 7391 4 Start_Stop_Count 0x0032 100 100 000 Old_age Always - 13 5 Reallocated_Sector_Ct 0x0033 200 200 140 Pre-fail Always - 2 7 Seek_Error_Rate 0x002e 200 200 000 Old_age Always - 0 9 Power_On_Hours 0x0032 100 100 000 Old_age Always - 36 10 Spin_Retry_Count 0x0032 100 253 000 Old_age Always - 0 11 Calibration_Retry_Count 0x0032 100 253 000 Old_age Always - 0 12 Power_Cycle_Count 0x0032 100 100 000 Old_age Always - 10 192 Power-Off_Retract_Count 0x0032 200 200 000 Old_age Always - 8 193 Load_Cycle_Count 0x0032 200 200 000 Old_age Always - 33 194 Temperature_Celsius 0x0022 116 108 000 Old_age Always - 36 196 Reallocated_Event_Count 0x0032 199 199 000 Old_age Always - 1 197 Current_Pending_Sector 0x0032 200 200 000 Old_age Always - 3 198 Offline_Uncorrectable 0x0030 100 253 000 Old_age Offline - 0 199 UDMA_CRC_Error_Count 0x0032 200 200 000 Old_age Always - 0 200 Multi_Zone_Error_Rate 0x0008 100 253 000 Old_age Offline - 0 SMART Error Log Version: 1 ATA Error Count: 6686 (device log contains only the most recent five errors) CR = Command Register [HEX] FR = Features Register [HEX] SC = Sector Count Register [HEX] SN = Sector Number Register [HEX] CL = Cylinder Low Register [HEX] CH = Cylinder High Register [HEX] DH = Device/Head Register [HEX] DC = Device Command Register [HEX] ER = Error register [HEX] ST = Status register [HEX] Powered_Up_Time is measured from power on, and printed as DDd+hh:mm:SS.sss where DD=days, hh=hours, mm=minutes, SS=sec, and sss=millisec. It "wraps" after 49.710 days. Error 6686 occurred at disk power-on lifetime: 36 hours (1 days + 12 hours) When the command that caused the error occurred, the device was active or idle. After command completion occurred, registers were: ER ST SC SN CL CH DH -- -- -- -- -- -- -- 01 51 00 90 ac ff e2 Error: AMNF at LBA = 0x02ffac90 = 50310288 Commands leading to the command that caused the error were: CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name -- -- -- -- -- -- -- -- ---------------- -------------------- c8 00 00 b8 ab ff e2 08 00:45:51.078 READ DMA ec 00 00 00 00 00 a0 08 00:45:51.058 IDENTIFY DEVICE ef 03 45 00 00 00 a0 08 00:45:51.058 SET FEATURES [set transfer mode] Error 6685 occurred at disk power-on lifetime: 36 hours (1 days + 12 hours) When the command that caused the error occurred, the device was active or idle. After command completion occurred, registers were: ER ST SC SN CL CH DH -- -- -- -- -- -- -- 01 51 00 90 ac ff e2 Error: AMNF at LBA = 0x02ffac90 = 50310288 Commands leading to the command that caused the error were: CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name -- -- -- -- -- -- -- -- ---------------- -------------------- c8 00 00 b8 ab ff e2 08 00:45:42.359 READ DMA c8 00 00 b8 aa ff e2 08 00:45:42.358 READ DMA c8 00 00 b8 a9 ff e2 08 00:45:42.357 READ DMA c8 00 00 b8 a8 ff e2 08 00:45:42.356 READ DMA c8 00 00 b8 a7 ff e2 08 00:45:42.355 READ DMA Error 6684 occurred at disk power-on lifetime: 36 hours (1 days + 12 hours) When the command that caused the error occurred, the device was active or idle. After command completion occurred, registers were: ER ST SC SN CL CH DH -- -- -- -- -- -- -- 01 51 00 98 1d d4 e2 Error: AMNF at LBA = 0x02d41d98 = 47455640 Commands leading to the command that caused the error were: CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name -- -- -- -- -- -- -- -- ---------------- -------------------- c8 00 00 f0 1c d4 e2 08 00:45:00.946 READ DMA ec 00 00 00 00 00 a0 08 00:45:00.927 IDENTIFY DEVICE ef 03 46 00 00 00 a0 08 00:45:00.927 SET FEATURES [set transfer mode] Error 6683 occurred at disk power-on lifetime: 36 hours (1 days + 12 hours) When the command that caused the error occurred, the device was active or idle. After command completion occurred, registers were: ER ST SC SN CL CH DH -- -- -- -- -- -- -- 40 51 00 98 1d d4 e2 Error: UNC at LBA = 0x02d41d98 = 47455640 Commands leading to the command that caused the error were: CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name -- -- -- -- -- -- -- -- ---------------- -------------------- c8 00 00 f0 1c d4 e2 08 00:44:52.477 READ DMA c8 00 00 f0 1b d4 e2 08 00:44:52.476 READ DMA c8 00 00 f0 1a d4 e2 08 00:44:52.475 READ DMA c8 00 00 f0 19 d4 e2 08 00:44:52.473 READ DMA c8 00 00 f0 18 d4 e2 08 00:44:52.472 READ DMA Error 6682 occurred at disk power-on lifetime: 36 hours (1 days + 12 hours) When the command that caused the error occurred, the device was active or idle. After command completion occurred, registers were: ER ST SC SN CL CH DH -- -- -- -- -- -- -- 40 51 00 28 04 d4 e2 Error: UNC at LBA = 0x02d40428 = 47449128 Commands leading to the command that caused the error were: CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name -- -- -- -- -- -- -- -- ---------------- -------------------- c8 00 00 f0 03 d4 e2 08 00:44:37.369 READ DMA ec 00 00 00 00 00 a0 08 00:44:37.349 IDENTIFY DEVICE ef 03 46 00 00 00 a0 08 00:44:37.349 SET FEATURES [set transfer mode] SMART Self-test log structure revision number 1 No self-tests have been logged. [To run self-tests, use: smartctl -t] SMART Selective self-test log data structure revision number 1 SPAN MIN_LBA MAX_LBA CURRENT_TEST_STATUS 1 0 0 Not_testing 2 0 0 Not_testing 3 0 0 Not_testing 4 0 0 Not_testing 5 0 0 Not_testing Selective self-test flags (0x0): After scanning selected spans, do NOT read-scan remainder of disk. If Selective self-test is pending on power-up, resume after 0 minute delay. /Lars Olof Quote Link to comment
Joe L. Posted November 19, 2011 Share Posted November 19, 2011 media errors are unreadable sectors on the disk. they might be physically bad, or just written poorly. they are not a bad cable... those show as crc errors. (checksum errors) Quote Link to comment
ctviggen Posted November 22, 2011 Share Posted November 22, 2011 I have the following errors on one drive: ========================================================================1.13 == invoked as: ./preclear_disk.sh /dev/sda == WDC WD15EARS-00Z5B1 WD-WMAVU2957091 == Disk /dev/sda has been successfully precleared == with a starting sector of 63 == Ran 1 cycle == == Using :Read block size = 8225280 Bytes == Last Cycle's Pre Read Time : 6:59:52 (59 MB/s) == Last Cycle's Zeroing time : 15:53:25 (26 MB/s) == Last Cycle's Post Read Time : 13:30:42 (30 MB/s) == Last Cycle's Total Time : 36:25:06 == == Total Elapsed Time 36:25:06 == == Disk Start Temperature: 30C == == Current Disk Temperature: 29C, == ============================================================================ ** Changed attributes in files: /tmp/smart_start_sda /tmp/smart_finish_sda ATTRIBUTE NEW_VAL OLD_VAL FAILURE_THRESHOLD STATUS RAW_VALUE Reallocated_Sector_Ct = 132 141 140 FAILING_NOW 539 Seek_Error_Rate = 100 181 0 ok 0 Temperature_Celsius = 121 120 0 ok 29 Reallocated_Event_Count = 1 1 0 near_thresh 312 Current_Pending_Sector = 1 200 0 near_thresh 65534 *** Failing SMART Attributes in /tmp/smart_finish_sda *** ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE 5 Reallocated_Sector_Ct 0x0033 132 132 140 Pre-fail Always FAILING_NOW 539 0 sectors were pending re-allocation before the start of the preclear. 0 sectors were pending re-allocation after pre-read in cycle 1 of 1. 65534 sectors were pending re-allocation after zero of disk in cycle 1 of 1. 65534 sectors are pending re-allocation at the end of the preclear, a change of 65534 in the number of sectors pending re-allocation. 471 sectors had been re-allocated before the start of the preclear. 539 sectors are re-allocated at the end of the preclear, a change of 68 in the number of sectors re-allocated. SMART overall-health status = FAILED! ============================================================================ What does this mean? Quote Link to comment
Joe L. Posted November 23, 2011 Share Posted November 23, 2011 It means it is time to RMA the drive. Quote Link to comment
ctviggen Posted November 23, 2011 Share Posted November 23, 2011 Thanks, that's what I thought. Unfortunately, the drive is "old", and I can't RMA it. But, I did not put it in my (new) unraid configuration. I'm going to try to buy another drive this (black) Friday. Quote Link to comment
adammerkley Posted December 3, 2011 Share Posted December 3, 2011 Just to double-check, this drive I received as an RMA from Seagate is good to go, right? ========================================================================1.13 == invoked as: ./preclear_disk.sh -A -M 4 -c 3 /dev/sda == ST31500341AS [REDACTED] == Disk /dev/sda has been successfully precleared == with a starting sector of 64 == Ran 3 cycles == == Using :Read block size = 8225280 Bytes == Last Cycle's Pre Read Time : 4:34:52 (90 MB/s) == Last Cycle's Zeroing time : 4:16:25 (97 MB/s) == Last Cycle's Post Read Time : 10:33:57 (39 MB/s) == Last Cycle's Total Time : 14:51:28 == == Total Elapsed Time 49:09:41 == == Disk Start Temperature: 39C == == Current Disk Temperature: 40C, == ============================================================================ ** Changed attributes in files: /tmp/smart_start_sda /tmp/smart_finish_sda ATTRIBUTE NEW_VAL OLD_VAL FAILURE_THRESHOLD STATUS RAW_VALUE Raw_Read_Error_Rate = 111 116 6 ok 37320087 Spin_Retry_Count = 100 100 97 near_thresh 0 End-to-End_Error = 100 100 99 near_thresh 0 High_Fly_Writes = 96 100 0 ok 4 Airflow_Temperature_Cel = 60 61 45 near_thresh 40 Temperature_Celsius = 40 39 0 ok 40 Hardware_ECC_Recovered = 65 62 0 ok 37320087 No SMART attributes are FAILING_NOW 0 sectors were pending re-allocation before the start of the preclear. 0 sectors were pending re-allocation after pre-read in cycle 1 of 3. 0 sectors were pending re-allocation after zero of disk in cycle 1 of 3. 0 sectors were pending re-allocation after post-read in cycle 1 of 3. 0 sectors were pending re-allocation after zero of disk in cycle 2 of 3. 0 sectors were pending re-allocation after post-read in cycle 2 of 3. 0 sectors were pending re-allocation after zero of disk in cycle 3 of 3. 0 sectors are pending re-allocation at the end of the preclear, the number of sectors pending re-allocation did not change. 0 sectors had been re-allocated before the start of the preclear. 0 sectors are re-allocated at the end of the preclear, the number of sectors re-allocated did not change. ============================================================================ Quote Link to comment
Joe L. Posted December 3, 2011 Share Posted December 3, 2011 Just to double-check, this drive I received as an RMA from Seagate is good to go, right? ========================================================================1.13 == invoked as: ./preclear_disk.sh -A -M 4 -c 3 /dev/sda == ST31500341AS [REDACTED] == Disk /dev/sda has been successfully precleared == with a starting sector of 64 == Ran 3 cycles == == Using :Read block size = 8225280 Bytes == Last Cycle's Pre Read Time : 4:34:52 (90 MB/s) == Last Cycle's Zeroing time : 4:16:25 (97 MB/s) == Last Cycle's Post Read Time : 10:33:57 (39 MB/s) == Last Cycle's Total Time : 14:51:28 == == Total Elapsed Time 49:09:41 == == Disk Start Temperature: 39C == == Current Disk Temperature: 40C, == ============================================================================ ** Changed attributes in files: /tmp/smart_start_sda /tmp/smart_finish_sda ATTRIBUTE NEW_VAL OLD_VAL FAILURE_THRESHOLD STATUS RAW_VALUE Raw_Read_Error_Rate = 111 116 6 ok 37320087 Spin_Retry_Count = 100 100 97 near_thresh 0 End-to-End_Error = 100 100 99 near_thresh 0 High_Fly_Writes = 96 100 0 ok 4 Airflow_Temperature_Cel = 60 61 45 near_thresh 40 Temperature_Celsius = 40 39 0 ok 40 Hardware_ECC_Recovered = 65 62 0 ok 37320087 No SMART attributes are FAILING_NOW 0 sectors were pending re-allocation before the start of the preclear. 0 sectors were pending re-allocation after pre-read in cycle 1 of 3. 0 sectors were pending re-allocation after zero of disk in cycle 1 of 3. 0 sectors were pending re-allocation after post-read in cycle 1 of 3. 0 sectors were pending re-allocation after zero of disk in cycle 2 of 3. 0 sectors were pending re-allocation after post-read in cycle 2 of 3. 0 sectors were pending re-allocation after zero of disk in cycle 3 of 3. 0 sectors are pending re-allocation at the end of the preclear, the number of sectors pending re-allocation did not change. 0 sectors had been re-allocated before the start of the preclear. 0 sectors are re-allocated at the end of the preclear, the number of sectors re-allocated did not change. ============================================================================ looks good to me. Quote Link to comment
cherritaker Posted December 4, 2011 Share Posted December 4, 2011 Hello i was just wondering if these results were good for this old hitachi hard drive i had laying around to use as a cache drive on my unraid thank you in advance. ========================================================================1.13 == invoked as: ./preclear_disk.sh -a -M 4 /dev/sdg == HDS725050KLA360 KRVN68ZAHVNDVG == Disk /dev/sdg has been successfully precleared == with a starting sector of 63 == Ran 1 cycle == == Using :Read block size = 8225280 Bytes == Last Cycle's Pre Read Time : 3:05:37 (44 MB/s) == Last Cycle's Zeroing time : 2:56:09 (47 MB/s) == Last Cycle's Post Read Time : 7:15:34 (19 MB/s) == Last Cycle's Total Time : 13:18:36 == == Total Elapsed Time 13:18:36 == == Disk Start Temperature: 33C == == Current Disk Temperature: 35C, == ============================================================================ ** Changed attributes in files: /tmp/smart_start_sdg /tmp/smart_finish_sdg ATTRIBUTE NEW_VAL OLD_VAL FAILURE_THRESHOLD STATUS RAW_VALUE Temperature_Celsius = 157 166 0 ok 35 No SMART attributes are FAILING_NOW 0 sectors were pending re-allocation before the start of the preclear. 0 sectors were pending re-allocation after pre-read in cycle 1 of 1. 0 sectors were pending re-allocation after zero of disk in cycle 1 of 1. 0 sectors are pending re-allocation at the end of the preclear, the number of sectors pending re-allocation did not change. 28 sectors had been re-allocated before the start of the preclear. 31 sectors are re-allocated at the end of the preclear, a change of 3 in the number of sectors re-allocated. ============================================================================ ============================================================================ == == S.M.A.R.T Initial Report for /dev/sdg == Disk: /dev/sdg smartctl 5.39.1 2010-01-28 r3054 [i486-slackware-linux-gnu] (local build) Copyright (C) 2002-10 by Bruce Allen, http://smartmontools.sourceforge.net === START OF INFORMATION SECTION === Model Family: Hitachi Deskstar 7K500 series Device Model: HDS725050KLA360 Serial Number: KRVN68ZAHVNDVG Firmware Version: K2AOAB0A User Capacity: 500,107,862,016 bytes Device is: In smartctl database [for details use: -P show] ATA Version is: 7 ATA Standard is: ATA/ATAPI-7 T13 1532D revision 1 Local Time is: Sun Dec 4 01:20:07 2011 Local time zone must be set--see zic m SMART support is: Available - device has SMART capability. SMART support is: Enabled === START OF READ SMART DATA SECTION === SMART overall-health self-assessment test result: PASSED General SMART Values: Offline data collection status: (0x00) Offline data collection activity was never started. Auto Offline Data Collection: Disabled. Self-test execution status: ( 0) The previous self-test routine completed without error or no self-test has ever been run. Total time to complete Offline data collection: (10419) seconds. Offline data collection capabilities: (0x5b) SMART execute Offline immediate. Auto Offline data collection on/off support. Suspend Offline collection upon new command. Offline surface scan supported. Self-test supported. No Conveyance Self-test supported. Selective Self-test supported. SMART capabilities: (0x0003) Saves SMART data before entering power-saving mode. Supports SMART auto save timer. Error logging capability: (0x01) Error logging supported. General Purpose Logging supported. Short self-test routine recommended polling time: ( 1) minutes. Extended self-test routine recommended polling time: ( 174) minutes. SCT capabilities: (0x003f) SCT Status supported. SCT Feature Control supported. SCT Data Table supported. SMART Attributes Data Structure revision number: 16 Vendor Specific SMART Attributes with Thresholds: ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE 1 Raw_Read_Error_Rate 0x000b 100 100 016 Pre-fail Always - 0 2 Throughput_Performance 0x0005 100 100 050 Pre-fail Offline - 0 3 Spin_Up_Time 0x0007 136 136 024 Pre-fail Always - 599 (Average 425) 4 Start_Stop_Count 0x0012 100 100 000 Old_age Always - 890 5 Reallocated_Sector_Ct 0x0033 100 100 005 Pre-fail Always - 28 7 Seek_Error_Rate 0x000b 100 100 067 Pre-fail Always - 0 8 Seek_Time_Performance 0x0005 100 100 020 Pre-fail Offline - 0 9 Power_On_Hours 0x0012 096 096 000 Old_age Always - 28387 10 Spin_Retry_Count 0x0013 100 100 060 Pre-fail Always - 0 12 Power_Cycle_Count 0x0032 100 100 000 Old_age Always - 769 192 Power-Off_Retract_Count 0x0032 099 099 050 Old_age Always - 2050 193 Load_Cycle_Count 0x0012 099 099 050 Old_age Always - 2050 194 Temperature_Celsius 0x0002 166 166 000 Old_age Always - 33 (Lifetime Min/Max 12/56) 196 Reallocated_Event_Count 0x0032 100 100 000 Old_age Always - 28 197 Current_Pending_Sector 0x0022 100 100 000 Old_age Always - 0 198 Offline_Uncorrectable 0x0008 100 100 000 Old_age Offline - 0 199 UDMA_CRC_Error_Count 0x000a 200 253 000 Old_age Always - 433 SMART Error Log Version: 1 No Errors Logged SMART Self-test log structure revision number 1 Num Test_Description Status Remaining LifeTime(hours) LBA_of_first_error # 1 Short offline Interrupted (host reset) 40% 20815 - # 2 Short captive Completed without error 00% 18722 - # 3 Short captive Completed without error 00% 18465 - # 4 Extended captive Interrupted (host reset) 90% 18465 - # 5 Short captive Completed without error 00% 13786 - Warning! SMART Selective Self-Test Log Structure error: invalid SMART checksum. SMART Selective self-test log data structure revision number 1 SPAN MIN_LBA MAX_LBA CURRENT_TEST_STATUS 1 0 0 Not_testing 2 0 0 Not_testing 3 0 0 Not_testing 4 0 0 Not_testing 5 0 0 Not_testing Selective self-test flags (0x0): After scanning selected spans, do NOT read-scan remainder of disk. If Selective self-test is pending on power-up, resume after 0 minute delay. == ============================================================================ ============================================================================ == == S.M.A.R.T Final Report for /dev/sdg == Disk: /dev/sdg smartctl 5.39.1 2010-01-28 r3054 [i486-slackware-linux-gnu] (local build) Copyright (C) 2002-10 by Bruce Allen, http://smartmontools.sourceforge.net === START OF INFORMATION SECTION === Model Family: Hitachi Deskstar 7K500 series Device Model: HDS725050KLA360 Serial Number: KRVN68ZAHVNDVG Firmware Version: K2AOAB0A User Capacity: 500,107,862,016 bytes Device is: In smartctl database [for details use: -P show] ATA Version is: 7 ATA Standard is: ATA/ATAPI-7 T13 1532D revision 1 Local Time is: Sun Dec 4 14:38:43 2011 Local time zone must be set--see zic m SMART support is: Available - device has SMART capability. SMART support is: Enabled === START OF READ SMART DATA SECTION === SMART overall-health self-assessment test result: PASSED General SMART Values: Offline data collection status: (0x00) Offline data collection activity was never started. Auto Offline Data Collection: Disabled. Self-test execution status: ( 0) The previous self-test routine completed without error or no self-test has ever been run. Total time to complete Offline data collection: (10419) seconds. Offline data collection capabilities: (0x5b) SMART execute Offline immediate. Auto Offline data collection on/off support. Suspend Offline collection upon new command. Offline surface scan supported. Self-test supported. No Conveyance Self-test supported. Selective Self-test supported. SMART capabilities: (0x0003) Saves SMART data before entering power-saving mode. Supports SMART auto save timer. Error logging capability: (0x01) Error logging supported. General Purpose Logging supported. Short self-test routine recommended polling time: ( 1) minutes. Extended self-test routine recommended polling time: ( 174) minutes. SCT capabilities: (0x003f) SCT Status supported. SCT Feature Control supported. SCT Data Table supported. SMART Attributes Data Structure revision number: 16 Vendor Specific SMART Attributes with Thresholds: ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE 1 Raw_Read_Error_Rate 0x000b 100 100 016 Pre-fail Always - 65536 2 Throughput_Performance 0x0005 100 100 050 Pre-fail Offline - 0 3 Spin_Up_Time 0x0007 136 136 024 Pre-fail Always - 599 (Average 425) 4 Start_Stop_Count 0x0012 100 100 000 Old_age Always - 890 5 Reallocated_Sector_Ct 0x0033 100 100 005 Pre-fail Always - 31 7 Seek_Error_Rate 0x000b 100 100 067 Pre-fail Always - 0 8 Seek_Time_Performance 0x0005 100 100 020 Pre-fail Offline - 0 9 Power_On_Hours 0x0012 096 096 000 Old_age Always - 28400 10 Spin_Retry_Count 0x0013 100 100 060 Pre-fail Always - 0 12 Power_Cycle_Count 0x0032 100 100 000 Old_age Always - 769 192 Power-Off_Retract_Count 0x0032 099 099 050 Old_age Always - 2050 193 Load_Cycle_Count 0x0012 099 099 050 Old_age Always - 2050 194 Temperature_Celsius 0x0002 157 157 000 Old_age Always - 35 (Lifetime Min/Max 12/56) 196 Reallocated_Event_Count 0x0032 100 100 000 Old_age Always - 31 197 Current_Pending_Sector 0x0022 100 100 000 Old_age Always - 0 198 Offline_Uncorrectable 0x0008 100 100 000 Old_age Offline - 0 199 UDMA_CRC_Error_Count 0x000a 200 253 000 Old_age Always - 433 SMART Error Log Version: 1 No Errors Logged SMART Self-test log structure revision number 1 Num Test_Description Status Remaining LifeTime(hours) LBA_of_first_error # 1 Short offline Interrupted (host reset) 40% 20815 - # 2 Short captive Completed without error 00% 18722 - # 3 Short captive Completed without error 00% 18465 - # 4 Extended captive Interrupted (host reset) 90% 18465 - # 5 Short captive Completed without error 00% 13786 - Warning! SMART Selective Self-Test Log Structure error: invalid SMART checksum. SMART Selective self-test log data structure revision number 1 SPAN MIN_LBA MAX_LBA CURRENT_TEST_STATUS 1 0 0 Not_testing 2 0 0 Not_testing 3 0 0 Not_testing 4 0 0 Not_testing 5 0 0 Not_testing Selective self-test flags (0x0): After scanning selected spans, do NOT read-scan remainder of disk. If Selective self-test is pending on power-up, resume after 0 minute delay. == ============================================================================ Quote Link to comment
Joe L. Posted December 4, 2011 Share Posted December 4, 2011 Hello i was just wondering if these results were good for this old hitachi hard drive i had laying around to use as a cache drive on my unraid thank you in advance. ========================================================================1.13 == invoked as: ./preclear_disk.sh -a -M 4 /dev/sdg == HDS725050KLA360 KRVN68ZAHVNDVG == Disk /dev/sdg has been successfully precleared == with a starting sector of 63 == Ran 1 cycle == == Using :Read block size = 8225280 Bytes == Last Cycle's Pre Read Time : 3:05:37 (44 MB/s) == Last Cycle's Zeroing time : 2:56:09 (47 MB/s) == Last Cycle's Post Read Time : 7:15:34 (19 MB/s) == Last Cycle's Total Time : 13:18:36 == == Total Elapsed Time 13:18:36 == == Disk Start Temperature: 33C == == Current Disk Temperature: 35C, == ============================================================================ ** Changed attributes in files: /tmp/smart_start_sdg /tmp/smart_finish_sdg ATTRIBUTE NEW_VAL OLD_VAL FAILURE_THRESHOLD STATUS RAW_VALUE Temperature_Celsius = 157 166 0 ok 35 No SMART attributes are FAILING_NOW 0 sectors were pending re-allocation before the start of the preclear. 0 sectors were pending re-allocation after pre-read in cycle 1 of 1. 0 sectors were pending re-allocation after zero of disk in cycle 1 of 1. 0 sectors are pending re-allocation at the end of the preclear, the number of sectors pending re-allocation did not change. 28 sectors had been re-allocated before the start of the preclear. 31 sectors are re-allocated at the end of the preclear, a change of 3 in the number of sectors re-allocated. ============================================================================ ============================================================================ == == S.M.A.R.T Initial Report for /dev/sdg == Disk: /dev/sdg smartctl 5.39.1 2010-01-28 r3054 [i486-slackware-linux-gnu] (local build) Copyright (C) 2002-10 by Bruce Allen, http://smartmontools.sourceforge.net === START OF INFORMATION SECTION === Model Family: Hitachi Deskstar 7K500 series Device Model: HDS725050KLA360 Serial Number: KRVN68ZAHVNDVG Firmware Version: K2AOAB0A User Capacity: 500,107,862,016 bytes Device is: In smartctl database [for details use: -P show] ATA Version is: 7 ATA Standard is: ATA/ATAPI-7 T13 1532D revision 1 Local Time is: Sun Dec 4 01:20:07 2011 Local time zone must be set--see zic m SMART support is: Available - device has SMART capability. SMART support is: Enabled === START OF READ SMART DATA SECTION === SMART overall-health self-assessment test result: PASSED General SMART Values: Offline data collection status: (0x00) Offline data collection activity was never started. Auto Offline Data Collection: Disabled. Self-test execution status: ( 0) The previous self-test routine completed without error or no self-test has ever been run. Total time to complete Offline data collection: (10419) seconds. Offline data collection capabilities: (0x5b) SMART execute Offline immediate. Auto Offline data collection on/off support. Suspend Offline collection upon new command. Offline surface scan supported. Self-test supported. No Conveyance Self-test supported. Selective Self-test supported. SMART capabilities: (0x0003) Saves SMART data before entering power-saving mode. Supports SMART auto save timer. Error logging capability: (0x01) Error logging supported. General Purpose Logging supported. Short self-test routine recommended polling time: ( 1) minutes. Extended self-test routine recommended polling time: ( 174) minutes. SCT capabilities: (0x003f) SCT Status supported. SCT Feature Control supported. SCT Data Table supported. SMART Attributes Data Structure revision number: 16 Vendor Specific SMART Attributes with Thresholds: ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE 1 Raw_Read_Error_Rate 0x000b 100 100 016 Pre-fail Always - 0 2 Throughput_Performance 0x0005 100 100 050 Pre-fail Offline - 0 3 Spin_Up_Time 0x0007 136 136 024 Pre-fail Always - 599 (Average 425) 4 Start_Stop_Count 0x0012 100 100 000 Old_age Always - 890 5 Reallocated_Sector_Ct 0x0033 100 100 005 Pre-fail Always - 28 7 Seek_Error_Rate 0x000b 100 100 067 Pre-fail Always - 0 8 Seek_Time_Performance 0x0005 100 100 020 Pre-fail Offline - 0 9 Power_On_Hours 0x0012 096 096 000 Old_age Always - 28387 10 Spin_Retry_Count 0x0013 100 100 060 Pre-fail Always - 0 12 Power_Cycle_Count 0x0032 100 100 000 Old_age Always - 769 192 Power-Off_Retract_Count 0x0032 099 099 050 Old_age Always - 2050 193 Load_Cycle_Count 0x0012 099 099 050 Old_age Always - 2050 194 Temperature_Celsius 0x0002 166 166 000 Old_age Always - 33 (Lifetime Min/Max 12/56) 196 Reallocated_Event_Count 0x0032 100 100 000 Old_age Always - 28 197 Current_Pending_Sector 0x0022 100 100 000 Old_age Always - 0 198 Offline_Uncorrectable 0x0008 100 100 000 Old_age Offline - 0 199 UDMA_CRC_Error_Count 0x000a 200 253 000 Old_age Always - 433 SMART Error Log Version: 1 No Errors Logged SMART Self-test log structure revision number 1 Num Test_Description Status Remaining LifeTime(hours) LBA_of_first_error # 1 Short offline Interrupted (host reset) 40% 20815 - # 2 Short captive Completed without error 00% 18722 - # 3 Short captive Completed without error 00% 18465 - # 4 Extended captive Interrupted (host reset) 90% 18465 - # 5 Short captive Completed without error 00% 13786 - Warning! SMART Selective Self-Test Log Structure error: invalid SMART checksum. SMART Selective self-test log data structure revision number 1 SPAN MIN_LBA MAX_LBA CURRENT_TEST_STATUS 1 0 0 Not_testing 2 0 0 Not_testing 3 0 0 Not_testing 4 0 0 Not_testing 5 0 0 Not_testing Selective self-test flags (0x0): After scanning selected spans, do NOT read-scan remainder of disk. If Selective self-test is pending on power-up, resume after 0 minute delay. == ============================================================================ ============================================================================ == == S.M.A.R.T Final Report for /dev/sdg == Disk: /dev/sdg smartctl 5.39.1 2010-01-28 r3054 [i486-slackware-linux-gnu] (local build) Copyright (C) 2002-10 by Bruce Allen, http://smartmontools.sourceforge.net === START OF INFORMATION SECTION === Model Family: Hitachi Deskstar 7K500 series Device Model: HDS725050KLA360 Serial Number: KRVN68ZAHVNDVG Firmware Version: K2AOAB0A User Capacity: 500,107,862,016 bytes Device is: In smartctl database [for details use: -P show] ATA Version is: 7 ATA Standard is: ATA/ATAPI-7 T13 1532D revision 1 Local Time is: Sun Dec 4 14:38:43 2011 Local time zone must be set--see zic m SMART support is: Available - device has SMART capability. SMART support is: Enabled === START OF READ SMART DATA SECTION === SMART overall-health self-assessment test result: PASSED General SMART Values: Offline data collection status: (0x00) Offline data collection activity was never started. Auto Offline Data Collection: Disabled. Self-test execution status: ( 0) The previous self-test routine completed without error or no self-test has ever been run. Total time to complete Offline data collection: (10419) seconds. Offline data collection capabilities: (0x5b) SMART execute Offline immediate. Auto Offline data collection on/off support. Suspend Offline collection upon new command. Offline surface scan supported. Self-test supported. No Conveyance Self-test supported. Selective Self-test supported. SMART capabilities: (0x0003) Saves SMART data before entering power-saving mode. Supports SMART auto save timer. Error logging capability: (0x01) Error logging supported. General Purpose Logging supported. Short self-test routine recommended polling time: ( 1) minutes. Extended self-test routine recommended polling time: ( 174) minutes. SCT capabilities: (0x003f) SCT Status supported. SCT Feature Control supported. SCT Data Table supported. SMART Attributes Data Structure revision number: 16 Vendor Specific SMART Attributes with Thresholds: ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE 1 Raw_Read_Error_Rate 0x000b 100 100 016 Pre-fail Always - 65536 2 Throughput_Performance 0x0005 100 100 050 Pre-fail Offline - 0 3 Spin_Up_Time 0x0007 136 136 024 Pre-fail Always - 599 (Average 425) 4 Start_Stop_Count 0x0012 100 100 000 Old_age Always - 890 5 Reallocated_Sector_Ct 0x0033 100 100 005 Pre-fail Always - 31 7 Seek_Error_Rate 0x000b 100 100 067 Pre-fail Always - 0 8 Seek_Time_Performance 0x0005 100 100 020 Pre-fail Offline - 0 9 Power_On_Hours 0x0012 096 096 000 Old_age Always - 28400 10 Spin_Retry_Count 0x0013 100 100 060 Pre-fail Always - 0 12 Power_Cycle_Count 0x0032 100 100 000 Old_age Always - 769 192 Power-Off_Retract_Count 0x0032 099 099 050 Old_age Always - 2050 193 Load_Cycle_Count 0x0012 099 099 050 Old_age Always - 2050 194 Temperature_Celsius 0x0002 157 157 000 Old_age Always - 35 (Lifetime Min/Max 12/56) 196 Reallocated_Event_Count 0x0032 100 100 000 Old_age Always - 31 197 Current_Pending_Sector 0x0022 100 100 000 Old_age Always - 0 198 Offline_Uncorrectable 0x0008 100 100 000 Old_age Offline - 0 199 UDMA_CRC_Error_Count 0x000a 200 253 000 Old_age Always - 433 SMART Error Log Version: 1 No Errors Logged SMART Self-test log structure revision number 1 Num Test_Description Status Remaining LifeTime(hours) LBA_of_first_error # 1 Short offline Interrupted (host reset) 40% 20815 - # 2 Short captive Completed without error 00% 18722 - # 3 Short captive Completed without error 00% 18465 - # 4 Extended captive Interrupted (host reset) 90% 18465 - # 5 Short captive Completed without error 00% 13786 - Warning! SMART Selective Self-Test Log Structure error: invalid SMART checksum. SMART Selective self-test log data structure revision number 1 SPAN MIN_LBA MAX_LBA CURRENT_TEST_STATUS 1 0 0 Not_testing 2 0 0 Not_testing 3 0 0 Not_testing 4 0 0 Not_testing 5 0 0 Not_testing Selective self-test flags (0x0): After scanning selected spans, do NOT read-scan remainder of disk. If Selective self-test is pending on power-up, resume after 0 minute delay. == ============================================================================ Looks pretty decent. There were 28 sectors realocated before yhou had performed the preclear. Three more were re-allocated during the pre-clear. I'd give it one more cycle. If additional sectors are re-allocated, then I'd not trust the drive for anything critical. If it remains at 31, you look good to go. Quote Link to comment
cherritaker Posted December 4, 2011 Share Posted December 4, 2011 Ok i will run it again before i go to work tomorrow since the unraid will be used for now by critical you mean storage and cache or what? Quote Link to comment
Joe L. Posted December 4, 2011 Share Posted December 4, 2011 Ok i will run it again before i go to work tomorrow since the unraid will be used for now by critical you mean storage and cache or what? The results of your pre-clear indicated that three of your sectors were not readable. The drive marked those as needing re-allocation, so when the drive was zeroed, they were re-allocated. That is good. If you had not performed the preclear, it is possible for a file or files to have been written that was not readable. Or, a file-system structure written that could not be read. Either of those could have resulted in lost files, or corrupted files. Now, do you trust the drive? It is a know fact that drives with re-allocated sectors are more likely yo have additional re-allocated sectors in the future. The question is, is the disk stable, or will additional sectors be un-readable when you start to use it. The only way to know is to either use it for real data, or run it through another preclear cycle. Joe L. Quote Link to comment
MvL Posted December 5, 2011 Share Posted December 5, 2011 I suppose this is answered before but couldn't find the answer. I'm preclear 4 2TB drives right now and the scrips is by step post-reads. How long does this take? Is this just one cycle with the standard settings. I'm a bit confused by the explanation in the preclear thread. Quote Link to comment
Joe L. Posted December 5, 2011 Share Posted December 5, 2011 I suppose this is answered before but couldn't find the answer. I'm preclear 4 2TB drives right now and the scrips is by step post-reads. How long does this take? Is this just one cycle with the standard settings. I'm a bit confused by the explanation in the preclear thread. It entirely depends on your hardware. Somewhere around 25 to 35 hours is normal for a single 2TB drive. See here in the wiki: http://lime-technology.com/wiki/index.php?title=User_Benchmarks#Preclear_Times If there are no bottlenecks in I/O to the drives, 4 concurrent pre-clears should not take a lot more time than one. But... if the disk controller waits on one for another, it could take longer. Quote Link to comment
MvL Posted December 5, 2011 Share Posted December 5, 2011 It entirely depends on your hardware. Somewhere around 25 to 35 hours is normal for a single 2TB drive. See here in the wiki: http://lime-technology.com/wiki/index.php?title=User_Benchmarks#Preclear_Times If there are no bottlenecks in I/O to the drives, 4 concurrent pre-clears should not take a lot more time than one. But... if the disk controller waits on one for another, it could take longer. Did i understand correctly that the post-reads step does 20 cycles? Nice information in the wiki didn't know there was information about preclear. Some reading to do. Thank you! Quote Link to comment
Joe L. Posted December 5, 2011 Share Posted December 5, 2011 It entirely depends on your hardware. Somewhere around 25 to 35 hours is normal for a single 2TB drive. See here in the wiki: http://lime-technology.com/wiki/index.php?title=User_Benchmarks#Preclear_Times If there are no bottlenecks in I/O to the drives, 4 concurrent pre-clears should not take a lot more time than one. But... if the disk controller waits on one for another, it could take longer. Did i understand correctly that the post-reads step does 20 cycles? No, it does one cycle. However it can go up to 20 cycles if you use the -c NN option. (where NN is a number between 1 and 20) Quote Link to comment
cherritaker Posted December 6, 2011 Share Posted December 6, 2011 Ok joe this is the 2nd pre clear on the cache drive i only saw 1 sector changed what can you tell me about this would it be ok to use or not? ========================================================================1.13 == invoked as: ./preclear_disk.sh -a -M 4 /dev/sdg == HDS725050KLA360 KRVN68ZAHVNDVG == Disk /dev/sdg has been successfully precleared == with a starting sector of 63 == Ran 1 cycle == == Using :Read block size = 8225280 Bytes == Last Cycle's Pre Read Time : 3:05:20 (44 MB/s) == Last Cycle's Zeroing time : 2:56:18 (47 MB/s) == Last Cycle's Post Read Time : 7:16:18 (19 MB/s) == Last Cycle's Total Time : 13:19:12 == == Total Elapsed Time 13:19:12 == == Disk Start Temperature: 34C == == Current Disk Temperature: 35C, == ============================================================================ ** Changed attributes in files: /tmp/smart_start_sdg /tmp/smart_finish_sdg ATTRIBUTE NEW_VAL OLD_VAL FAILURE_THRESHOLD STATUS RAW_VALUE Temperature_Celsius = 157 161 0 ok 35 No SMART attributes are FAILING_NOW 0 sectors were pending re-allocation before the start of the preclear. 0 sectors were pending re-allocation after pre-read in cycle 1 of 1. 0 sectors were pending re-allocation after zero of disk in cycle 1 of 1. 0 sectors are pending re-allocation at the end of the preclear, the number of sectors pending re-allocation did not change. 31 sectors had been re-allocated before the start of the preclear. 32 sectors are re-allocated at the end of the preclear, a change of 1 in the number of sectors re-allocated. ============================================================================ ============================================================================ == == S.M.A.R.T Initial Report for /dev/sdg == Disk: /dev/sdg smartctl 5.39.1 2010-01-28 r3054 [i486-slackware-linux-gnu] (local build) Copyright (C) 2002-10 by Bruce Allen, http://smartmontools.sourceforge.net === START OF INFORMATION SECTION === Model Family: Hitachi Deskstar 7K500 series Device Model: HDS725050KLA360 Serial Number: KRVN68ZAHVNDVG Firmware Version: K2AOAB0A User Capacity: 500,107,862,016 bytes Device is: In smartctl database [for details use: -P show] ATA Version is: 7 ATA Standard is: ATA/ATAPI-7 T13 1532D revision 1 Local Time is: Sun Dec 4 21:59:50 2011 Local time zone must be set--see zic m SMART support is: Available - device has SMART capability. SMART support is: Enabled === START OF READ SMART DATA SECTION === SMART overall-health self-assessment test result: PASSED General SMART Values: Offline data collection status: (0x00) Offline data collection activity was never started. Auto Offline Data Collection: Disabled. Self-test execution status: ( 0) The previous self-test routine completed without error or no self-test has ever been run. Total time to complete Offline data collection: (10419) seconds. Offline data collection capabilities: (0x5b) SMART execute Offline immediate. Auto Offline data collection on/off support. Suspend Offline collection upon new command. Offline surface scan supported. Self-test supported. No Conveyance Self-test supported. Selective Self-test supported. SMART capabilities: (0x0003) Saves SMART data before entering power-saving mode. Supports SMART auto save timer. Error logging capability: (0x01) Error logging supported. General Purpose Logging supported. Short self-test routine recommended polling time: ( 1) minutes. Extended self-test routine recommended polling time: ( 174) minutes. SCT capabilities: (0x003f) SCT Status supported. SCT Feature Control supported. SCT Data Table supported. SMART Attributes Data Structure revision number: 16 Vendor Specific SMART Attributes with Thresholds: ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE 1 Raw_Read_Error_Rate 0x000b 100 100 016 Pre-fail Always - 65536 2 Throughput_Performance 0x0005 100 100 050 Pre-fail Offline - 0 3 Spin_Up_Time 0x0007 136 136 024 Pre-fail Always - 599 (Average 425) 4 Start_Stop_Count 0x0012 100 100 000 Old_age Always - 890 5 Reallocated_Sector_Ct 0x0033 100 100 005 Pre-fail Always - 31 7 Seek_Error_Rate 0x000b 100 100 067 Pre-fail Always - 0 8 Seek_Time_Performance 0x0005 100 100 020 Pre-fail Offline - 0 9 Power_On_Hours 0x0012 096 096 000 Old_age Always - 28408 10 Spin_Retry_Count 0x0013 100 100 060 Pre-fail Always - 0 12 Power_Cycle_Count 0x0032 100 100 000 Old_age Always - 769 192 Power-Off_Retract_Count 0x0032 099 099 050 Old_age Always - 2051 193 Load_Cycle_Count 0x0012 099 099 050 Old_age Always - 2051 194 Temperature_Celsius 0x0002 161 161 000 Old_age Always - 34 (Lifetime Min/Max 12/56) 196 Reallocated_Event_Count 0x0032 100 100 000 Old_age Always - 31 197 Current_Pending_Sector 0x0022 100 100 000 Old_age Always - 0 198 Offline_Uncorrectable 0x0008 100 100 000 Old_age Offline - 0 199 UDMA_CRC_Error_Count 0x000a 200 253 000 Old_age Always - 433 SMART Error Log Version: 1 No Errors Logged SMART Self-test log structure revision number 1 Num Test_Description Status Remaining LifeTime(hours) LBA_of_first_error # 1 Short offline Interrupted (host reset) 40% 20815 - # 2 Short captive Completed without error 00% 18722 - # 3 Short captive Completed without error 00% 18465 - # 4 Extended captive Interrupted (host reset) 90% 18465 - # 5 Short captive Completed without error 00% 13786 - Warning! SMART Selective Self-Test Log Structure error: invalid SMART checksum. SMART Selective self-test log data structure revision number 1 SPAN MIN_LBA MAX_LBA CURRENT_TEST_STATUS 1 0 0 Not_testing 2 0 0 Not_testing 3 0 0 Not_testing 4 0 0 Not_testing 5 0 0 Not_testing Selective self-test flags (0x0): After scanning selected spans, do NOT read-scan remainder of disk. If Selective self-test is pending on power-up, resume after 0 minute delay. == ============================================================================ ============================================================================ == == S.M.A.R.T Final Report for /dev/sdg == Disk: /dev/sdg smartctl 5.39.1 2010-01-28 r3054 [i486-slackware-linux-gnu] (local build) Copyright (C) 2002-10 by Bruce Allen, http://smartmontools.sourceforge.net === START OF INFORMATION SECTION === Model Family: Hitachi Deskstar 7K500 series Device Model: HDS725050KLA360 Serial Number: KRVN68ZAHVNDVG Firmware Version: K2AOAB0A User Capacity: 500,107,862,016 bytes Device is: In smartctl database [for details use: -P show] ATA Version is: 7 ATA Standard is: ATA/ATAPI-7 T13 1532D revision 1 Local Time is: Mon Dec 5 11:19:00 2011 Local time zone must be set--see zic m SMART support is: Available - device has SMART capability. SMART support is: Enabled === START OF READ SMART DATA SECTION === SMART overall-health self-assessment test result: PASSED General SMART Values: Offline data collection status: (0x00) Offline data collection activity was never started. Auto Offline Data Collection: Disabled. Self-test execution status: ( 0) The previous self-test routine completed without error or no self-test has ever been run. Total time to complete Offline data collection: (10419) seconds. Offline data collection capabilities: (0x5b) SMART execute Offline immediate. Auto Offline data collection on/off support. Suspend Offline collection upon new command. Offline surface scan supported. Self-test supported. No Conveyance Self-test supported. Selective Self-test supported. SMART capabilities: (0x0003) Saves SMART data before entering power-saving mode. Supports SMART auto save timer. Error logging capability: (0x01) Error logging supported. General Purpose Logging supported. Short self-test routine recommended polling time: ( 1) minutes. Extended self-test routine recommended polling time: ( 174) minutes. SCT capabilities: (0x003f) SCT Status supported. SCT Feature Control supported. SCT Data Table supported. SMART Attributes Data Structure revision number: 16 Vendor Specific SMART Attributes with Thresholds: ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE 1 Raw_Read_Error_Rate 0x000b 100 100 016 Pre-fail Always - 0 2 Throughput_Performance 0x0005 100 100 050 Pre-fail Offline - 0 3 Spin_Up_Time 0x0007 136 136 024 Pre-fail Always - 599 (Average 425) 4 Start_Stop_Count 0x0012 100 100 000 Old_age Always - 890 5 Reallocated_Sector_Ct 0x0033 100 100 005 Pre-fail Always - 32 7 Seek_Error_Rate 0x000b 100 100 067 Pre-fail Always - 0 8 Seek_Time_Performance 0x0005 100 100 020 Pre-fail Offline - 0 9 Power_On_Hours 0x0012 096 096 000 Old_age Always - 28421 10 Spin_Retry_Count 0x0013 100 100 060 Pre-fail Always - 0 12 Power_Cycle_Count 0x0032 100 100 000 Old_age Always - 769 192 Power-Off_Retract_Count 0x0032 099 099 050 Old_age Always - 2051 193 Load_Cycle_Count 0x0012 099 099 050 Old_age Always - 2051 194 Temperature_Celsius 0x0002 157 157 000 Old_age Always - 35 (Lifetime Min/Max 12/56) 196 Reallocated_Event_Count 0x0032 100 100 000 Old_age Always - 32 197 Current_Pending_Sector 0x0022 100 100 000 Old_age Always - 0 198 Offline_Uncorrectable 0x0008 100 100 000 Old_age Offline - 0 199 UDMA_CRC_Error_Count 0x000a 200 253 000 Old_age Always - 433 SMART Error Log Version: 1 No Errors Logged SMART Self-test log structure revision number 1 Num Test_Description Status Remaining LifeTime(hours) LBA_of_first_error # 1 Short offline Interrupted (host reset) 40% 20815 - # 2 Short captive Completed without error 00% 18722 - # 3 Short captive Completed without error 00% 18465 - # 4 Extended captive Interrupted (host reset) 90% 18465 - # 5 Short captive Completed without error 00% 13786 - Warning! SMART Selective Self-Test Log Structure error: invalid SMART checksum. SMART Selective self-test log data structure revision number 1 SPAN MIN_LBA MAX_LBA CURRENT_TEST_STATUS 1 0 0 Not_testing 2 0 0 Not_testing 3 0 0 Not_testing 4 0 0 Not_testing 5 0 0 Not_testing Selective self-test flags (0x0): After scanning selected spans, do NOT read-scan remainder of disk. If Selective self-test is pending on power-up, resume after 0 minute delay. == ============================================================================ Quote Link to comment
Joe L. Posted December 6, 2011 Share Posted December 6, 2011 Ok joe this is the 2nd pre clear on the cache drive i only saw 1 sector changed what can you tell me about this would it be ok to use or not? ========================================================================1.13 == invoked as: ./preclear_disk.sh -a -M 4 /dev/sdg == HDS725050KLA360 KRVN68ZAHVNDVG == Disk /dev/sdg has been successfully precleared == with a starting sector of 63 == Ran 1 cycle == == Using :Read block size = 8225280 Bytes == Last Cycle's Pre Read Time : 3:05:20 (44 MB/s) == Last Cycle's Zeroing time : 2:56:18 (47 MB/s) == Last Cycle's Post Read Time : 7:16:18 (19 MB/s) == Last Cycle's Total Time : 13:19:12 == == Total Elapsed Time 13:19:12 == == Disk Start Temperature: 34C == == Current Disk Temperature: 35C, == ============================================================================ ** Changed attributes in files: /tmp/smart_start_sdg /tmp/smart_finish_sdg ATTRIBUTE NEW_VAL OLD_VAL FAILURE_THRESHOLD STATUS RAW_VALUE Temperature_Celsius = 157 161 0 ok 35 No SMART attributes are FAILING_NOW 0 sectors were pending re-allocation before the start of the preclear. 0 sectors were pending re-allocation after pre-read in cycle 1 of 1. 0 sectors were pending re-allocation after zero of disk in cycle 1 of 1. 0 sectors are pending re-allocation at the end of the preclear, the number of sectors pending re-allocation did not change. 31 sectors had been re-allocated before the start of the preclear. 32 sectors are re-allocated at the end of the preclear, a change of 1 in the number of sectors re-allocated. ============================================================================ ============================================================================ == == S.M.A.R.T Initial Report for /dev/sdg == Disk: /dev/sdg smartctl 5.39.1 2010-01-28 r3054 [i486-slackware-linux-gnu] (local build) Copyright (C) 2002-10 by Bruce Allen, http://smartmontools.sourceforge.net === START OF INFORMATION SECTION === Model Family: Hitachi Deskstar 7K500 series Device Model: HDS725050KLA360 Serial Number: KRVN68ZAHVNDVG Firmware Version: K2AOAB0A User Capacity: 500,107,862,016 bytes Device is: In smartctl database [for details use: -P show] ATA Version is: 7 ATA Standard is: ATA/ATAPI-7 T13 1532D revision 1 Local Time is: Sun Dec 4 21:59:50 2011 Local time zone must be set--see zic m SMART support is: Available - device has SMART capability. SMART support is: Enabled === START OF READ SMART DATA SECTION === SMART overall-health self-assessment test result: PASSED General SMART Values: Offline data collection status: (0x00) Offline data collection activity was never started. Auto Offline Data Collection: Disabled. Self-test execution status: ( 0) The previous self-test routine completed without error or no self-test has ever been run. Total time to complete Offline data collection: (10419) seconds. Offline data collection capabilities: (0x5b) SMART execute Offline immediate. Auto Offline data collection on/off support. Suspend Offline collection upon new command. Offline surface scan supported. Self-test supported. No Conveyance Self-test supported. Selective Self-test supported. SMART capabilities: (0x0003) Saves SMART data before entering power-saving mode. Supports SMART auto save timer. Error logging capability: (0x01) Error logging supported. General Purpose Logging supported. Short self-test routine recommended polling time: ( 1) minutes. Extended self-test routine recommended polling time: ( 174) minutes. SCT capabilities: (0x003f) SCT Status supported. SCT Feature Control supported. SCT Data Table supported. SMART Attributes Data Structure revision number: 16 Vendor Specific SMART Attributes with Thresholds: ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE 1 Raw_Read_Error_Rate 0x000b 100 100 016 Pre-fail Always - 65536 2 Throughput_Performance 0x0005 100 100 050 Pre-fail Offline - 0 3 Spin_Up_Time 0x0007 136 136 024 Pre-fail Always - 599 (Average 425) 4 Start_Stop_Count 0x0012 100 100 000 Old_age Always - 890 5 Reallocated_Sector_Ct 0x0033 100 100 005 Pre-fail Always - 31 7 Seek_Error_Rate 0x000b 100 100 067 Pre-fail Always - 0 8 Seek_Time_Performance 0x0005 100 100 020 Pre-fail Offline - 0 9 Power_On_Hours 0x0012 096 096 000 Old_age Always - 28408 10 Spin_Retry_Count 0x0013 100 100 060 Pre-fail Always - 0 12 Power_Cycle_Count 0x0032 100 100 000 Old_age Always - 769 192 Power-Off_Retract_Count 0x0032 099 099 050 Old_age Always - 2051 193 Load_Cycle_Count 0x0012 099 099 050 Old_age Always - 2051 194 Temperature_Celsius 0x0002 161 161 000 Old_age Always - 34 (Lifetime Min/Max 12/56) 196 Reallocated_Event_Count 0x0032 100 100 000 Old_age Always - 31 197 Current_Pending_Sector 0x0022 100 100 000 Old_age Always - 0 198 Offline_Uncorrectable 0x0008 100 100 000 Old_age Offline - 0 199 UDMA_CRC_Error_Count 0x000a 200 253 000 Old_age Always - 433 SMART Error Log Version: 1 No Errors Logged SMART Self-test log structure revision number 1 Num Test_Description Status Remaining LifeTime(hours) LBA_of_first_error # 1 Short offline Interrupted (host reset) 40% 20815 - # 2 Short captive Completed without error 00% 18722 - # 3 Short captive Completed without error 00% 18465 - # 4 Extended captive Interrupted (host reset) 90% 18465 - # 5 Short captive Completed without error 00% 13786 - Warning! SMART Selective Self-Test Log Structure error: invalid SMART checksum. SMART Selective self-test log data structure revision number 1 SPAN MIN_LBA MAX_LBA CURRENT_TEST_STATUS 1 0 0 Not_testing 2 0 0 Not_testing 3 0 0 Not_testing 4 0 0 Not_testing 5 0 0 Not_testing Selective self-test flags (0x0): After scanning selected spans, do NOT read-scan remainder of disk. If Selective self-test is pending on power-up, resume after 0 minute delay. == ============================================================================ ============================================================================ == == S.M.A.R.T Final Report for /dev/sdg == Disk: /dev/sdg smartctl 5.39.1 2010-01-28 r3054 [i486-slackware-linux-gnu] (local build) Copyright (C) 2002-10 by Bruce Allen, http://smartmontools.sourceforge.net === START OF INFORMATION SECTION === Model Family: Hitachi Deskstar 7K500 series Device Model: HDS725050KLA360 Serial Number: KRVN68ZAHVNDVG Firmware Version: K2AOAB0A User Capacity: 500,107,862,016 bytes Device is: In smartctl database [for details use: -P show] ATA Version is: 7 ATA Standard is: ATA/ATAPI-7 T13 1532D revision 1 Local Time is: Mon Dec 5 11:19:00 2011 Local time zone must be set--see zic m SMART support is: Available - device has SMART capability. SMART support is: Enabled === START OF READ SMART DATA SECTION === SMART overall-health self-assessment test result: PASSED General SMART Values: Offline data collection status: (0x00) Offline data collection activity was never started. Auto Offline Data Collection: Disabled. Self-test execution status: ( 0) The previous self-test routine completed without error or no self-test has ever been run. Total time to complete Offline data collection: (10419) seconds. Offline data collection capabilities: (0x5b) SMART execute Offline immediate. Auto Offline data collection on/off support. Suspend Offline collection upon new command. Offline surface scan supported. Self-test supported. No Conveyance Self-test supported. Selective Self-test supported. SMART capabilities: (0x0003) Saves SMART data before entering power-saving mode. Supports SMART auto save timer. Error logging capability: (0x01) Error logging supported. General Purpose Logging supported. Short self-test routine recommended polling time: ( 1) minutes. Extended self-test routine recommended polling time: ( 174) minutes. SCT capabilities: (0x003f) SCT Status supported. SCT Feature Control supported. SCT Data Table supported. SMART Attributes Data Structure revision number: 16 Vendor Specific SMART Attributes with Thresholds: ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE 1 Raw_Read_Error_Rate 0x000b 100 100 016 Pre-fail Always - 0 2 Throughput_Performance 0x0005 100 100 050 Pre-fail Offline - 0 3 Spin_Up_Time 0x0007 136 136 024 Pre-fail Always - 599 (Average 425) 4 Start_Stop_Count 0x0012 100 100 000 Old_age Always - 890 5 Reallocated_Sector_Ct 0x0033 100 100 005 Pre-fail Always - 32 7 Seek_Error_Rate 0x000b 100 100 067 Pre-fail Always - 0 8 Seek_Time_Performance 0x0005 100 100 020 Pre-fail Offline - 0 9 Power_On_Hours 0x0012 096 096 000 Old_age Always - 28421 10 Spin_Retry_Count 0x0013 100 100 060 Pre-fail Always - 0 12 Power_Cycle_Count 0x0032 100 100 000 Old_age Always - 769 192 Power-Off_Retract_Count 0x0032 099 099 050 Old_age Always - 2051 193 Load_Cycle_Count 0x0012 099 099 050 Old_age Always - 2051 194 Temperature_Celsius 0x0002 157 157 000 Old_age Always - 35 (Lifetime Min/Max 12/56) 196 Reallocated_Event_Count 0x0032 100 100 000 Old_age Always - 32 197 Current_Pending_Sector 0x0022 100 100 000 Old_age Always - 0 198 Offline_Uncorrectable 0x0008 100 100 000 Old_age Offline - 0 199 UDMA_CRC_Error_Count 0x000a 200 253 000 Old_age Always - 433 SMART Error Log Version: 1 No Errors Logged SMART Self-test log structure revision number 1 Num Test_Description Status Remaining LifeTime(hours) LBA_of_first_error # 1 Short offline Interrupted (host reset) 40% 20815 - # 2 Short captive Completed without error 00% 18722 - # 3 Short captive Completed without error 00% 18465 - # 4 Extended captive Interrupted (host reset) 90% 18465 - # 5 Short captive Completed without error 00% 13786 - Warning! SMART Selective Self-Test Log Structure error: invalid SMART checksum. SMART Selective self-test log data structure revision number 1 SPAN MIN_LBA MAX_LBA CURRENT_TEST_STATUS 1 0 0 Not_testing 2 0 0 Not_testing 3 0 0 Not_testing 4 0 0 Not_testing 5 0 0 Not_testing Selective self-test flags (0x0): After scanning selected spans, do NOT read-scan remainder of disk. If Selective self-test is pending on power-up, resume after 0 minute delay. == ============================================================================ If the disk were stable you would not see any additional un-readable sectors. I'd say at this point for ME to think of it as OK to use for anything critical it would have to pass several pre-clear cycles with NO change in un-readable sectors. It is entirely up to you. Do you feel lucky? Joe L. Quote Link to comment
MvL Posted December 6, 2011 Share Posted December 6, 2011 No, it does one cycle. However it can go up to 20 cycles if you use the -c NN option. (where NN is a number between 1 and 20) How does it work? The script: 1. gets a SMART report 2. pre-reads the entire disk 3. writes zeros to the entire disk 4. sets the special signature recognized by unRAID 5. verifies the signature 6. post-reads the entire disk 7. repeats the process for up to 20 cycles 8. gets a final SMART report 9. compares the SMART reports alerting you of differences. That was a bit confusing. That's why i asked about the cycles. Thanks Joe L. The four drives are now precleared. It took 25 - 30 hours to complete like you said. Quote Link to comment
Joe L. Posted December 6, 2011 Share Posted December 6, 2011 No, it does one cycle. However it can go up to 20 cycles if you use the -c NN option. (where NN is a number between 1 and 20) How does it work? The script: 1. gets a SMART report 2. pre-reads the entire disk 3. writes zeros to the entire disk 4. sets the special signature recognized by unRAID 5. verifies the signature 6. post-reads the entire disk 7. repeats the process for up to 20 cycles 8. gets a final SMART report 9. compares the SMART reports alerting you of differences. That was a bit confusing. That's why i asked about the cycles. Thanks Joe L. The four drives are now precleared. It took 25 - 30 hours to complete like you said. I changed that post to now say: 7. optionally repeats the process for additional cycles (if you specified the "-c NN" option, where NN = a number from 1 to 20, default is to run 1 cycle) hopefully, it is more clear. Quote Link to comment
mrjoshzombie Posted December 6, 2011 Share Posted December 6, 2011 I seem to be having a bit of an issue using my preclear script on one drive (a 2TB WDEARS drive, sdh). I've tried numerous times to get it to finishing preclearing under 5.12a and 5.13, but around 91% my telnet session disconnects. When I log back in and use 'screen -r' it'll go back to the process, with no changes in time/percentage/temp/anything, and disconnect a few moments later. I also noticed my system logs were filling up my flash drive... a single .txt files was expanding to 1.2GB+ a piece! It seems one error would repeat itself hundreds of times a second, filing up the file. I managed to trim it down and attached you'll see the log. I also tossed in a screenshot of my telnet session after it disconnects. My shares are still active, I can access the server from the web GUI, but I cannot telnet in, and it seems the preclear never finished (it last disconnected about 24 hours ago.) There are no preclear logs on my flash drive. I'm thinking it's time to RMA the drive, but I wanted to get word for the experts before I send it out. logs_2.zip Quote Link to comment
Joe L. Posted December 6, 2011 Share Posted December 6, 2011 I seem to be having a bit of an issue using my preclear script on one drive (a 2TB WDEARS drive, sdh). I've tried numerous times to get it to finishing preclearing under 5.12a and 5.13, but around 91% my telnet session disconnects. When I log back in and use 'screen -r' it'll go back to the process, with no changes in time/percentage/temp/anything, and disconnect a few moments later. I also noticed my system logs were filling up my flash drive... a single .txt files was expanding to 1.2GB+ a piece! It seems one error would repeat itself hundreds of times a second, filing up the file. I managed to trim it down and attached you'll see the log. I also tossed in a screenshot of my telnet session after it disconnects. My shares are still active, I can access the server from the web GUI, but I cannot telnet in, and it seems the preclear never finished (it last disconnected about 24 hours ago.) There are no preclear logs on my flash drive. I'm thinking it's time to RMA the drive, but I wanted to get word for the experts before I send it out. you are having I/O errors on /dev/sdh. Those are filling the syslog to where you are running out of available memory, so the out-of-memory process in the kernel is killing off processes in an attempt to free some memory. (and killing off your login shell, the pre-clear process, and probably your screen session too) Quote Link to comment
mrjoshzombie Posted December 6, 2011 Share Posted December 6, 2011 I seem to be having a bit of an issue using my preclear script on one drive (a 2TB WDEARS drive, sdh). I've tried numerous times to get it to finishing preclearing under 5.12a and 5.13, but around 91% my telnet session disconnects. When I log back in and use 'screen -r' it'll go back to the process, with no changes in time/percentage/temp/anything, and disconnect a few moments later. I also noticed my system logs were filling up my flash drive... a single .txt files was expanding to 1.2GB+ a piece! It seems one error would repeat itself hundreds of times a second, filing up the file. I managed to trim it down and attached you'll see the log. I also tossed in a screenshot of my telnet session after it disconnects. My shares are still active, I can access the server from the web GUI, but I cannot telnet in, and it seems the preclear never finished (it last disconnected about 24 hours ago.) There are no preclear logs on my flash drive. I'm thinking it's time to RMA the drive, but I wanted to get word for the experts before I send it out. you are having I/O errors on /dev/sdh. Those are filling the syslog to where you are running out of available memory, so the out-of-memory process in the kernel is killing off processes in an attempt to free some memory. (and killing off your login shell, the pre-clear process, and probably your screen session too) And apparently my webGUI as well if it goes long enough. I got home, hooked up my server to a monitor to see what was going on since my telnet is down, and sure enough, that error is stilllllllll going. I'm currently trying to get my web back up, so I can shut things down since my powderdown script doesn't even seem to want to work. I did noticed the I/O errors in the logs, did some various searching around on the forum, but I wasn't entirely sure what to do next in order to get things fixed. I'm still searching around google and the forum to see what exactly my next steps should be. I'm assuming double check cables? (My understanding is I/O errors could mean many of things, or I'm terribly wrong in that guess.) Quote Link to comment
Turbobuickguy Posted December 7, 2011 Share Posted December 7, 2011 New to unraid and not quite sure how to interpret these results. Hoping someone "in the know" could take a peek. preclear_results.txt Quote Link to comment
Joe L. Posted December 8, 2011 Share Posted December 8, 2011 New to unraid and not quite sure how to interpret these results. Hoping someone "in the know" could take a peek. looks good to me Quote Link to comment
Turbobuickguy Posted December 8, 2011 Share Posted December 8, 2011 Thanks for taking a look Joe!! Quote Link to comment
Recommended Posts
Join the conversation
You can post now and register later. If you have an account, sign in now to post with your account.
Note: Your post will require moderator approval before it will be visible.