Joe L. Posted September 3, 2012 Share Posted September 3, 2012 Hi guys nvm - Solved - discovered after studying the usage script that the following command is possible:\ preclear_disk.sh -d sat /dev/sda this instructs preclear to utilize alternate commands when running Smartctl. I did consider cases like this when I added that "-d type" option. You are one of the first to report they were able to use it on their hardware. hope someone can assist - i am running Unraid via an Adaptec controller (Model 52445). It is rather overkill for unraid since it is meant for high levels of RAID, but i didn't want to shell out additional $$$'s to get another controller - currently its performing admirably with roughly 64MB/s on post read clearing 12 drives at the same time Very nice disk controller card... (very expensive too, but I assume it was something you already had) I'm guessing your hardware is far more than a usual desktop machine. (How much RAM do you have?) everything seems fine - ie i set all the disks up as JBOD - which seems to simluate pass through(not sure on the correct terms) anyhow - so far so good. Unraid picks up the first set of 12 drives i connected (having power issues with connecting more ). Yes,you need a really good single 12 volt rail, high capacity power supply once you get up there in disks. So i guess my question is then - how do i go about trusting a drive after a preclear? Lookat the report it creates when it ends. (copies of the reports will be in /boot/preclear_reports) reading through some of the posts, it seems i need to look for the following : FAILING NOW attributes, 5 Reallocated_Sector_Ct (this should be preferably zero - or else stay a very low number.) 197 Current_Pending_Sector (this should be preferably zero - or else stay a very low number.) Your report shows one attribute that looks odd. 192 Power-Off_Retract_Count 0x0032 100 100 000 Old_age Always - 285 For many disks, this attribute represents emergency retracts of the disk heads when power is lost. It seems high unless the disk just had power cut off rather than having its heads parked in an orderly shutdown in previous usage. also - should i be worried about which "device" i am clearing? (since i gather that SDB and SDG is possibly the same thing...) /dev/sdb and /dev/sdg will never be the same disk. You can type: ls -l /dev/disk/by-id/* to see a listing of all your disks and disk partitions by model and serial number. The preclear script was written to not allow you to clear a disk that is assigned to the array, or mounted and in use. Have fun, Joe L. Quote Link to comment
Neo_x Posted September 3, 2012 Share Posted September 3, 2012 Hi guys nvm - Solved - discovered after studying the usage script that the following command is possible:\ preclear_disk.sh -d sat /dev/sda this instructs preclear to utilize alternate commands when running Smartctl. I did consider cases like this when I added that "-d type" option. You are one of the first to report they were able to use it on their hardware. ... .. dev/sdb and /dev/sdg will never be the same disk. You can type: ls -l /dev/disk/by-id/* to see a listing of all your disks and disk partitions by model and serial number. The preclear script was written to not allow you to clear a disk that is assigned to the array, or mounted and in use. Have fun, Joe L. Thx for an informative post Joe yes the controller cost me a pretty penny (more in the region of penny wise pound foolish ). about 2/3 years ago i was a strong follower of RAID5/ RAID6 - and thus decided i needed a good controller that can give me the biggest RAID-6 possible relative to my case capacity at that stage - 16 drives max. problem is - as with any data - i outgrow the server recently, and thus needed to upgrade. Problem is - although its a very nice controller, it was following the same rules of RAID. all drives needs to be the same capacity -and it seems it had another limitation - 16 drives max for a cluster. so yes - i wasn't willing to shell out additional cash upgrading the 16 x 1.5TB drives to 16 x 3TB drives, as this would just be pointless(converting the RAID would have taken days if not weeks - in the region of 30 hours per drive), and then i would be stuck in the same position a few years down the line. so yes - now i am moving over to Unraid - and definitely seems to be the way to go. otherwise - nope are having a "normal" desktop pc in use ( Tri-SLI board with one of the first Quad cores(Q6600) running and i think in the region of 4GB RAM - nothing serious ( home user ). the controller have a duo core cpu and 512MB ram onboard- so i think i am safe performance wise. i am seemingly having some issues with spin-up / spin-down on the controller - will investigate a bit more once i finished migrating data over from backups onto the server. will keep you guys updated Quote Link to comment
drawz Posted September 9, 2012 Share Posted September 9, 2012 I have a brand new 3TB Hitachi 5K3000 that just finished 3 preclear cycles with 58 reallocated sectors (and 70 reallocated events?). Final SMART output is attached. Preclear actually said that there were 0 sectors pending reallocation after the 1st 2 passes, but if you look at the individual "smart_mid_post_read1" log, it really shows up after the first pass (bug in preclear?). This is confirmed by these entries in the syslog during the first pass, with no similar entries in later passes: Sep 5 10:24:20 tower kernel: sd 3:0:0:0: [sdc] Unhandled error code Sep 5 10:24:20 tower kernel: sd 3:0:0:0: [sdc] Result: hostbyte=0x00 driverbyte=0x06 Sep 5 10:24:20 tower kernel: sd 3:0:0:0: [sdc] CDB: cdb[0]=0x2a: 2a 00 36 17 dc 00 00 04 00 00 Sep 5 10:24:20 tower kernel: end_request: I/O error, dev sdc, sector 907533312 Sep 5 10:24:20 tower kernel: Buffer I/O error on device sdc, logical block 113441664 Sep 5 10:24:20 tower kernel: lost page write due to I/O error on sdc Sep 5 10:24:20 tower kernel: Buffer I/O error on device sdc, logical block 113441665 Sep 5 10:24:20 tower kernel: lost page write due to I/O error on sdc Sep 5 10:24:20 tower kernel: Buffer I/O error on device sdc, logical block 113441666 Sep 5 10:24:20 tower kernel: lost page write due to I/O error on sdc Sep 5 10:24:20 tower kernel: Buffer I/O error on device sdc, logical block 113441667 Sep 5 10:24:20 tower kernel: lost page write due to I/O error on sdc Sep 5 10:24:20 tower kernel: Buffer I/O error on device sdc, logical block 113441668 Sep 5 10:24:20 tower kernel: lost page write due to I/O error on sdc Sep 5 10:24:20 tower kernel: Buffer I/O error on device sdc, logical block 113441669 Sep 5 10:24:20 tower kernel: lost page write due to I/O error on sdc Sep 5 10:24:20 tower kernel: Buffer I/O error on device sdc, logical block 113441670 Sep 5 10:24:20 tower kernel: lost page write due to I/O error on sdc Sep 5 10:24:20 tower kernel: Buffer I/O error on device sdc, logical block 113441671 Sep 5 10:24:20 tower kernel: lost page write due to I/O error on sdc Sep 5 10:24:20 tower kernel: Buffer I/O error on device sdc, logical block 113441672 Sep 5 10:24:20 tower kernel: lost page write due to I/O error on sdc Sep 5 10:24:20 tower kernel: Buffer I/O error on device sdc, logical block 113441673 Sep 5 10:24:20 tower kernel: lost page write due to I/O error on sdc Sep 5 10:24:20 tower kernel: sd 3:0:0:0: [sdc] Unhandled error code Sep 5 10:24:20 tower kernel: sd 3:0:0:0: [sdc] Result: hostbyte=0x00 driverbyte=0x06 Sep 5 10:24:20 tower kernel: sd 3:0:0:0: [sdc] CDB: cdb[0]=0x2a: 2a 00 36 17 e0 00 00 04 00 00 Sep 5 10:24:20 tower kernel: end_request: I/O error, dev sdc, sector 907534336 Sep 5 10:24:20 tower kernel: sd 3:0:0:0: [sdc] Unhandled error code Sep 5 10:24:20 tower kernel: sd 3:0:0:0: [sdc] Result: hostbyte=0x00 driverbyte=0x06 Sep 5 10:24:20 tower kernel: sd 3:0:0:0: [sdc] CDB: cdb[0]=0x2a: 2a 00 36 17 e4 00 00 04 00 00 Sep 5 10:24:20 tower kernel: end_request: I/O error, dev sdc, sector 907535360 Sep 5 10:24:20 tower kernel: sd 3:0:0:0: [sdc] Unhandled error code Sep 5 10:24:20 tower kernel: sd 3:0:0:0: [sdc] Result: hostbyte=0x00 driverbyte=0x06 Sep 5 10:24:20 tower kernel: sd 3:0:0:0: [sdc] CDB: cdb[0]=0x2a: 2a 00 36 17 e8 00 00 04 00 00 Sep 5 10:24:20 tower kernel: end_request: I/O error, dev sdc, sector 907536384 Sep 5 10:24:20 tower kernel: sd 3:0:0:0: [sdc] Unhandled error code Sep 5 10:24:20 tower kernel: sd 3:0:0:0: [sdc] Result: hostbyte=0x00 driverbyte=0x06 Sep 5 10:24:20 tower kernel: sd 3:0:0:0: [sdc] CDB: cdb[0]=0x2a: 2a 00 36 17 ec 00 00 04 00 00 Sep 5 10:24:20 tower kernel: end_request: I/O error, dev sdc, sector 907537408 Sep 5 10:24:20 tower kernel: sd 3:0:0:0: [sdc] Unhandled error code Sep 5 10:24:20 tower kernel: sd 3:0:0:0: [sdc] Result: hostbyte=0x00 driverbyte=0x06 Sep 5 10:24:20 tower kernel: sd 3:0:0:0: [sdc] CDB: cdb[0]=0x2a: 2a 00 36 17 f0 00 00 04 00 00 Sep 5 10:24:20 tower kernel: end_request: I/O error, dev sdc, sector 907538432 Sep 5 10:24:20 tower kernel: sd 3:0:0:0: [sdc] Unhandled error code Sep 5 10:24:20 tower kernel: sd 3:0:0:0: [sdc] Result: hostbyte=0x00 driverbyte=0x06 Sep 5 10:24:20 tower kernel: sd 3:0:0:0: [sdc] CDB: cdb[0]=0x2a: 2a 00 36 17 f4 00 00 04 00 00 Sep 5 10:24:20 tower kernel: end_request: I/O error, dev sdc, sector 907539456 Sep 5 10:24:20 tower kernel: sd 3:0:0:0: [sdc] Unhandled error code Sep 5 10:24:20 tower kernel: sd 3:0:0:0: [sdc] Result: hostbyte=0x00 driverbyte=0x06 Sep 5 10:24:20 tower kernel: sd 3:0:0:0: [sdc] CDB: cdb[0]=0x2a: 2a 00 36 17 f8 00 00 04 00 00 Sep 5 10:24:20 tower kernel: end_request: I/O error, dev sdc, sector 907540480 Sep 5 10:24:20 tower kernel: sd 3:0:0:0: [sdc] Unhandled error code Sep 5 10:24:20 tower kernel: sd 3:0:0:0: [sdc] Result: hostbyte=0x00 driverbyte=0x06 Sep 5 10:24:20 tower kernel: sd 3:0:0:0: [sdc] CDB: cdb[0]=0x2a: 2a 00 36 17 fc 00 00 04 00 00 Sep 5 10:24:20 tower kernel: end_request: I/O error, dev sdc, sector 907541504 Sep 5 10:24:20 tower kernel: sd 3:0:0:0: [sdc] Unhandled error code Sep 5 10:24:20 tower kernel: sd 3:0:0:0: [sdc] Result: hostbyte=0x00 driverbyte=0x06 Sep 5 10:24:20 tower kernel: sd 3:0:0:0: [sdc] CDB: cdb[0]=0x2a: 2a 00 36 18 00 00 00 04 00 00 Sep 5 10:24:20 tower kernel: end_request: I/O error, dev sdc, sector 907542528 Sep 5 10:24:20 tower kernel: sd 3:0:0:0: [sdc] Unhandled error code Sep 5 10:24:20 tower kernel: sd 3:0:0:0: [sdc] Result: hostbyte=0x00 driverbyte=0x06 Sep 5 10:24:20 tower kernel: sd 3:0:0:0: [sdc] CDB: cdb[0]=0x2a: 2a 00 36 18 04 00 00 04 00 00 Sep 5 10:24:20 tower kernel: end_request: I/O error, dev sdc, sector 907543552 Sep 5 10:24:20 tower kernel: sd 3:0:0:0: [sdc] Unhandled error code Sep 5 10:24:20 tower kernel: sd 3:0:0:0: [sdc] Result: hostbyte=0x00 driverbyte=0x06 Sep 5 10:24:20 tower kernel: sd 3:0:0:0: [sdc] CDB: cdb[0]=0x2a: 2a 00 36 18 08 00 00 04 00 00 Sep 5 10:24:20 tower kernel: end_request: I/O error, dev sdc, sector 907544576 Sep 5 10:24:20 tower kernel: sd 3:0:0:0: [sdc] Unhandled error code Sep 5 10:24:20 tower kernel: sd 3:0:0:0: [sdc] Result: hostbyte=0x00 driverbyte=0x06 Sep 5 10:24:20 tower kernel: sd 3:0:0:0: [sdc] CDB: cdb[0]=0x2a: 2a 00 36 18 28 00 00 04 00 00 Sep 5 10:24:20 tower kernel: end_request: I/O error, dev sdc, sector 907552768 Sep 5 10:24:20 tower kernel: sd 3:0:0:0: [sdc] Unhandled error code Sep 5 10:24:20 tower kernel: sd 3:0:0:0: [sdc] Result: hostbyte=0x00 driverbyte=0x06 Sep 5 10:24:20 tower kernel: sd 3:0:0:0: [sdc] CDB: cdb[0]=0x2a: 2a 00 36 18 2c 00 00 04 00 00 Sep 5 10:24:20 tower kernel: end_request: I/O error, dev sdc, sector 907553792 Sep 5 10:24:20 tower kernel: sd 3:0:0:0: [sdc] Unhandled error code Sep 5 10:24:20 tower kernel: sd 3:0:0:0: [sdc] Result: hostbyte=0x00 driverbyte=0x06 Sep 5 10:24:20 tower kernel: sd 3:0:0:0: [sdc] CDB: cdb[0]=0x2a: 2a 00 36 18 30 00 00 04 00 00 Sep 5 10:24:20 tower kernel: end_request: I/O error, dev sdc, sector 907554816 The sector count reallocation was stable through the 2nd two runs, but I'm still inclined to send back a brand new drive with any problems. Unfortunately, Newegg doesn't have any more of these in stock, so an exchange through them may not be an option. I'd hate to RMA a brand new drive and get a refurbished one back. Am I being too paranoid? preclear_MJ1311YNG7RNDA_2012-09-09.zip Quote Link to comment
grither Posted September 10, 2012 Share Posted September 10, 2012 Hi all... does anyone see anything bad about my preclear report? EDIT should have menitoned that this is a brand new drive. also edited to remove my question about the 'old age' flag, i think this is the type of indicator, not a notification Date: 10 September, 2012 12:09:44 PM CDT To: root Subject: Preclear: PASS! Preclearing Disk sdr Finished!!! Cycle 1 of 1 ========================================================================1.13 == invoked as: ./preclear_disk.sh -M 4 /dev/sdr == ST3000DM001-9YN166 W1F0L8TJ == Disk /dev/sdr has been successfully precleared == with a starting sector of 1 == Ran 1 cycle == == Using :Read block size = 8225280 Bytes == Last Cycle's Pre Read Time : 6:03:40 (137 MB/s) == Last Cycle's Zeroing time : 7:09:43 (116 MB/s) == Last Cycle's Post Read Time : 14:33:30 (57 MB/s) == Last Cycle's Total Time : 27:47:58 == == Total Elapsed Time 27:47:58 == == Disk Start Temperature: 32C == == Current Disk Temperature: 34C, == ============================================================================ ** Changed attributes in files: /tmp/smart_start_sdr /tmp/smart_finish_sdr ATTRIBUTE NEW_VAL OLD_VAL FAILURE_THRESHOLD STATUS RAW_VALUE Raw_Read_Error_Rate = 114 109 6 ok 75612992 Seek_Error_Rate = 63 100 30 ok 2393921 Spin_Retry_Count = 100 100 97 near_thresh 0 End-to-End_Error = 100 100 99 near_thresh 0 Airflow_Temperature_Cel = 66 68 45 near_thresh 34 Temperature_Celsius = 34 32 0 ok 34 No SMART attributes are FAILING_NOW 0 sectors were pending re-allocation before the start of the preclear. 0 sectors were pending re-allocation after pre-read in cycle 1 of 1. 0 sectors were pending re-allocation after zero of disk in cycle 1 of 1. 0 sectors are pending re-allocation at the end of the preclear, the number of sectors pending re-allocation did not change. 0 sectors had been re-allocated before the start of the preclear. 0 sectors are re-allocated at the end of the preclear, the number of sectors re-allocated did not change. ============================================================================ ============================================================================ == == S.M.A.R.T Initial Report for /dev/sdr == Disk: /dev/sdr smartctl 5.40 2010-10-16 r3189 [i486-slackware-linux-gnu] (local build) Copyright © 2002-10 by Bruce Allen, http://smartmontools.sourceforge.net === START OF INFORMATION SECTION === Device Model: ST3000DM001-9YN166 Serial Number: W1F0L8TJ Firmware Version: CC4B User Capacity: 3,000,592,982,016 bytes Device is: Not in smartctl database [for details use: -P showall] ATA Version is: 8 ATA Standard is: ATA-8-ACS revision 4 Local Time is: Sun Sep 9 08:21:45 2012 CDT SMART support is: Available - device has SMART capability. SMART support is: Enabled === START OF READ SMART DATA SECTION === SMART overall-health self-assessment test result: PASSED General SMART Values: Offline data collection status: (0x00) Offline data collection activity was never started. Auto Offline Data Collection: Disabled. Self-test execution status: ( 0) The previous self-test routine completed without error or no self-test has ever been run. Total time to complete Offline data collection: ( 575) seconds. Offline data collection capabilities: (0x73) SMART execute Offline immediate. Auto Offline data collection on/off support. Suspend Offline collection upon new command. No Offline surface scan supported. Self-test supported. Conveyance Self-test supported. Selective Self-test supported. SMART capabilities: (0x0003) Saves SMART data before entering power-saving mode. Supports SMART auto save timer. Error logging capability: (0x01) Error logging supported. General Purpose Logging supported. Short self-test routine recommended polling time: ( 1) minutes. Extended self-test routine recommended polling time: ( 255) minutes. Conveyance self-test routine recommended polling time: ( 2) minutes. SCT capabilities: (0x3085) SCT Status supported. SMART Attributes Data Structure revision number: 10 Vendor Specific SMART Attributes with Thresholds: ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE 1 Raw_Read_Error_Rate 0x000f 109 100 006 Pre-fail Always - 22302048 3 Spin_Up_Time 0x0003 094 094 000 Pre-fail Always - 0 4 Start_Stop_Count 0x0032 100 100 020 Old_age Always - 6 5 Reallocated_Sector_Ct 0x0033 100 100 036 Pre-fail Always - 0 7 Seek_Error_Rate 0x000f 100 253 030 Pre-fail Always - 996636 9 Power_On_Hours 0x0032 100 100 000 Old_age Always - 14 10 Spin_Retry_Count 0x0013 100 100 097 Pre-fail Always - 0 12 Power_Cycle_Count 0x0032 100 100 020 Old_age Always - 6 183 Runtime_Bad_Block 0x0032 100 100 000 Old_age Always - 0 184 End-to-End_Error 0x0032 100 100 099 Old_age Always - 0 187 Reported_Uncorrect 0x0032 100 100 000 Old_age Always - 0 188 Command_Timeout 0x0032 100 100 000 Old_age Always - 0 189 High_Fly_Writes 0x003a 100 100 000 Old_age Always - 0 190 Airflow_Temperature_Cel 0x0022 068 062 045 Old_age Always - 32 (Min/Max 30/32) 191 G-Sense_Error_Rate 0x0032 100 100 000 Old_age Always - 0 192 Power-Off_Retract_Count 0x0032 100 100 000 Old_age Always - 5 193 Load_Cycle_Count 0x0032 100 100 000 Old_age Always - 23 194 Temperature_Celsius 0x0022 032 040 000 Old_age Always - 32 (0 26 0 0) 197 Current_Pending_Sector 0x0012 100 100 000 Old_age Always - 0 198 Offline_Uncorrectable 0x0010 100 100 000 Old_age Offline - 0 199 UDMA_CRC_Error_Count 0x003e 200 200 000 Old_age Always - 0 240 Head_Flying_Hours 0x0000 100 253 000 Old_age Offline - 135029476818955 241 Total_LBAs_Written 0x0000 100 253 000 Old_age Offline - 25231187323737 242 Total_LBAs_Read 0x0000 100 253 000 Old_age Offline - 21829135496 SMART Error Log Version: 1 No Errors Logged SMART Self-test log structure revision number 1 No self-tests have been logged. [To run self-tests, use: smartctl -t] SMART Selective self-test log data structure revision number 1 SPAN MIN_LBA MAX_LBA CURRENT_TEST_STATUS 1 0 0 Not_testing 2 0 0 Not_testing 3 0 0 Not_testing 4 0 0 Not_testing 5 0 0 Not_testing Selective self-test flags (0x0): After scanning selected spans, do NOT read-scan remainder of disk. If Selective self-test is pending on power-up, resume after 0 minute delay. == ============================================================================ ============================================================================ == == S.M.A.R.T Final Report for /dev/sdr == Disk: /dev/sdr smartctl 5.40 2010-10-16 r3189 [i486-slackware-linux-gnu] (local build) Copyright © 2002-10 by Bruce Allen, http://smartmontools.sourceforge.net === START OF INFORMATION SECTION === Device Model: ST3000DM001-9YN166 Serial Number: W1F0L8TJ Firmware Version: CC4B User Capacity: 3,000,592,982,016 bytes Device is: Not in smartctl database [for details use: -P showall] ATA Version is: 8 ATA Standard is: ATA-8-ACS revision 4 Local Time is: Mon Sep 10 12:09:42 2012 CDT SMART support is: Available - device has SMART capability. SMART support is: Enabled === START OF READ SMART DATA SECTION === SMART overall-health self-assessment test result: PASSED General SMART Values: Offline data collection status: (0x00) Offline data collection activity was never started. Auto Offline Data Collection: Disabled. Self-test execution status: ( 0) The previous self-test routine completed without error or no self-test has ever been run. Total time to complete Offline data collection: ( 575) seconds. Offline data collection capabilities: (0x73) SMART execute Offline immediate. Auto Offline data collection on/off support. Suspend Offline collection upon new command. No Offline surface scan supported. Self-test supported. Conveyance Self-test supported. Selective Self-test supported. SMART capabilities: (0x0003) Saves SMART data before entering power-saving mode. Supports SMART auto save timer. Error logging capability: (0x01) Error logging supported. General Purpose Logging supported. Short self-test routine recommended polling time: ( 1) minutes. Extended self-test routine recommended polling time: ( 255) minutes. Conveyance self-test routine recommended polling time: ( 2) minutes. SCT capabilities: (0x3085) SCT Status supported. SMART Attributes Data Structure revision number: 10 Vendor Specific SMART Attributes with Thresholds: ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE 1 Raw_Read_Error_Rate 0x000f 114 100 006 Pre-fail Always - 75612992 3 Spin_Up_Time 0x0003 094 094 000 Pre-fail Always - 0 4 Start_Stop_Count 0x0032 100 100 020 Old_age Always - 6 5 Reallocated_Sector_Ct 0x0033 100 100 036 Pre-fail Always - 0 7 Seek_Error_Rate 0x000f 063 060 030 Pre-fail Always - 2393921 9 Power_On_Hours 0x0032 100 100 000 Old_age Always - 42 10 Spin_Retry_Count 0x0013 100 100 097 Pre-fail Always - 0 12 Power_Cycle_Count 0x0032 100 100 020 Old_age Always - 6 183 Runtime_Bad_Block 0x0032 100 100 000 Old_age Always - 0 184 End-to-End_Error 0x0032 100 100 099 Old_age Always - 0 187 Reported_Uncorrect 0x0032 100 100 000 Old_age Always - 0 188 Command_Timeout 0x0032 100 100 000 Old_age Always - 0 189 High_Fly_Writes 0x003a 100 100 000 Old_age Always - 0 190 Airflow_Temperature_Cel 0x0022 066 062 045 Old_age Always - 34 (Min/Max 30/38) 191 G-Sense_Error_Rate 0x0032 100 100 000 Old_age Always - 0 192 Power-Off_Retract_Count 0x0032 100 100 000 Old_age Always - 5 193 Load_Cycle_Count 0x0032 100 100 000 Old_age Always - 23 194 Temperature_Celsius 0x0022 034 040 000 Old_age Always - 34 (0 26 0 0) 197 Current_Pending_Sector 0x0012 100 100 000 Old_age Always - 0 198 Offline_Uncorrectable 0x0010 100 100 000 Old_age Offline - 0 199 UDMA_CRC_Error_Count 0x003e 200 200 000 Old_age Always - 0 240 Head_Flying_Hours 0x0000 100 253 000 Old_age Offline - 242674242158631 241 Total_LBAs_Written 0x0000 100 253 000 Old_age Offline - 56194266962989 242 Total_LBAs_Read 0x0000 100 253 000 Old_age Offline - 43987297111 SMART Error Log Version: 1 No Errors Logged SMART Self-test log structure revision number 1 No self-tests have been logged. [To run self-tests, use: smartctl -t] SMART Selective self-test log data structure revision number 1 SPAN MIN_LBA MAX_LBA CURRENT_TEST_STATUS 1 0 0 Not_testing 2 0 0 Not_testing 3 0 0 Not_testing 4 0 0 Not_testing 5 0 0 Not_testing Selective self-test flags (0x0): After scanning selected spans, do NOT read-scan remainder of disk. If Selective self-test is pending on power-up, resume after 0 minute delay. == ============================================================================ Quote Link to comment
Joe L. Posted September 11, 2012 Share Posted September 11, 2012 Hi all... does anyone see anything bad about my preclear report? EDIT should have menitoned that this is a brand new drive. also edited to remove my question about the 'old age' flag, i think this is the type of indicator, not a notification Looks fine. Enjoy your new drive. Quote Link to comment
grither Posted September 11, 2012 Share Posted September 11, 2012 Great thanks joe Quote Link to comment
Danieluwe Posted September 12, 2012 Share Posted September 12, 2012 Just fitted a sil3132 card into my server to give a couple of extra slots, attached a new IBm 3tb coolspin and started the preclear, pre read ran through at between 120MB/s and 80MB/s pretty normal for my server, it has moved onto stage 2 copying zeros but has been running at 36MB/s consistently for the entire time I'm onto 22hrs elapsed and only 50% into writing zeros, Anyone have any idea what can cause this behaviour?? Many thanks in advance Dan Quote Link to comment
Joe L. Posted September 12, 2012 Share Posted September 12, 2012 it has moved onto stage 2 copying zeros but has been running at 36MB/s consistently for the entire time I'm onto 22hrs elapsed and only 50% into writing zeros, Anyone have any idea what can cause this behaviour ? ? A. it writes slower than it reads ? ? ? B. You are doing something else that is keeping the same disk controller busy ? ? ? C. Slow electrons ? ? ? D. Otherwise, no... you did NOT provide a syslog for analysis. (so all we can do is guess) did you look there for any signs of errors? Quote Link to comment
Danieluwe Posted September 12, 2012 Share Posted September 12, 2012 There is nothing else connected to the controller and the server hasn't even been doing anything, I've left it alone during the pre-clear, I checked the syslog, but there was nothing out of the ordinary on it, just normal ip lease requests and disc spindowns since the preclear started, I will wait for the preclear to finish before I start to worry too much, but is quite unusual never seen a drive speed vary so much during a preclear Ps just hit post read and back up to 120MB/s very strange Quote Link to comment
StevenD Posted September 14, 2012 Share Posted September 14, 2012 New Hitachi 4TB. Took 39 hours for one pass. ========================================================================1.13 == invoked as: ./preclear_disk.sh -A -m /dev/sda == Hitachi HDS724040ALE640 PK2301P == Disk /dev/sda has been successfully precleared == with a starting sector of 1 == Ran 1 cycle == == Using :Read block size = 8225280 Bytes == Last Cycle's Pre Read Time : 9:46:06 (113 MB/s) == Last Cycle's Zeroing time : 10:19:09 (107 MB/s) == Last Cycle's Post Read Time : 18:50:08 (59 MB/s) == Last Cycle's Total Time : 38:56:32 == == Total Elapsed Time 38:56:32 == == Disk Start Temperature: 36C == == Current Disk Temperature: 39C, == ============================================================================ ** Changed attributes in files: /tmp/smart_start_sda /tmp/smart_finish_sda ATTRIBUTE NEW_VAL OLD_VAL FAILURE_THRESHOLD STATUS RAW_VALUE Temperature_Celsius = 153 166 0 ok 39 No SMART attributes are FAILING_NOW 0 sectors were pending re-allocation before the start of the preclear. 0 sectors were pending re-allocation after pre-read in cycle 1 of 1. 0 sectors were pending re-allocation after zero of disk in cycle 1 of 1. 0 sectors are pending re-allocation at the end of the preclear, the number of sectors pending re-allocation did not change. 0 sectors had been re-allocated before the start of the preclear. 0 sectors are re-allocated at the end of the preclear, the number of sectors re-allocated did not change. ============================================================================ ============================================================================ == == S.M.A.R.T Initial Report for /dev/sda == Disk: /dev/sda smartctl 5.40 2010-10-16 r3189 [i486-slackware-linux-gnu] (local build) Copyright (C) 2002-10 by Bruce Allen, http://smartmontools.sourceforge.net === START OF INFORMATION SECTION === Device Model: Hitachi HDS724040ALE640 Serial Number: PK2301P Firmware Version: MJAOA3B0 User Capacity: 4,000,787,030,016 bytes Device is: Not in smartctl database [for details use: -P showall] ATA Version is: 8 ATA Standard is: ATA-8-ACS revision 4 Local Time is: Wed Sep 12 20:25:26 2012 Local time zone must be set--see zic m SMART support is: Available - device has SMART capability. SMART support is: Enabled === START OF READ SMART DATA SECTION === SMART overall-health self-assessment test result: PASSED General SMART Values: Offline data collection status: (0x84) Offline data collection activity was suspended by an interrupting command from host. Auto Offline Data Collection: Enabled. Self-test execution status: ( 0) The previous self-test routine completed without error or no self-test has ever been run. Total time to complete Offline data collection: ( 24) seconds. Offline data collection capabilities: (0x5b) SMART execute Offline immediate. Auto Offline data collection on/off support. Suspend Offline collection upon new command. Offline surface scan supported. Self-test supported. No Conveyance Self-test supported. Selective Self-test supported. SMART capabilities: (0x0003) Saves SMART data before entering power-saving mode. Supports SMART auto save timer. Error logging capability: (0x01) Error logging supported. General Purpose Logging supported. Short self-test routine recommended polling time: ( 1) minutes. Extended self-test routine recommended polling time: ( 255) minutes. SCT capabilities: (0x003d) SCT Status supported. SCT Error Recovery Control supported. SCT Feature Control supported. SCT Data Table supported. SMART Attributes Data Structure revision number: 16 Vendor Specific SMART Attributes with Thresholds: ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE 1 Raw_Read_Error_Rate 0x000b 100 100 016 Pre-fail Always - 0 2 Throughput_Performance 0x0005 100 100 054 Pre-fail Offline - 0 3 Spin_Up_Time 0x0007 100 100 024 Pre-fail Always - 0 4 Start_Stop_Count 0x0012 100 100 000 Old_age Always - 4 5 Reallocated_Sector_Ct 0x0033 100 100 005 Pre-fail Always - 0 7 Seek_Error_Rate 0x000b 100 100 067 Pre-fail Always - 0 8 Seek_Time_Performance 0x0005 100 100 020 Pre-fail Offline - 0 9 Power_On_Hours 0x0012 100 100 000 Old_age Always - 20 10 Spin_Retry_Count 0x0013 100 100 060 Pre-fail Always - 0 12 Power_Cycle_Count 0x0032 100 100 000 Old_age Always - 4 192 Power-Off_Retract_Count 0x0032 100 100 000 Old_age Always - 4 193 Load_Cycle_Count 0x0012 100 100 000 Old_age Always - 4 194 Temperature_Celsius 0x0002 166 166 000 Old_age Always - 36 (Min/Max 25/37) 196 Reallocated_Event_Count 0x0032 100 100 000 Old_age Always - 0 197 Current_Pending_Sector 0x0022 100 100 000 Old_age Always - 0 198 Offline_Uncorrectable 0x0008 100 100 000 Old_age Offline - 0 199 UDMA_CRC_Error_Count 0x000a 200 200 000 Old_age Always - 0 SMART Error Log Version: 1 No Errors Logged SMART Self-test log structure revision number 1 No self-tests have been logged. [To run self-tests, use: smartctl -t] SMART Selective self-test log data structure revision number 1 SPAN MIN_LBA MAX_LBA CURRENT_TEST_STATUS 1 0 0 Not_testing 2 0 0 Not_testing 3 0 0 Not_testing 4 0 0 Not_testing 5 0 0 Not_testing Selective self-test flags (0x0): After scanning selected spans, do NOT read-scan remainder of disk. If Selective self-test is pending on power-up, resume after 0 minute delay. ============================================================================== ============================================================================== == S.M.A.R.T Final Report for /dev/sda == Disk: /dev/sda smartctl 5.40 2010-10-16 r3189 [i486-slackware-linux-gnu] (local build) Copyright (C) 2002-10 by Bruce Allen, http://smartmontools.sourceforge.net === START OF INFORMATION SECTION === Device Model: Hitachi HDS724040ALE640 Serial Number: PK2301P Firmware Version: MJAOA3B0 User Capacity: 4,000,787,030,016 bytes Device is: Not in smartctl database [for details use: -P showall] ATA Version is: 8 ATA Standard is: ATA-8-ACS revision 4 Local Time is: Fri Sep 14 11:21:58 2012 Local time zone must be set--see zic m SMART support is: Available - device has SMART capability. SMART support is: Enabled === START OF READ SMART DATA SECTION === SMART overall-health self-assessment test result: PASSED General SMART Values: Offline data collection status: (0x84) Offline data collection activity was suspended by an interrupting command from host. Auto Offline Data Collection: Enabled. Self-test execution status: ( 0) The previous self-test routine completed without error or no self-test has ever been run. Total time to complete Offline data collection: ( 24) seconds. Offline data collection capabilities: (0x5b) SMART execute Offline immediate. Auto Offline data collection on/off support. Suspend Offline collection upon new command. Offline surface scan supported. Self-test supported. No Conveyance Self-test supported. Selective Self-test supported. SMART capabilities: (0x0003) Saves SMART data before entering power-saving mode. Supports SMART auto save timer. Error logging capability: (0x01) Error logging supported. General Purpose Logging supported. Short self-test routine recommended polling time: ( 1) minutes. Extended self-test routine recommended polling time: ( 255) minutes. SCT capabilities: (0x003d) SCT Status supported. SCT Error Recovery Control supported. SCT Feature Control supported. SCT Data Table supported. SMART Attributes Data Structure revision number: 16 Vendor Specific SMART Attributes with Thresholds: ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE 1 Raw_Read_Error_Rate 0x000b 100 100 016 Pre-fail Always - 0 2 Throughput_Performance 0x0005 100 100 054 Pre-fail Offline - 0 3 Spin_Up_Time 0x0007 100 100 024 Pre-fail Always - 0 4 Start_Stop_Count 0x0012 100 100 000 Old_age Always - 4 5 Reallocated_Sector_Ct 0x0033 100 100 005 Pre-fail Always - 0 7 Seek_Error_Rate 0x000b 100 100 067 Pre-fail Always - 0 8 Seek_Time_Performance 0x0005 100 100 020 Pre-fail Offline - 0 9 Power_On_Hours 0x0012 100 100 000 Old_age Always - 58 10 Spin_Retry_Count 0x0013 100 100 060 Pre-fail Always - 0 12 Power_Cycle_Count 0x0032 100 100 000 Old_age Always - 4 192 Power-Off_Retract_Count 0x0032 100 100 000 Old_age Always - 4 193 Load_Cycle_Count 0x0012 100 100 000 Old_age Always - 4 194 Temperature_Celsius 0x0002 153 153 000 Old_age Always - 39 (Min/Max 25/40) 196 Reallocated_Event_Count 0x0032 100 100 000 Old_age Always - 0 197 Current_Pending_Sector 0x0022 100 100 000 Old_age Always - 0 198 Offline_Uncorrectable 0x0008 100 100 000 Old_age Offline - 0 199 UDMA_CRC_Error_Count 0x000a 200 200 000 Old_age Always - 0 SMART Error Log Version: 1 No Errors Logged SMART Self-test log structure revision number 1 No self-tests have been logged. [To run self-tests, use: smartctl -t] SMART Selective self-test log data structure revision number 1 SPAN MIN_LBA MAX_LBA CURRENT_TEST_STATUS 1 0 0 Not_testing 2 0 0 Not_testing 3 0 0 Not_testing 4 0 0 Not_testing 5 0 0 Not_testing Selective self-test flags (0x0): After scanning selected spans, do NOT read-scan remainder of disk. If Selective self-test is pending on power-up, resume after 0 minute delay. ============================================================================== Quote Link to comment
Joe L. Posted September 15, 2012 Share Posted September 15, 2012 New Hitachi 4TB. Took 39 hours for one pass. ========================================================================1.13 == invoked as: ./preclear_disk.sh -A -m /dev/sda == Hitachi HDS724040ALE640 PK2301P == Disk /dev/sda has been successfully precleared == with a starting sector of 1 == Ran 1 cycle == == Using :Read block size = 8225280 Bytes == Last Cycle's Pre Read Time : 9:46:06 (113 MB/s) == Last Cycle's Zeroing time : 10:19:09 (107 MB/s) == Last Cycle's Post Read Time : 18:50:08 (59 MB/s) == Last Cycle's Total Time : 38:56:32 == == Total Elapsed Time 38:56:32 == == Disk Start Temperature: 36C == == Current Disk Temperature: 39C, == ============================================================================ ** Changed attributes in files: /tmp/smart_start_sda /tmp/smart_finish_sda ATTRIBUTE NEW_VAL OLD_VAL FAILURE_THRESHOLD STATUS RAW_VALUE Temperature_Celsius = 153 166 0 ok 39 No SMART attributes are FAILING_NOW 0 sectors were pending re-allocation before the start of the preclear. 0 sectors were pending re-allocation after pre-read in cycle 1 of 1. 0 sectors were pending re-allocation after zero of disk in cycle 1 of 1. 0 sectors are pending re-allocation at the end of the preclear, the number of sectors pending re-allocation did not change. 0 sectors had been re-allocated before the start of the preclear. 0 sectors are re-allocated at the end of the preclear, the number of sectors re-allocated did not change. ============================================================================ ============================================================================ == == S.M.A.R.T Initial Report for /dev/sda == Disk: /dev/sda smartctl 5.40 2010-10-16 r3189 [i486-slackware-linux-gnu] (local build) Copyright (C) 2002-10 by Bruce Allen, http://smartmontools.sourceforge.net === START OF INFORMATION SECTION === Device Model: Hitachi HDS724040ALE640 Serial Number: PK2301P Firmware Version: MJAOA3B0 User Capacity: 4,000,787,030,016 bytes Device is: Not in smartctl database [for details use: -P showall] ATA Version is: 8 ATA Standard is: ATA-8-ACS revision 4 Local Time is: Wed Sep 12 20:25:26 2012 Local time zone must be set--see zic m SMART support is: Available - device has SMART capability. SMART support is: Enabled === START OF READ SMART DATA SECTION === SMART overall-health self-assessment test result: PASSED General SMART Values: Offline data collection status: (0x84) Offline data collection activity was suspended by an interrupting command from host. Auto Offline Data Collection: Enabled. Self-test execution status: ( 0) The previous self-test routine completed without error or no self-test has ever been run. Total time to complete Offline data collection: ( 24) seconds. Offline data collection capabilities: (0x5b) SMART execute Offline immediate. Auto Offline data collection on/off support. Suspend Offline collection upon new command. Offline surface scan supported. Self-test supported. No Conveyance Self-test supported. Selective Self-test supported. SMART capabilities: (0x0003) Saves SMART data before entering power-saving mode. Supports SMART auto save timer. Error logging capability: (0x01) Error logging supported. General Purpose Logging supported. Short self-test routine recommended polling time: ( 1) minutes. Extended self-test routine recommended polling time: ( 255) minutes. SCT capabilities: (0x003d) SCT Status supported. SCT Error Recovery Control supported. SCT Feature Control supported. SCT Data Table supported. SMART Attributes Data Structure revision number: 16 Vendor Specific SMART Attributes with Thresholds: ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE 1 Raw_Read_Error_Rate 0x000b 100 100 016 Pre-fail Always - 0 2 Throughput_Performance 0x0005 100 100 054 Pre-fail Offline - 0 3 Spin_Up_Time 0x0007 100 100 024 Pre-fail Always - 0 4 Start_Stop_Count 0x0012 100 100 000 Old_age Always - 4 5 Reallocated_Sector_Ct 0x0033 100 100 005 Pre-fail Always - 0 7 Seek_Error_Rate 0x000b 100 100 067 Pre-fail Always - 0 8 Seek_Time_Performance 0x0005 100 100 020 Pre-fail Offline - 0 9 Power_On_Hours 0x0012 100 100 000 Old_age Always - 20 10 Spin_Retry_Count 0x0013 100 100 060 Pre-fail Always - 0 12 Power_Cycle_Count 0x0032 100 100 000 Old_age Always - 4 192 Power-Off_Retract_Count 0x0032 100 100 000 Old_age Always - 4 193 Load_Cycle_Count 0x0012 100 100 000 Old_age Always - 4 194 Temperature_Celsius 0x0002 166 166 000 Old_age Always - 36 (Min/Max 25/37) 196 Reallocated_Event_Count 0x0032 100 100 000 Old_age Always - 0 197 Current_Pending_Sector 0x0022 100 100 000 Old_age Always - 0 198 Offline_Uncorrectable 0x0008 100 100 000 Old_age Offline - 0 199 UDMA_CRC_Error_Count 0x000a 200 200 000 Old_age Always - 0 SMART Error Log Version: 1 No Errors Logged SMART Self-test log structure revision number 1 No self-tests have been logged. [To run self-tests, use: smartctl -t] SMART Selective self-test log data structure revision number 1 SPAN MIN_LBA MAX_LBA CURRENT_TEST_STATUS 1 0 0 Not_testing 2 0 0 Not_testing 3 0 0 Not_testing 4 0 0 Not_testing 5 0 0 Not_testing Selective self-test flags (0x0): After scanning selected spans, do NOT read-scan remainder of disk. If Selective self-test is pending on power-up, resume after 0 minute delay. ============================================================================== ============================================================================== == S.M.A.R.T Final Report for /dev/sda == Disk: /dev/sda smartctl 5.40 2010-10-16 r3189 [i486-slackware-linux-gnu] (local build) Copyright (C) 2002-10 by Bruce Allen, http://smartmontools.sourceforge.net === START OF INFORMATION SECTION === Device Model: Hitachi HDS724040ALE640 Serial Number: PK2301P Firmware Version: MJAOA3B0 User Capacity: 4,000,787,030,016 bytes Device is: Not in smartctl database [for details use: -P showall] ATA Version is: 8 ATA Standard is: ATA-8-ACS revision 4 Local Time is: Fri Sep 14 11:21:58 2012 Local time zone must be set--see zic m SMART support is: Available - device has SMART capability. SMART support is: Enabled === START OF READ SMART DATA SECTION === SMART overall-health self-assessment test result: PASSED General SMART Values: Offline data collection status: (0x84) Offline data collection activity was suspended by an interrupting command from host. Auto Offline Data Collection: Enabled. Self-test execution status: ( 0) The previous self-test routine completed without error or no self-test has ever been run. Total time to complete Offline data collection: ( 24) seconds. Offline data collection capabilities: (0x5b) SMART execute Offline immediate. Auto Offline data collection on/off support. Suspend Offline collection upon new command. Offline surface scan supported. Self-test supported. No Conveyance Self-test supported. Selective Self-test supported. SMART capabilities: (0x0003) Saves SMART data before entering power-saving mode. Supports SMART auto save timer. Error logging capability: (0x01) Error logging supported. General Purpose Logging supported. Short self-test routine recommended polling time: ( 1) minutes. Extended self-test routine recommended polling time: ( 255) minutes. SCT capabilities: (0x003d) SCT Status supported. SCT Error Recovery Control supported. SCT Feature Control supported. SCT Data Table supported. SMART Attributes Data Structure revision number: 16 Vendor Specific SMART Attributes with Thresholds: ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE 1 Raw_Read_Error_Rate 0x000b 100 100 016 Pre-fail Always - 0 2 Throughput_Performance 0x0005 100 100 054 Pre-fail Offline - 0 3 Spin_Up_Time 0x0007 100 100 024 Pre-fail Always - 0 4 Start_Stop_Count 0x0012 100 100 000 Old_age Always - 4 5 Reallocated_Sector_Ct 0x0033 100 100 005 Pre-fail Always - 0 7 Seek_Error_Rate 0x000b 100 100 067 Pre-fail Always - 0 8 Seek_Time_Performance 0x0005 100 100 020 Pre-fail Offline - 0 9 Power_On_Hours 0x0012 100 100 000 Old_age Always - 58 10 Spin_Retry_Count 0x0013 100 100 060 Pre-fail Always - 0 12 Power_Cycle_Count 0x0032 100 100 000 Old_age Always - 4 192 Power-Off_Retract_Count 0x0032 100 100 000 Old_age Always - 4 193 Load_Cycle_Count 0x0012 100 100 000 Old_age Always - 4 194 Temperature_Celsius 0x0002 153 153 000 Old_age Always - 39 (Min/Max 25/40) 196 Reallocated_Event_Count 0x0032 100 100 000 Old_age Always - 0 197 Current_Pending_Sector 0x0022 100 100 000 Old_age Always - 0 198 Offline_Uncorrectable 0x0008 100 100 000 Old_age Offline - 0 199 UDMA_CRC_Error_Count 0x000a 200 200 000 Old_age Always - 0 SMART Error Log Version: 1 No Errors Logged SMART Self-test log structure revision number 1 No self-tests have been logged. [To run self-tests, use: smartctl -t] SMART Selective self-test log data structure revision number 1 SPAN MIN_LBA MAX_LBA CURRENT_TEST_STATUS 1 0 0 Not_testing 2 0 0 Not_testing 3 0 0 Not_testing 4 0 0 Not_testing 5 0 0 Not_testing Selective self-test flags (0x0): After scanning selected spans, do NOT read-scan remainder of disk. If Selective self-test is pending on power-up, resume after 0 minute delay. ============================================================================== LOOKS GOOD TO ME. Quote Link to comment
firstraid Posted September 19, 2012 Share Posted September 19, 2012 Hi Thanks to Joe L. I manage my first preclear disk. Can some kind person tell me if my old drive is acceptable for the array, as it's now 1 drive missing and I do not have the money for a new drive. I have attached the report files. Thank very much.. preclear_start__S0Z4JDWPC05764_2012-09-19.txt preclear_rpt__S0Z4JDWPC05764_2012-09-19.txt preclear_finish__S0Z4JDWPC05764_2012-09-19.txt Quote Link to comment
Joe L. Posted September 19, 2012 Share Posted September 19, 2012 Hi Thanks to Joe L. I manage my first preclear disk. Can some kind person tell me if my old drive is acceptable for the array, as it's now 1 drive missing and I do not have the money for a new drive. I have attached the report files. Thank very much.. Looks fine. It had three sectors which had been re-allocated before the start of the pre-clear, and no additional sectors were identified by the SMART firmware during the pre-clear. The drive should be fine. Quote Link to comment
firstraid Posted September 21, 2012 Share Posted September 21, 2012 I want to add a very old 200G PATA drive to my server. Could some take a look @ the result please .. P.S want should I be look for ?? cause I'ce got another PATA on preclear ... preclear_start__B41K0CGH_2012-09-21.txt preclear_rpt__B41K0CGH_2012-09-21.txt preclear_finish__B41K0CGH_2012-09-21.txt Quote Link to comment
Joe L. Posted September 21, 2012 Share Posted September 21, 2012 I want to add a very old 200G PATA drive to my server. Could some take a look @ the result please .. P.S want should I be look for ?? cause I'ce got another PATA on preclear ... Looks fine. It had one sector which had been re-allocated before the start of the pre-clear, and no additional sectors were identified by the SMART firmware during the pre-clear. The drive should be fine. You are looking for sectors pending re-allocation and re-allocated sectors. (and changes in those parameters during the process) Most disks have anywhere from several hundred to several thousand spare sectors that can be used by the disks's SMART firmware to re-allocate un-readable sectors. Your disk is doing fine. Other than that, you are looking for ANY "normalized VALUE" parameter that has reached or fallen below the affiliated failure THRESHOLD. Joe L. Quote Link to comment
firstraid Posted September 21, 2012 Share Posted September 21, 2012 Thanks again Joe , Quote Link to comment
itimpi Posted October 1, 2012 Share Posted October 1, 2012 I have some 3TB disks that have had zeroes written to them using the manufacturers diagnostic tools. Is it possible to just write the unRAID cleared indicator to them? The -z option talks about zeroing out the MBR but not setting it up as though it has been cleared for unRAID purposes. If not, then perhaps a new option (such as -Z) could be added to allow for this could be added to the script? Quote Link to comment
Joe L. Posted October 2, 2012 Share Posted October 2, 2012 I have some 3TB disks that have had zeroes written to them using the manufacturers diagnostic tools. No, there is no guarantee the manufacturer does what is needed. Is it possible to just write the unRAID cleared indicator to them? The -z option talks about zeroing out the MBR but not setting it up as though it has been cleared for unRAID purposes. If not, then perhaps a new option (such as -Z) could be added to allow for this could be added to the script? The risk is too high. So rather than risk someone's data, I have elected to not not add that option. Quote Link to comment
page3 Posted October 2, 2012 Share Posted October 2, 2012 Hi all, - please be gentle! I have a HP Microserver running two Samsung f2 1.5tb drives fine. Trying to add a new Seagate Green 1.5TB and having problems with preclear. Still stuck on cycle 1 of 1: = unRAID server Pre-Clear disk /dev/sda = cycle 1 of 1, partition start on sector 64 = Disk Pre-Read in progress: 34% complete = ( 516,547,584,000 bytes of 1,500,301,910,016 read ) = Disk Temperature: 34C, Elapsed Time: 2:54:23 Note: elapsed time is wrong. Been running well over four hours now. % and time not increasing! Loads of errors filling the syslog: -- Syslog snippit -- Oct 2 21:15:52 Tower kernel: Descriptor sense data with sense descriptors (in hex): Oct 2 21:15:52 Tower kernel: 72 03 11 04 00 00 00 0c 00 0a 80 00 00 00 00 00 Oct 2 21:15:52 Tower kernel: 3c 44 27 20 Oct 2 21:15:52 Tower kernel: sd 2:0:0:0: [sda] ASC=0x11 ASCQ=0x4 Oct 2 21:15:52 Tower kernel: sd 2:0:0:0: [sda] CDB: cdb[0]=0x28: 28 00 3c 44 27 20 00 00 08 00 Oct 2 21:15:52 Tower kernel: end_request: I/O error, dev sda, sector 1011099424 Oct 2 21:15:52 Tower kernel: Buffer I/O error on device sda, logical block 126387428 Oct 2 21:15:52 Tower kernel: ata2: EH complete Oct 2 21:15:55 Tower kernel: ata2.00: exception Emask 0x0 SAct 0x1 SErr 0x0 action 0x0 Oct 2 21:15:55 Tower kernel: ata2.00: irq_stat 0x40000008 Oct 2 21:15:55 Tower kernel: ata2.00: failed command: READ FPDMA QUEUED Oct 2 21:15:55 Tower kernel: ata2.00: cmd 60/08:00:48:6c:44/00:00:3c:00:00/40 tag 0 ncq 4096 in Oct 2 21:15:55 Tower kernel: res 41/40:08:48:6c:44/00:00:3c:00:00/00 Emask 0x409 (media error) <F> Oct 2 21:15:55 Tower kernel: ata2.00: status: { DRDY ERR } Oct 2 21:15:55 Tower kernel: ata2.00: error: { UNC } Oct 2 21:15:56 Tower kernel: ata2.00: configured for UDMA/133 Oct 2 21:15:56 Tower kernel: ata2: EH complete Oct 2 21:15:59 Tower kernel: ata2.00: exception Emask 0x0 SAct 0x1 SErr 0x0 action 0x0 Oct 2 21:15:59 Tower kernel: ata2.00: irq_stat 0x40000008 Oct 2 21:15:59 Tower kernel: ata2.00: failed command: READ FPDMA QUEUED Oct 2 21:15:59 Tower kernel: ata2.00: cmd 60/08:00:48:6c:44/00:00:3c:00:00/40 tag 0 ncq 4096 in Oct 2 21:15:59 Tower kernel: res 41/40:08:48:6c:44/00:00:3c:00:00/00 Emask 0x409 (media error) <F> Oct 2 21:15:59 Tower kernel: ata2.00: status: { DRDY ERR } Oct 2 21:15:59 Tower kernel: ata2.00: error: { UNC } Oct 2 21:15:59 Tower kernel: ata2.00: configured for UDMA/133 Oct 2 21:15:59 Tower kernel: ata2: EH complete Oct 2 21:16:02 Tower kernel: ata2.00: exception Emask 0x0 SAct 0x1 SErr 0x0 action 0x0 Oct 2 21:16:02 Tower kernel: ata2.00: irq_stat 0x40000008 Oct 2 21:16:02 Tower kernel: ata2.00: failed command: READ FPDMA QUEUED Oct 2 21:16:02 Tower kernel: ata2.00: cmd 60/08:00:48:6c:44/00:00:3c:00:00/40 tag 0 ncq 4096 in Oct 2 21:16:02 Tower kernel: res 41/40:08:48:6c:44/00:00:3c:00:00/00 Emask 0x409 (media error) <F> Oct 2 21:16:02 Tower kernel: ata2.00: status: { DRDY ERR } Oct 2 21:16:02 Tower kernel: ata2.00: error: { UNC } Oct 2 21:16:02 Tower kernel: ata2.00: configured for UDMA/133 Oct 2 21:16:02 Tower kernel: ata2: EH complete --- ends --- SMART REPORTS SMART Preclear Start Disk: /dev/sdb smartctl 5.40 2010-10-16 r3189 [i486-slackware-linux-gnu] (local build) Copyright (C) 2002-10 by Bruce Allen, http://smartmontools.sourceforge.net === START OF INFORMATION SECTION === Device Model: VB0250EAVER Serial Number: W2A4LK0R Firmware Version: HPG7 User Capacity: 250,059,350,016 bytes Device is: Not in smartctl database [for details use: -P showall] ATA Version is: 8 ATA Standard is: ATA-8-ACS revision 6 Local Time is: Sat Feb 4 16:25:55 2012 GMT SMART support is: Available - device has SMART capability. SMART support is: Enabled === START OF READ SMART DATA SECTION === SMART overall-health self-assessment test result: PASSED General SMART Values: Offline data collection status: (0x82) Offline data collection activity was completed without error. Auto Offline Data Collection: Enabled. Self-test execution status: ( 0) The previous self-test routine completed without error or no self-test has ever been run. Total time to complete Offline data collection: ( 625) seconds. Offline data collection capabilities: (0x5b) SMART execute Offline immediate. Auto Offline data collection on/off support. Suspend Offline collection upon new command. Offline surface scan supported. Self-test supported. No Conveyance Self-test supported. Selective Self-test supported. SMART capabilities: (0x0003) Saves SMART data before entering power-saving mode. Supports SMART auto save timer. Error logging capability: (0x01) Error logging supported. General Purpose Logging supported. Short self-test routine recommended polling time: ( 2) minutes. Extended self-test routine recommended polling time: ( 45) minutes. SCT capabilities: (0x1039) SCT Status supported. SCT Error Recovery Control supported. SCT Feature Control supported. SCT Data Table supported. SMART Attributes Data Structure revision number: 10 Vendor Specific SMART Attributes with Thresholds: ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE 1 Raw_Read_Error_Rate 0x002f 117 100 006 Pre-fail Always - 136856044 3 Spin_Up_Time 0x0023 097 097 000 Pre-fail Always - 0 4 Start_Stop_Count 0x0032 100 100 020 Old_age Always - 48 5 Reallocated_Sector_Ct 0x0033 100 100 036 Pre-fail Always - 39 7 Seek_Error_Rate 0x002f 100 253 030 Pre-fail Always - 392797 9 Power_On_Hours 0x0032 100 100 000 Old_age Always - 72 10 Spin_Retry_Count 0x0033 100 100 097 Pre-fail Always - 0 12 Power_Cycle_Count 0x0032 100 100 020 Old_age Always - 23 180 Unused_Rsvd_Blk_Cnt_Tot 0x002b 100 100 000 Pre-fail Always - 1033259290 183 Runtime_Bad_Block 0x0032 100 100 000 Old_age Always - 0 184 End-to-End_Error 0x0032 100 100 097 Old_age Always - 0 187 Reported_Uncorrect 0x0032 100 100 000 Old_age Always - 0 188 Command_Timeout 0x0032 100 100 000 Old_age Always - 0 189 High_Fly_Writes 0x003a 100 100 000 Old_age Always - 0 190 Airflow_Temperature_Cel 0x0022 067 065 045 Old_age Always - 33 (Min/Max 25/33) 194 Temperature_Celsius 0x0022 033 040 000 Old_age Always - 33 (0 15 0 0) 195 Hardware_ECC_Recovered 0x003a 066 053 000 Old_age Always - 136856044 196 Reallocated_Event_Count 0x0032 100 100 036 Old_age Always - 39 197 Current_Pending_Sector 0x0032 100 100 000 Old_age Always - 0 198 Offline_Uncorrectable 0x0030 100 100 000 Old_age Offline - 0 199 UDMA_CRC_Error_Count 0x0032 200 200 000 Old_age Always - 0 SMART Error Log Version: 1 No Errors Logged SMART Self-test log structure revision number 1 No self-tests have been logged. [To run self-tests, use: smartctl -t] SMART Selective self-test log data structure revision number 1 SPAN MIN_LBA MAX_LBA CURRENT_TEST_STATUS 1 0 0 Not_testing 2 0 0 Not_testing 3 0 0 Not_testing 4 0 0 Not_testing 5 0 0 Not_testing Selective self-test flags (0x0): After scanning selected spans, do NOT read-scan remainder of disk. If Selective self-test is pending on power-up, resume after 0 minute delay. Preclear Finish Disk: /dev/sdb smartctl 5.40 2010-10-16 r3189 [i486-slackware-linux-gnu] (local build) Copyright (C) 2002-10 by Bruce Allen, http://smartmontools.sourceforge.net === START OF INFORMATION SECTION === Device Model: VB0250EAVER Serial Number: W2A4LK0R Firmware Version: HPG7 User Capacity: 250,059,350,016 bytes Device is: Not in smartctl database [for details use: -P showall] ATA Version is: 8 ATA Standard is: ATA-8-ACS revision 6 Local Time is: Sat Feb 4 17:15:56 2012 GMT SMART support is: Available - device has SMART capability. SMART support is: Enabled === START OF READ SMART DATA SECTION === SMART overall-health self-assessment test result: PASSED General SMART Values: Offline data collection status: (0x82) Offline data collection activity was completed without error. Auto Offline Data Collection: Enabled. Self-test execution status: ( 0) The previous self-test routine completed without error or no self-test has ever been run. Total time to complete Offline data collection: ( 625) seconds. Offline data collection capabilities: (0x5b) SMART execute Offline immediate. Auto Offline data collection on/off support. Suspend Offline collection upon new command. Offline surface scan supported. Self-test supported. No Conveyance Self-test supported. Selective Self-test supported. SMART capabilities: (0x0003) Saves SMART data before entering power-saving mode. Supports SMART auto save timer. Error logging capability: (0x01) Error logging supported. General Purpose Logging supported. Short self-test routine recommended polling time: ( 2) minutes. Extended self-test routine recommended polling time: ( 45) minutes. SCT capabilities: (0x1039) SCT Status supported. SCT Error Recovery Control supported. SCT Feature Control supported. SCT Data Table supported. SMART Attributes Data Structure revision number: 10 Vendor Specific SMART Attributes with Thresholds: ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE 1 Raw_Read_Error_Rate 0x002f 117 100 006 Pre-fail Always - 136884121 3 Spin_Up_Time 0x0023 097 097 000 Pre-fail Always - 0 4 Start_Stop_Count 0x0032 100 100 020 Old_age Always - 48 5 Reallocated_Sector_Ct 0x0033 099 099 036 Pre-fail Always - 63 7 Seek_Error_Rate 0x002f 100 253 030 Pre-fail Always - 477913 9 Power_On_Hours 0x0032 100 100 000 Old_age Always - 72 10 Spin_Retry_Count 0x0033 100 100 097 Pre-fail Always - 0 12 Power_Cycle_Count 0x0032 100 100 020 Old_age Always - 23 180 Unused_Rsvd_Blk_Cnt_Tot 0x002b 100 100 000 Pre-fail Always - 1521664575 183 Runtime_Bad_Block 0x0032 100 100 000 Old_age Always - 0 184 End-to-End_Error 0x0032 100 100 097 Old_age Always - 0 187 Reported_Uncorrect 0x0032 100 100 000 Old_age Always - 0 188 Command_Timeout 0x0032 100 100 000 Old_age Always - 0 189 High_Fly_Writes 0x003a 100 100 000 Old_age Always - 0 190 Airflow_Temperature_Cel 0x0022 065 065 045 Old_age Always - 35 (Min/Max 25/35) 194 Temperature_Celsius 0x0022 035 040 000 Old_age Always - 35 (0 15 0 0) 195 Hardware_ECC_Recovered 0x003a 066 053 000 Old_age Always - 136884121 196 Reallocated_Event_Count 0x0032 099 099 036 Old_age Always - 63 197 Current_Pending_Sector 0x0032 100 100 000 Old_age Always - 0 198 Offline_Uncorrectable 0x0030 100 100 000 Old_age Offline - 0 199 UDMA_CRC_Error_Count 0x0032 200 200 000 Old_age Always - 0 SMART Error Log Version: 1 No Errors Logged SMART Self-test log structure revision number 1 No self-tests have been logged. [To run self-tests, use: smartctl -t] SMART Selective self-test log data structure revision number 1 SPAN MIN_LBA MAX_LBA CURRENT_TEST_STATUS 1 0 0 Not_testing 2 0 0 Not_testing 3 0 0 Not_testing 4 0 0 Not_testing 5 0 0 Not_testing Selective self-test flags (0x0): After scanning selected spans, do NOT read-scan remainder of disk. If Selective self-test is pending on power-up, resume after 0 minute delay. Preclear has been running around 4 hours, and is still spurting out errors. Do I leave it and see what happens, or is this a faulty drive? Advice very much appreciated! Quote Link to comment
Joe L. Posted October 3, 2012 Share Posted October 3, 2012 Hi all, - please be gentle! I have a HP Microserver running two Samsung f2 1.5tb drives fine. Trying to add a new Seagate Green 1.5TB and having problems with preclear. Still stuck on cycle 1 of 1: = unRAID server Pre-Clear disk /dev/sda = cycle 1 of 1, partition start on sector 64 = Disk Pre-Read in progress: 34% complete = ( 516,547,584,000 bytes of 1,500,301,910,016 read ) = Disk Temperature: 34C, Elapsed Time: 2:54:23 Note: elapsed time is wrong. Been running well over four hours now. % and time not increasing! Loads of errors filling the syslog: -- Syslog snippit -- Oct 2 21:15:52 Tower kernel: Descriptor sense data with sense descriptors (in hex): Oct 2 21:15:52 Tower kernel: 72 03 11 04 00 00 00 0c 00 0a 80 00 00 00 00 00 Oct 2 21:15:52 Tower kernel: 3c 44 27 20 Oct 2 21:15:52 Tower kernel: sd 2:0:0:0: [sda] ASC=0x11 ASCQ=0x4 Oct 2 21:15:52 Tower kernel: sd 2:0:0:0: [sda] CDB: cdb[0]=0x28: 28 00 3c 44 27 20 00 00 08 00 Oct 2 21:15:52 Tower kernel: end_request: I/O error, dev sda, sector 1011099424 Oct 2 21:15:52 Tower kernel: Buffer I/O error on device sda, logical block 126387428 Oct 2 21:15:52 Tower kernel: ata2: EH complete Oct 2 21:15:55 Tower kernel: ata2.00: exception Emask 0x0 SAct 0x1 SErr 0x0 action 0x0 Oct 2 21:15:55 Tower kernel: ata2.00: irq_stat 0x40000008 Oct 2 21:15:55 Tower kernel: ata2.00: failed command: READ FPDMA QUEUED Oct 2 21:15:55 Tower kernel: ata2.00: cmd 60/08:00:48:6c:44/00:00:3c:00:00/40 tag 0 ncq 4096 in Oct 2 21:15:55 Tower kernel: res 41/40:08:48:6c:44/00:00:3c:00:00/00 Emask 0x409 (media error) <F> Oct 2 21:15:55 Tower kernel: ata2.00: status: { DRDY ERR } Oct 2 21:15:55 Tower kernel: ata2.00: error: { UNC } Oct 2 21:15:56 Tower kernel: ata2.00: configured for UDMA/133 Oct 2 21:15:56 Tower kernel: ata2: EH complete Oct 2 21:15:59 Tower kernel: ata2.00: exception Emask 0x0 SAct 0x1 SErr 0x0 action 0x0 Oct 2 21:15:59 Tower kernel: ata2.00: irq_stat 0x40000008 Oct 2 21:15:59 Tower kernel: ata2.00: failed command: READ FPDMA QUEUED Oct 2 21:15:59 Tower kernel: ata2.00: cmd 60/08:00:48:6c:44/00:00:3c:00:00/40 tag 0 ncq 4096 in Oct 2 21:15:59 Tower kernel: res 41/40:08:48:6c:44/00:00:3c:00:00/00 Emask 0x409 (media error) <F> Oct 2 21:15:59 Tower kernel: ata2.00: status: { DRDY ERR } Oct 2 21:15:59 Tower kernel: ata2.00: error: { UNC } Oct 2 21:15:59 Tower kernel: ata2.00: configured for UDMA/133 Oct 2 21:15:59 Tower kernel: ata2: EH complete Oct 2 21:16:02 Tower kernel: ata2.00: exception Emask 0x0 SAct 0x1 SErr 0x0 action 0x0 Oct 2 21:16:02 Tower kernel: ata2.00: irq_stat 0x40000008 Oct 2 21:16:02 Tower kernel: ata2.00: failed command: READ FPDMA QUEUED Oct 2 21:16:02 Tower kernel: ata2.00: cmd 60/08:00:48:6c:44/00:00:3c:00:00/40 tag 0 ncq 4096 in Oct 2 21:16:02 Tower kernel: res 41/40:08:48:6c:44/00:00:3c:00:00/00 Emask 0x409 (media error) <F> Oct 2 21:16:02 Tower kernel: ata2.00: status: { DRDY ERR } Oct 2 21:16:02 Tower kernel: ata2.00: error: { UNC } Oct 2 21:16:02 Tower kernel: ata2.00: configured for UDMA/133 Oct 2 21:16:02 Tower kernel: ata2: EH complete --- ends --- SMART REPORTS SMART Preclear Start Disk: /dev/sdb smartctl 5.40 2010-10-16 r3189 [i486-slackware-linux-gnu] (local build) Copyright (C) 2002-10 by Bruce Allen, http://smartmontools.sourceforge.net === START OF INFORMATION SECTION === Device Model: VB0250EAVER Serial Number: W2A4LK0R Firmware Version: HPG7 User Capacity: 250,059,350,016 bytes Device is: Not in smartctl database [for details use: -P showall] ATA Version is: 8 ATA Standard is: ATA-8-ACS revision 6 Local Time is: Sat Feb 4 16:25:55 2012 GMT SMART support is: Available - device has SMART capability. SMART support is: Enabled === START OF READ SMART DATA SECTION === SMART overall-health self-assessment test result: PASSED General SMART Values: Offline data collection status: (0x82) Offline data collection activity was completed without error. Auto Offline Data Collection: Enabled. Self-test execution status: ( 0) The previous self-test routine completed without error or no self-test has ever been run. Total time to complete Offline data collection: ( 625) seconds. Offline data collection capabilities: (0x5b) SMART execute Offline immediate. Auto Offline data collection on/off support. Suspend Offline collection upon new command. Offline surface scan supported. Self-test supported. No Conveyance Self-test supported. Selective Self-test supported. SMART capabilities: (0x0003) Saves SMART data before entering power-saving mode. Supports SMART auto save timer. Error logging capability: (0x01) Error logging supported. General Purpose Logging supported. Short self-test routine recommended polling time: ( 2) minutes. Extended self-test routine recommended polling time: ( 45) minutes. SCT capabilities: (0x1039) SCT Status supported. SCT Error Recovery Control supported. SCT Feature Control supported. SCT Data Table supported. SMART Attributes Data Structure revision number: 10 Vendor Specific SMART Attributes with Thresholds: ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE 1 Raw_Read_Error_Rate 0x002f 117 100 006 Pre-fail Always - 136856044 3 Spin_Up_Time 0x0023 097 097 000 Pre-fail Always - 0 4 Start_Stop_Count 0x0032 100 100 020 Old_age Always - 48 5 Reallocated_Sector_Ct 0x0033 100 100 036 Pre-fail Always - 39 7 Seek_Error_Rate 0x002f 100 253 030 Pre-fail Always - 392797 9 Power_On_Hours 0x0032 100 100 000 Old_age Always - 72 10 Spin_Retry_Count 0x0033 100 100 097 Pre-fail Always - 0 12 Power_Cycle_Count 0x0032 100 100 020 Old_age Always - 23 180 Unused_Rsvd_Blk_Cnt_Tot 0x002b 100 100 000 Pre-fail Always - 1033259290 183 Runtime_Bad_Block 0x0032 100 100 000 Old_age Always - 0 184 End-to-End_Error 0x0032 100 100 097 Old_age Always - 0 187 Reported_Uncorrect 0x0032 100 100 000 Old_age Always - 0 188 Command_Timeout 0x0032 100 100 000 Old_age Always - 0 189 High_Fly_Writes 0x003a 100 100 000 Old_age Always - 0 190 Airflow_Temperature_Cel 0x0022 067 065 045 Old_age Always - 33 (Min/Max 25/33) 194 Temperature_Celsius 0x0022 033 040 000 Old_age Always - 33 (0 15 0 0) 195 Hardware_ECC_Recovered 0x003a 066 053 000 Old_age Always - 136856044 196 Reallocated_Event_Count 0x0032 100 100 036 Old_age Always - 39 197 Current_Pending_Sector 0x0032 100 100 000 Old_age Always - 0 198 Offline_Uncorrectable 0x0030 100 100 000 Old_age Offline - 0 199 UDMA_CRC_Error_Count 0x0032 200 200 000 Old_age Always - 0 SMART Error Log Version: 1 No Errors Logged SMART Self-test log structure revision number 1 No self-tests have been logged. [To run self-tests, use: smartctl -t] SMART Selective self-test log data structure revision number 1 SPAN MIN_LBA MAX_LBA CURRENT_TEST_STATUS 1 0 0 Not_testing 2 0 0 Not_testing 3 0 0 Not_testing 4 0 0 Not_testing 5 0 0 Not_testing Selective self-test flags (0x0): After scanning selected spans, do NOT read-scan remainder of disk. If Selective self-test is pending on power-up, resume after 0 minute delay. Preclear Finish Disk: /dev/sdb smartctl 5.40 2010-10-16 r3189 [i486-slackware-linux-gnu] (local build) Copyright (C) 2002-10 by Bruce Allen, http://smartmontools.sourceforge.net === START OF INFORMATION SECTION === Device Model: VB0250EAVER Serial Number: W2A4LK0R Firmware Version: HPG7 User Capacity: 250,059,350,016 bytes Device is: Not in smartctl database [for details use: -P showall] ATA Version is: 8 ATA Standard is: ATA-8-ACS revision 6 Local Time is: Sat Feb 4 17:15:56 2012 GMT SMART support is: Available - device has SMART capability. SMART support is: Enabled === START OF READ SMART DATA SECTION === SMART overall-health self-assessment test result: PASSED General SMART Values: Offline data collection status: (0x82) Offline data collection activity was completed without error. Auto Offline Data Collection: Enabled. Self-test execution status: ( 0) The previous self-test routine completed without error or no self-test has ever been run. Total time to complete Offline data collection: ( 625) seconds. Offline data collection capabilities: (0x5b) SMART execute Offline immediate. Auto Offline data collection on/off support. Suspend Offline collection upon new command. Offline surface scan supported. Self-test supported. No Conveyance Self-test supported. Selective Self-test supported. SMART capabilities: (0x0003) Saves SMART data before entering power-saving mode. Supports SMART auto save timer. Error logging capability: (0x01) Error logging supported. General Purpose Logging supported. Short self-test routine recommended polling time: ( 2) minutes. Extended self-test routine recommended polling time: ( 45) minutes. SCT capabilities: (0x1039) SCT Status supported. SCT Error Recovery Control supported. SCT Feature Control supported. SCT Data Table supported. SMART Attributes Data Structure revision number: 10 Vendor Specific SMART Attributes with Thresholds: ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE 1 Raw_Read_Error_Rate 0x002f 117 100 006 Pre-fail Always - 136884121 3 Spin_Up_Time 0x0023 097 097 000 Pre-fail Always - 0 4 Start_Stop_Count 0x0032 100 100 020 Old_age Always - 48 5 Reallocated_Sector_Ct 0x0033 099 099 036 Pre-fail Always - 63 7 Seek_Error_Rate 0x002f 100 253 030 Pre-fail Always - 477913 9 Power_On_Hours 0x0032 100 100 000 Old_age Always - 72 10 Spin_Retry_Count 0x0033 100 100 097 Pre-fail Always - 0 12 Power_Cycle_Count 0x0032 100 100 020 Old_age Always - 23 180 Unused_Rsvd_Blk_Cnt_Tot 0x002b 100 100 000 Pre-fail Always - 1521664575 183 Runtime_Bad_Block 0x0032 100 100 000 Old_age Always - 0 184 End-to-End_Error 0x0032 100 100 097 Old_age Always - 0 187 Reported_Uncorrect 0x0032 100 100 000 Old_age Always - 0 188 Command_Timeout 0x0032 100 100 000 Old_age Always - 0 189 High_Fly_Writes 0x003a 100 100 000 Old_age Always - 0 190 Airflow_Temperature_Cel 0x0022 065 065 045 Old_age Always - 35 (Min/Max 25/35) 194 Temperature_Celsius 0x0022 035 040 000 Old_age Always - 35 (0 15 0 0) 195 Hardware_ECC_Recovered 0x003a 066 053 000 Old_age Always - 136884121 196 Reallocated_Event_Count 0x0032 099 099 036 Old_age Always - 63 197 Current_Pending_Sector 0x0032 100 100 000 Old_age Always - 0 198 Offline_Uncorrectable 0x0030 100 100 000 Old_age Offline - 0 199 UDMA_CRC_Error_Count 0x0032 200 200 000 Old_age Always - 0 SMART Error Log Version: 1 No Errors Logged SMART Self-test log structure revision number 1 No self-tests have been logged. [To run self-tests, use: smartctl -t] SMART Selective self-test log data structure revision number 1 SPAN MIN_LBA MAX_LBA CURRENT_TEST_STATUS 1 0 0 Not_testing 2 0 0 Not_testing 3 0 0 Not_testing 4 0 0 Not_testing 5 0 0 Not_testing Selective self-test flags (0x0): After scanning selected spans, do NOT read-scan remainder of disk. If Selective self-test is pending on power-up, resume after 0 minute delay. Preclear has been running around 4 hours, and is still spurting out errors. Do I leave it and see what happens, or is this a faulty drive? Advice very much appreciated! UNC media errors are sectors where the data on the sector does not match the checksum at the end of that sector. It might be a defective sector, or it might be written poorly. Either way, let the process complete. It will re-allocate the sector if it needs to when it gets to the writing phase, or, re-write it in place (it tries that first) in an attempt to not re-allocate the sector. Notice there were 39 sectors already re-allocated prior to the start of the process. Can't tell you about the time counters... Joe L. Quote Link to comment
page3 Posted October 3, 2012 Share Posted October 3, 2012 UNC media errors are sectors where the data on the sector does not match the checksum at the end of that sector. It might be a defective sector, or it might be written poorly. Either way, let the process complete. It will re-allocate the sector if it needs to when it gets to the writing phase, or, re-write it in place (it tries that first) in an attempt to not re-allocate the sector. Notice there were 39 sectors already re-allocated prior to the start of the process. Can't tell you about the time counters... Joe L. Thanks Joe. I really appreciate the swift reply. It is now on stage 2 so I'll see what it reports and post result here, if that's ok. Quote Link to comment
itimpi Posted October 3, 2012 Share Posted October 3, 2012 Is it possible to just write the unRAID cleared indicator to them? The -z option talks about zeroing out the MBR but not setting it up as though it has been cleared for unRAID purposes. If not, then perhaps a new option (such as -Z) could be added to allow for this could be added to the script? The risk is too high. So rather than risk someone's data, I have elected to not not add that option. Thanks for the feedback - I thought I would ask but I can understand your viewpoint. The emphasis has to be on minimizing any chance of losing data.. I guess in this case I always have the option of skipping the pre and post clear phases to speed things up if the disk has been tested via other methods. I guess the better thing to do is to do some early preparation work and have new/spare disks well in advance of needing it and doing using the preclear_disk.sh script to do a thorough stress test. However this requires some pre-planning - not always my strongest point Quote Link to comment
servion Posted October 3, 2012 Share Posted October 3, 2012 I've got an older 750GB barracuda 7200 drive that I went through hell with in the past.... when the whole click of death thing was happening to all the seagate drives. First one crashed after a month, second one after a week, this is the 3rd... it started occasionally clicking after a week or so, so I decided to basically stop using it as opposed to continual RMAs. Fast forward to now ~4 years later... the drive has a 5-year warranty and still has some covered time left, and now I want to see if I can either start using it in the array, or RMA it and hopefully get a good working replacement this time. On windows, I use their SeaTools tool (which they imply is required to show errors to RMA it). Well, all the SeaTools tests (short, long, smart) pass... but using other smart tools, they show issues (100 reallocated sectors). I figured I'd put it into my unraid box and preclear it to see if it can finally kill it... well, it seems to not be getting worse, and I haven't noticed clicking for a while... but I'm hesitant to use it seeing as I remember it scaring me in the past. Can someone look over my preclear results email (which includes the pre and post 3-cycle SMART results)? If either you think its OK to use in the array, or if there's something I can point out when trying to RMA, please let me know! Thanks! preclear_seagate.txt Quote Link to comment
page3 Posted October 3, 2012 Share Posted October 3, 2012 Hi Joe, I've been a total idiot. Those smart reports I copied above are dated February 2012, so are for my old drive! If you forgive me for wasting your time, I'll attach the three smart reports produced so far. Currently it is on the post read, still spitting out errors. Regards, Adam. smart_mid_after_zero1_sda.txt smart_mid_preread1_sda.txt smart_start_sda.txt Quote Link to comment
Joe L. Posted October 3, 2012 Share Posted October 3, 2012 I'd say the disk is bad... I think 9168 is the highest number of re-allocated sectors I've ever seen on a disk without it being marked as failed. I'd strongly advise not using it in the unRAID array. 5 Reallocated_Sector_Ct 0x0033 087 087 036 Pre-fail Always - 9168 Quote Link to comment
Recommended Posts
Join the conversation
You can post now and register later. If you have an account, sign in now to post with your account.
Note: Your post will require moderator approval before it will be visible.