luca Posted November 17, 2013 Share Posted November 17, 2013 This Hitachi drive shows up as green, but gives a bunch of parity errors every time a parity check is run. File system check is OK. Any smoking guns? Luca smartctl -a -d ata /dev/sdc smartctl 5.43 2012-06-30 r3573 [i686-linux-3.9.11p-unRAID] (local build) Copyright © 2002-12 by Bruce Allen, http://smartmontools.sourceforge.net === START OF INFORMATION SECTION === Device Model: Hitachi HDS723030BLE640 Serial Number: MS79215X02R90G LU WWN Device Id: 5 000cca 37ec13c6b Firmware Version: MX6OAAB0 User Capacity: 3,000,592,982,016 bytes [3.00 TB] Sector Sizes: 512 bytes logical, 4096 bytes physical Device is: Not in smartctl database [for details use: -P showall] ATA Version is: 8 ATA Standard is: ATA-8-ACS revision 4 Local Time is: Sat Nov 16 21:17:52 2013 PST SMART support is: Available - device has SMART capability. SMART support is: Enabled === START OF READ SMART DATA SECTION === SMART overall-health self-assessment test result: PASSED General SMART Values: Offline data collection status: (0x85) Offline data collection activity was aborted by an interrupting command from host. Auto Offline Data Collection: Enabled. Self-test execution status: ( 0) The previous self-test routine completed without error or no self-test has ever been run. Total time to complete Offline data collection: (23226) seconds. Offline data collection capabilities: (0x5b) SMART execute Offline immediate. Auto Offline data collection on/off support. Suspend Offline collection upon new command. Offline surface scan supported. Self-test supported. No Conveyance Self-test supported. Selective Self-test supported. SMART capabilities: (0x0003) Saves SMART data before entering power-saving mode. Supports SMART auto save timer. Error logging capability: (0x01) Error logging supported. General Purpose Logging supported. Short self-test routine recommended polling time: ( 1) minutes. Extended self-test routine recommended polling time: ( 387) minutes. SCT capabilities: (0x003d) SCT Status supported. SCT Error Recovery Control supported. SCT Feature Control supported. SCT Data Table supported. SMART Attributes Data Structure revision number: 16 Vendor Specific SMART Attributes with Thresholds: ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE 1 Raw_Read_Error_Rate 0x000b 100 100 016 Pre-fail Always - 0 2 Throughput_Performance 0x0005 139 139 054 Pre-fail Offline - 70 3 Spin_Up_Time 0x0007 136 136 024 Pre-fail Always - 420 (Average 424) 4 Start_Stop_Count 0x0012 100 100 000 Old_age Always - 34 5 Reallocated_Sector_Ct 0x0033 100 100 005 Pre-fail Always - 0 7 Seek_Error_Rate 0x000b 100 100 067 Pre-fail Always - 0 8 Seek_Time_Performance 0x0005 121 121 020 Pre-fail Offline - 34 9 Power_On_Hours 0x0012 100 100 000 Old_age Always - 331 10 Spin_Retry_Count 0x0013 100 100 060 Pre-fail Always - 0 12 Power_Cycle_Count 0x0032 100 100 000 Old_age Always - 13 192 Power-Off_Retract_Count 0x0032 100 100 000 Old_age Always - 40 193 Load_Cycle_Count 0x0012 100 100 000 Old_age Always - 40 194 Temperature_Celsius 0x0002 193 193 000 Old_age Always - 31 (Min/Max 18/33) 196 Reallocated_Event_Count 0x0032 100 100 000 Old_age Always - 0 197 Current_Pending_Sector 0x0022 100 100 000 Old_age Always - 0 198 Offline_Uncorrectable 0x0008 100 100 000 Old_age Offline - 0 199 UDMA_CRC_Error_Count 0x000a 200 200 000 Old_age Always - 0 SMART Error Log Version: 1 ATA Error Count: 28 (device log contains only the most recent five errors) CR = Command Register [HEX] FR = Features Register [HEX] SC = Sector Count Register [HEX] SN = Sector Number Register [HEX] CL = Cylinder Low Register [HEX] CH = Cylinder High Register [HEX] DH = Device/Head Register [HEX] DC = Device Command Register [HEX] ER = Error register [HEX] ST = Status register [HEX] Powered_Up_Time is measured from power on, and printed as DDd+hh:mm:SS.sss where DD=days, hh=hours, mm=minutes, SS=sec, and sss=millisec. It "wraps" after 49.710 days. Error 28 occurred at disk power-on lifetime: 323 hours (13 days + 11 hours) When the command that caused the error occurred, the device was active or idle. After command completion occurred, registers were: ER ST SC SN CL CH DH -- -- -- -- -- -- -- 40 51 70 98 a9 97 0e Error: UNC 112 sectors at LBA = 0x0e97a998 = 244820376 Commands leading to the command that caused the error were: CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name -- -- -- -- -- -- -- -- ---------------- -------------------- 25 00 f0 18 a6 97 e0 08 00:59:10.520 READ DMA EXT 25 00 00 18 a2 97 e0 08 00:59:10.517 READ DMA EXT 25 00 00 18 9e 97 e0 08 00:59:10.513 READ DMA EXT c8 00 60 b8 9d 97 ee 08 00:59:10.504 READ DMA c8 00 50 68 9d 97 ee 08 00:59:10.503 READ DMA Error 27 occurred at disk power-on lifetime: 322 hours (13 days + 10 hours) When the command that caused the error occurred, the device was active or idle. After command completion occurred, registers were: ER ST SC SN CL CH DH -- -- -- -- -- -- -- 40 51 58 90 cb fb 08 Error: UNC 88 sectors at LBA = 0x08fbcb90 = 150719376 Commands leading to the command that caused the error were: CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name -- -- -- -- -- -- -- -- ---------------- -------------------- 25 00 00 e8 c9 fb e0 08 00:41:14.396 READ DMA EXT ea 00 00 e7 c9 fb a0 08 00:41:14.220 FLUSH CACHE EXT 25 00 00 e8 c5 fb e0 08 00:41:14.218 READ DMA EXT ca 00 08 30 be fb e8 08 00:41:14.218 WRITE DMA ca 00 08 38 be fb e8 08 00:41:14.218 WRITE DMA Error 26 occurred at disk power-on lifetime: 322 hours (13 days + 10 hours) When the command that caused the error occurred, the device was active or idle. After command completion occurred, registers were: ER ST SC SN CL CH DH -- -- -- -- -- -- -- 40 51 b8 30 be fb 08 Error: UNC 184 sectors at LBA = 0x08fbbe30 = 150715952 Commands leading to the command that caused the error were: CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name -- -- -- -- -- -- -- -- ---------------- -------------------- 25 00 00 e8 bd fb e0 08 00:41:10.259 READ DMA EXT 25 00 00 e8 b9 fb e0 08 00:41:10.241 READ DMA EXT 25 00 00 e8 b5 fb e0 08 00:41:10.237 READ DMA EXT ea 00 00 e7 b5 fb a0 08 00:41:10.060 FLUSH CACHE EXT 25 00 00 e8 b1 fb e0 08 00:41:10.058 READ DMA EXT Error 25 occurred at disk power-on lifetime: 322 hours (13 days + 10 hours) When the command that caused the error occurred, the device was active or idle. After command completion occurred, registers were: ER ST SC SN CL CH DH -- -- -- -- -- -- -- 40 51 e8 f8 af fb 08 Error: UNC 232 sectors at LBA = 0x08fbaff8 = 150712312 Commands leading to the command that caused the error were: CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name -- -- -- -- -- -- -- -- ---------------- -------------------- 25 00 88 58 ae fb e0 08 00:41:06.111 READ DMA EXT 25 00 00 58 aa fb e0 08 00:41:06.107 READ DMA EXT 25 00 00 58 a6 fb e0 08 00:41:06.104 READ DMA EXT c8 00 70 e8 a5 fb e8 08 00:41:06.095 READ DMA c8 00 08 e0 a5 fb e8 08 00:41:06.095 READ DMA Error 24 occurred at disk power-on lifetime: 305 hours (12 days + 17 hours) When the command that caused the error occurred, the device was active or idle. After command completion occurred, registers were: ER ST SC SN CL CH DH -- -- -- -- -- -- -- 40 51 10 98 a9 97 0e Error: UNC 16 sectors at LBA = 0x0e97a998 = 244820376 Commands leading to the command that caused the error were: CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name -- -- -- -- -- -- -- -- ---------------- -------------------- 25 00 00 a8 a6 97 e0 08 06:44:44.450 READ DMA EXT 27 00 00 00 00 00 e0 08 06:44:44.450 READ NATIVE MAX ADDRESS EXT ec 00 00 00 00 00 a0 08 06:44:44.425 IDENTIFY DEVICE ef 03 46 00 00 00 a0 08 06:44:44.406 SET FEATURES [set transfer mode] 27 00 00 00 00 00 e0 08 06:44:44.386 READ NATIVE MAX ADDRESS EXT SMART Self-test log structure revision number 1 Num Test_Description Status Remaining LifeTime(hours) LBA_of_first_error # 1 Short offline Completed without error 00% 331 - # 2 Short offline Completed without error 00% 327 - # 3 Short offline Completed without error 00% 325 - # 4 Short offline Completed without error 00% 325 - SMART Selective self-test log data structure revision number 1 SPAN MIN_LBA MAX_LBA CURRENT_TEST_STATUS 1 0 0 Not_testing 2 0 0 Not_testing 3 0 0 Not_testing 4 0 0 Not_testing 5 0 0 Not_testing Selective self-test flags (0x0): After scanning selected spans, do NOT read-scan remainder of disk. If Selective self-test is pending on power-up, resume after 0 minute delay. Link to comment
c3 Posted November 17, 2013 Share Posted November 17, 2013 Seems like it is only a few weeks old (power on). Did you run preclear? The drive is reporting various UNCorrectable reads, so far no write errors. Link to comment
BRiT Posted November 17, 2013 Share Posted November 17, 2013 You should have run run preclear on that disk. I think you will need to RMA it with all those errors. Link to comment
luca Posted November 19, 2013 Author Share Posted November 19, 2013 I am not sure if I had run preclear on that drive. I tried running it yesterday, and after going through the whole process, it's telling me that the device MBR could not be precleared. Not sure what that means. From your previous comments, it sounds like I should RMA the drive anyway... This drive is on my backup system. This system is off most of the time, except for backups, of course. =========================================================================== = unRAID server Pre-Clear disk /dev/sdc = cycle 1 of 1 = Disk Pre-Clear-Read completed DONE = Step 1 of 10 - Copying zeros to first 2048k bytes DONE = Step 2 of 10 - Copying zeros to remainder of disk to clear it DONE = Step 3 of 10 - Disk is now cleared from MBR onward. DONE = Step 4 of 10 - Clearing MBR bytes for partition 2,3 & 4 DONE = Step 5 of 10 - Clearing MBR code area DONE = Step 6 of 10 - Setting MBR signature bytes DONE = Step 7 of 10 - Setting partition 1 to precleared state DONE = Step 8 of 10 - Notifying kernel we changed the partitioning DONE = Step 9 of 10 - Creating the /dev/disk/by* entries DONE = Step 10 of 10 - Testing if the clear has been successful. DONE = Disk Temperature: 30C, Elapsed Time: 12:04:12 ============================================================================ == == SORRY: Disk /dev/sdc MBR could NOT be precleared == == out4= 00092 == out5= 00092 ============================================================================ 0000000 0000 0000 0000 0000 0000 0000 0000 0000 1+0 records in 1+0 records out * 0000700 0000 0000 0000 003f 0000 0a37 15d5 0000 0000720 0000 0000 0000 0000 0000 0000 0000 0000 512 bytes (512 B) copied* 0000760 0000 0000 0000 0000 0000 0000 0000 5c5c 0001000 , 0.029286 s, 17.5 kB/s Link to comment
Recommended Posts
Archived
This topic is now archived and is closed to further replies.