alphazo

Members
  • Posts

    109
Everything posted by alphazo

  1. Hello, I have been running unRaid 5.0 for quite some time and now have 13 various drives in the system. Yesterday, in order to expand space, I decided to replace disk1 (1.5TB) with a brand new 4TB Hitachi drive. Here is what I did:
     • Ran a parity check and rebuild
     • Precleared the new 4TB drive
     • Replaced disk1 (1.5TB) with the 4TB one
     • Rebooted and assigned the new drive
Data rebuild then started for disk1 with an estimated time to completion of 1 day. The following morning I checked the status and the ETA had jumped to 400 days. I then looked at dmesg and it was full of read error messages from disk12 (another 4TB drive that was added to the system many months ago).
    md: disk12 read error, sector=586096
    md: disk12 read error, sector=586104
    ata7.00: exception Emask 0x0 SAct 0x0 SErr 0x0 action 0x6
    ata7.00: edma_err_cause=00000084 pp_flags=00000001, dev error, EDMA self-disable
    ata7.00: failed command: READ DMA
    ata7.00: cmd c8/00:00:c0:f1:08/00:00:00:00:00/e0 tag 0 dma 131072 in
             res 51/40:00:c0:f1:08/40:04:00:00:00/e0 Emask 0x9 (media error)
    ata7.00: status: { DRDY ERR }
    ata7.00: error: { UNC }
    ata7: hard resetting link
    ata7: SATA link up 3.0 Gbps (SStatus 123 SControl 300)
    ata7.00: configured for UDMA/133
    sd 6:0:0:0: [sdh] Unhandled sense code
    sd 6:0:0:0: [sdh] Result: hostbyte=0x00 driverbyte=0x08
    sd 6:0:0:0: [sdh] Sense Key : 0x3 [current] [descriptor]
    Descriptor sense data with sense descriptors (in hex):
            72 03 11 04 00 00 00 0c 00 0a 80 00 00 00 00 00
            00 08 f1 c0
    sd 6:0:0:0: [sdh] ASC=0x11 ASCQ=0x4
    sd 6:0:0:0: [sdh] CDB: cdb[0]=0x88: 88 00 00 00 00 00 00 08 f1 c0 00 00 01 00 00 00
    end_request: I/O error, dev sdh, sector 586176
    md: disk12 read error, sector=586112
    md: disk12 read error, sector=586120
    ata7: EH complete
    md: disk12 read error, sector=586128
    md: disk12 read error, sector=586136
    md: disk12 read error, sector=586144
    md: disk12 read error, sector=586152
    md: disk12 read error, sector=586160
I shut down the system and replaced the SATA cable and power cable for disk12, but the issue keeps coming back.
I don't know exactly what to do at this point. Here is what I have:
     • 4TB disk1 drive that needs to be rebuilt
     • Original 1.5TB disk1 drive with valid data and that is working fine
     • Valid 4TB parity drive (I guess), since I haven't started a parity check since the data rebuild
     • Spare (precleared) 4TB drive
I was tempted to put the 1.5TB back and go the "Make unRAID Trust the Parity Drive, Avoid Rebuilding Parity Unnecessarily" way, but since the configuration doesn't match because of the size mismatch I'm not sure it will work. What would you recommend? Thanks, Alphazo
PS: here is the smartctl output for disk12
smartctl -a -d ata /dev/sdh
smartctl 6.2 2013-07-26 r3841 [i686-linux-3.9.11p-unRAID] (local build) Copyright (C) 2002-13, Bruce Allen, Christian Franke, www.smartmontools.org === START OF INFORMATION SECTION === Device Model: WDC WD40EFRX-68WT0N0 Serial Number: WD-WCC4E0083446 LU WWN Device Id: 5 0014ee 2b3ba9249 Firmware Version: 80.00A80 User Capacity: 4,000,787,030,016 bytes [4.00 TB] Sector Sizes: 512 bytes logical, 4096 bytes physical Rotation Rate: 5400 rpm Device is: Not in smartctl database [for details use: -P showall] ATA Version is: ACS-2 (minor revision not indicated) SATA Version is: SATA 3.0, 6.0 Gb/s (current: 3.0 Gb/s) Local Time is: Tue Nov 25 23:12:18 2014 CET SMART support is: Available - device has SMART capability. SMART support is: Enabled === START OF READ SMART DATA SECTION === SMART overall-health self-assessment test result: PASSED General SMART Values: Offline data collection status: (0x00) Offline data collection activity was never started. Auto Offline Data Collection: Disabled. Self-test execution status: ( 0) The previous self-test routine completed without error or no self-test has ever been run. Total time to complete Offline data collection: (52080) seconds. Offline data collection capabilities: (0x7b) SMART execute Offline immediate. Auto Offline data collection on/off support. Suspend Offline collection upon new command.
Offline surface scan supported. Self-test supported. Conveyance Self-test supported. Selective Self-test supported. SMART capabilities: (0x0003) Saves SMART data before entering power-saving mode. Supports SMART auto save timer. Error logging capability: (0x01) Error logging supported. General Purpose Logging supported. Short self-test routine recommended polling time: ( 2) minutes. Extended self-test routine recommended polling time: ( 521) minutes. Conveyance self-test routine recommended polling time: ( 5) minutes. SCT capabilities: (0x703d) SCT Status supported. SCT Error Recovery Control supported. SCT Feature Control supported. SCT Data Table supported. SMART Attributes Data Structure revision number: 16 Vendor Specific SMART Attributes with Thresholds:
    ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
      1 Raw_Read_Error_Rate     0x002f   200   200   051    Pre-fail  Always       -       5737
      3 Spin_Up_Time            0x0027   177   177   021    Pre-fail  Always       -       8125
      4 Start_Stop_Count        0x0032   100   100   000    Old_age   Always       -       509
      5 Reallocated_Sector_Ct   0x0033   200   200   140    Pre-fail  Always       -       0
      7 Seek_Error_Rate         0x002e   200   200   000    Old_age   Always       -       0
      9 Power_On_Hours          0x0032   090   090   000    Old_age   Always       -       7571
     10 Spin_Retry_Count        0x0032   100   100   000    Old_age   Always       -       0
     11 Calibration_Retry_Count 0x0032   100   253   000    Old_age   Always       -       0
     12 Power_Cycle_Count       0x0032   100   100   000    Old_age   Always       -       29
    192 Power-Off_Retract_Count 0x0032   200   200   000    Old_age   Always       -       4
    193 Load_Cycle_Count        0x0032   198   198   000    Old_age   Always       -       8770
    194 Temperature_Celsius     0x0022   130   111   000    Old_age   Always       -       22
    196 Reallocated_Event_Count 0x0032   200   200   000    Old_age   Always       -       0
    197 Current_Pending_Sector  0x0032   197   197   000    Old_age   Always       -       2163
    198 Offline_Uncorrectable   0x0030   100   253   000    Old_age   Offline      -       0
    199 UDMA_CRC_Error_Count    0x0032   200   200   000    Old_age   Always       -       0
    200 Multi_Zone_Error_Rate   0x0008   100   253   000    Old_age   Offline      -       0
SMART Error Log Version: 1 ATA Error Count: 108 (device log contains only the most recent
five errors) CR = Command Register [HEX] FR = Features Register [HEX] SC = Sector Count Register [HEX] SN = Sector Number Register [HEX] CL = Cylinder Low Register [HEX] CH = Cylinder High Register [HEX] DH = Device/Head Register [HEX] DC = Device Command Register [HEX] ER = Error register [HEX] ST = Status register [HEX] Powered_Up_Time is measured from power on, and printed as DDd+hh:mm:SS.sss where DD=days, hh=hours, mm=minutes, SS=sec, and sss=millisec. It "wraps" after 49.710 days. Error 108 occurred at disk power-on lifetime: 7571 hours (315 days + 11 hours) When the command that caused the error occurred, the device was active or idle. After command completion occurred, registers were: ER ST SC SN CL CH DH -- -- -- -- -- -- -- 40 51 40 38 f9 09 e0 Error: UNC 64 sectors at LBA = 0x0009f938 = 653624 Commands leading to the command that caused the error were: CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name -- -- -- -- -- -- -- -- ---------------- -------------------- c8 00 40 38 f9 09 e0 08 00:10:51.916 READ DMA b0 d1 01 01 4f c2 00 08 00:10:51.896 SMART READ ATTRIBUTE THRESHOLDS [OBS-4] b0 d5 01 06 4f c2 00 08 00:10:51.876 SMART READ LOG b0 d0 01 00 4f c2 00 08 00:10:51.856 SMART READ DATA Error 107 occurred at disk power-on lifetime: 7571 hours (315 days + 11 hours) When the command that caused the error occurred, the device was active or idle. 
After command completion occurred, registers were: ER ST SC SN CL CH DH -- -- -- -- -- -- -- 40 51 c0 78 f8 09 e0 Error: UNC 192 sectors at LBA = 0x0009f878 = 653432 Commands leading to the command that caused the error were: CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name -- -- -- -- -- -- -- -- ---------------- -------------------- c8 00 c0 78 f8 09 e0 08 00:10:47.104 READ DMA ec 00 00 00 00 00 00 08 00:10:47.101 IDENTIFY DEVICE ec 00 00 00 00 00 a0 08 00:10:47.081 IDENTIFY DEVICE ef 03 46 00 00 00 a0 08 00:10:47.081 SET FEATURES [set transfer mode] Error 106 occurred at disk power-on lifetime: 7571 hours (315 days + 11 hours) When the command that caused the error occurred, the device was active or idle. After command completion occurred, registers were: ER ST SC SN CL CH DH -- -- -- -- -- -- -- 40 51 40 38 f0 09 e0 Error: UNC 64 sectors at LBA = 0x0009f038 = 651320 Commands leading to the command that caused the error were: CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name -- -- -- -- -- -- -- -- ---------------- -------------------- c8 00 40 38 f0 09 e0 08 00:10:33.633 READ DMA b0 d5 01 01 4f c2 00 08 00:10:33.613 SMART READ LOG ec 00 00 00 00 00 a0 08 00:10:33.593 IDENTIFY DEVICE ef 03 46 00 00 00 a0 08 00:10:33.593 SET FEATURES [set transfer mode] Error 105 occurred at disk power-on lifetime: 7571 hours (315 days + 11 hours) When the command that caused the error occurred, the device was active or idle. 
After command completion occurred, registers were: ER ST SC SN CL CH DH -- -- -- -- -- -- -- 40 51 c0 78 ef 09 e0 Error: UNC 192 sectors at LBA = 0x0009ef78 = 651128 Commands leading to the command that caused the error were: CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name -- -- -- -- -- -- -- -- ---------------- -------------------- c8 00 c0 78 ef 09 e0 08 00:10:28.858 READ DMA ec 00 00 00 00 00 a0 08 00:10:28.838 IDENTIFY DEVICE ef 03 46 00 00 00 a0 08 00:10:28.838 SET FEATURES [set transfer mode] Error 104 occurred at disk power-on lifetime: 7571 hours (315 days + 11 hours) When the command that caused the error occurred, the device was active or idle. After command completion occurred, registers were: ER ST SC SN CL CH DH -- -- -- -- -- -- -- 40 51 40 38 e7 09 e0 Error: UNC 64 sectors at LBA = 0x0009e738 = 649016 Commands leading to the command that caused the error were: CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name -- -- -- -- -- -- -- -- ---------------- -------------------- c8 00 40 38 e7 09 e0 08 00:10:15.350 READ DMA b0 d5 01 00 4f c2 00 08 00:10:15.330 SMART READ LOG ec 00 00 00 00 00 a0 08 00:10:15.310 IDENTIFY DEVICE ef 03 46 00 00 00 a0 08 00:10:15.310 SET FEATURES [set transfer mode] SMART Self-test log structure revision number 1 No self-tests have been logged. [To run self-tests, use: smartctl -t] SMART Selective self-test log data structure revision number 1 SPAN MIN_LBA MAX_LBA CURRENT_TEST_STATUS 1 0 0 Not_testing 2 0 0 Not_testing 3 0 0 Not_testing 4 0 0 Not_testing 5 0 0 Not_testing Selective self-test flags (0x0): After scanning selected spans, do NOT read-scan remainder of disk. If Selective self-test is pending on power-up, resume after 0 minute delay.
I have an old backup of super.dat that seems to have the configuration with the working 1.5TB disk1 drive. I looked at the file using a hex editor and all the serial numbers were there.
Since disk12 seems to be in bad shape, I need the parity not to be overwritten so I can restore disk12 onto a new blank drive.
[A few hours later] I went ahead and replaced my super.dat with an older backup that had the 1.5TB registered. I restarted the machine with the old working 1.5TB as disk1 and a brand new precleared 4TB as disk12. Disk12 is now reconstructing. I will investigate the status of the faulty drive later. Let's see what comes out of this rebuild.
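For anyone facing a dmesg log flooded in the same way, here is a minimal Python sketch that summarizes the `md: diskN read error, sector=...` lines per disk. The function name `summarize_md_read_errors` is made up for illustration, and the regular expression assumes the exact line format quoted in the post above; other unRAID versions may log differently.

```python
import re
from collections import defaultdict

# Matches lines such as "md: disk12 read error, sector=586096"
# (format assumed from the dmesg excerpt quoted in the post).
MD_READ_ERROR = re.compile(r"md: (disk\d+) read error, sector=(\d+)")


def summarize_md_read_errors(log_text):
    """Return {disk: (error_count, lowest_sector, highest_sector)}."""
    sectors = defaultdict(list)
    for disk, sector in MD_READ_ERROR.findall(log_text):
        sectors[disk].append(int(sector))
    return {d: (len(s), min(s), max(s)) for d, s in sectors.items()}


if __name__ == "__main__":
    sample = (
        "md: disk12 read error, sector=586096\n"
        "md: disk12 read error, sector=586104\n"
        "ata7.00: failed command: READ DMA\n"
        "md: disk12 read error, sector=586112\n"
    )
    for disk, (count, low, high) in summarize_md_read_errors(sample).items():
        print(f"{disk}: {count} read errors, sectors {low}..{high}")
```

Feeding it the saved output of `dmesg` shows at a glance whether the errors are confined to one disk and one sector range, which was the case here for disk12.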
  2. Big thanks to everybody for providing such valuable information. A pair of 4TB Red drives is on the way. I will only replace the parity drive. Funny to see that failure on parity drive is evolving (in the wrong direction) but ball is still solid green. I'm marking the thread as Solved for now. root@babylon:~# smartctl -a -A /dev/sdi smartctl 5.40 2010-10-16 r3189 [i486-slackware-linux-gnu] (local build) Copyright (C) 2002-10 by Bruce Allen, http://smartmontools.sourceforge.net === START OF INFORMATION SECTION === Model Family: Western Digital Caviar Green (Adv. Format) family Device Model: WDC WD20EARS-00MVWB0 Serial Number: WD-WCAZA4474532 Firmware Version: 51.0AB51 User Capacity: 2,000,398,934,016 bytes Device is: In smartctl database [for details use: -P show] ATA Version is: 8 ATA Standard is: Exact ATA specification draft version not indicated Local Time is: Sat Nov 23 09:45:40 2013 CET SMART support is: Available - device has SMART capability. SMART support is: Enabled === START OF READ SMART DATA SECTION === SMART overall-health self-assessment test result: PASSED General SMART Values: Offline data collection status: (0x84) Offline data collection activity was suspended by an interrupting command from host. Auto Offline Data Collection: Enabled. Self-test execution status: ( 0) The previous self-test routine completed without error or no self-test has ever been run. Total time to complete Offline data collection: (36360) seconds. Offline data collection capabilities: (0x7b) SMART execute Offline immediate. Auto Offline data collection on/off support. Suspend Offline collection upon new command. Offline surface scan supported. Self-test supported. Conveyance Self-test supported. Selective Self-test supported. SMART capabilities: (0x0003) Saves SMART data before entering power-saving mode. Supports SMART auto save timer. Error logging capability: (0x01) Error logging supported. General Purpose Logging supported. 
Short self-test routine recommended polling time: ( 2) minutes. Extended self-test routine recommended polling time: ( 255) minutes. Conveyance self-test routine recommended polling time: ( 5) minutes. SCT capabilities: (0x3035) SCT Status supported. SCT Feature Control supported. SCT Data Table supported. SMART Attributes Data Structure revision number: 16 Vendor Specific SMART Attributes with Thresholds:
    ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
      1 Raw_Read_Error_Rate     0x002f   171   171   051    Pre-fail  Always       -       105420
      3 Spin_Up_Time            0x0027   167   164   021    Pre-fail  Always       -       6650
      4 Start_Stop_Count        0x0032   099   099   000    Old_age   Always       -       1131
      5 Reallocated_Sector_Ct   0x0033   200   200   140    Pre-fail  Always       -       0
      7 Seek_Error_Rate         0x002e   200   200   000    Old_age   Always       -       0
      9 Power_On_Hours          0x0032   072   072   000    Old_age   Always       -       20911
     10 Spin_Retry_Count        0x0032   100   100   000    Old_age   Always       -       0
     11 Calibration_Retry_Count 0x0032   100   100   000    Old_age   Always       -       0
     12 Power_Cycle_Count       0x0032   100   100   000    Old_age   Always       -       140
    192 Power-Off_Retract_Count 0x0032   200   200   000    Old_age   Always       -       84
    193 Load_Cycle_Count        0x0032   152   152   000    Old_age   Always       -       146244
    194 Temperature_Celsius     0x0022   127   110   000    Old_age   Always       -       23
    196 Reallocated_Event_Count 0x0032   200   200   000    Old_age   Always       -       0
    197 Current_Pending_Sector  0x0032   196   196   000    Old_age   Always       -       1419
    198 Offline_Uncorrectable   0x0030   200   197   000    Old_age   Offline      -       30
    199 UDMA_CRC_Error_Count    0x0032   200   200   000    Old_age   Always       -       0
    200 Multi_Zone_Error_Rate   0x0008   001   001   000    Old_age   Offline      -       148883
SMART Error Log Version: 1 ATA Error Count: 2 CR = Command Register [HEX] FR = Features Register [HEX] SC = Sector Count Register [HEX] SN = Sector Number Register [HEX] CL = Cylinder Low Register [HEX] CH = Cylinder High Register [HEX] DH = Device/Head Register [HEX] DC = Device Command Register [HEX] ER = Error register [HEX] ST = Status register [HEX] Powered_Up_Time is measured from power on, and printed as DDd+hh:mm:SS.sss where
DD=days, hh=hours, mm=minutes, SS=sec, and sss=millisec. It "wraps" after 49.710 days. Error 2 occurred at disk power-on lifetime: 20903 hours (870 days + 23 hours) When the command that caused the error occurred, the device was active or idle. After command completion occurred, registers were: ER ST SC SN CL CH DH -- -- -- -- -- -- -- 40 51 00 18 d6 02 ef Error: UNC at LBA = 0x0f02d618 = 251844120 Commands leading to the command that caused the error were: CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name -- -- -- -- -- -- -- -- ---------------- -------------------- c8 00 00 70 d5 02 ef 08 40d+16:29:59.194 READ DMA c8 00 00 70 cc 02 ef 08 40d+16:29:58.251 READ DMA Error 1 occurred at disk power-on lifetime: 20901 hours (870 days + 21 hours) When the command that caused the error occurred, the device was active or idle. After command completion occurred, registers were: ER ST SC SN CL CH DH -- -- -- -- -- -- -- 40 51 00 88 4b b5 e7 Error: UNC at LBA = 0x07b54b88 = 129321864 Commands leading to the command that caused the error were: CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name -- -- -- -- -- -- -- -- ---------------- -------------------- c8 00 00 20 4b b5 e7 08 40d+14:40:35.048 READ DMA c8 00 00 20 42 b5 e7 08 40d+14:40:32.954 READ DMA SMART Self-test log structure revision number 1 Num Test_Description Status Remaining LifeTime(hours) LBA_of_first_error # 1 Short offline Completed without error 00% 20899 - SMART Selective self-test log data structure revision number 1 SPAN MIN_LBA MAX_LBA CURRENT_TEST_STATUS 1 0 0 Not_testing 2 0 0 Not_testing 3 0 0 Not_testing 4 0 0 Not_testing 5 0 0 Not_testing Selective self-test flags (0x0): After scanning selected spans, do NOT read-scan remainder of disk. If Selective self-test is pending on power-up, resume after 0 minute delay.
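To see how the parity drive is "evolving" between two smartctl runs, the raw values of two saved reports can be compared. Below is a minimal Python sketch, assuming each attribute row sits on its own line as in native `smartctl -A` output (not flattened as in the quoted posts); `raw_values` and `changed_attributes` are hypothetical helper names.

```python
import re

# One attribute row of `smartctl -A` output: ID, name, flag, value, worst,
# threshold, type, updated, when_failed, then the leading integer of the raw value.
ATTR_ROW = re.compile(
    r"^\s*(\d+)\s+(\S+)\s+0x[0-9a-fA-F]{4}\s+\d+\s+\d+\s+\d+"
    r"\s+\S+\s+\S+\s+\S+\s+(\d+)",
    re.MULTILINE,
)


def raw_values(smart_text):
    """Map attribute name -> raw value (as int) for every row found."""
    return {name: int(raw) for _id, name, raw in ATTR_ROW.findall(smart_text)}


def changed_attributes(before, after):
    """Return {name: (old_raw, new_raw)} for attributes whose raw value changed."""
    old, new = raw_values(before), raw_values(after)
    return {n: (old[n], new[n]) for n in old if n in new and old[n] != new[n]}


if __name__ == "__main__":
    # Two rows taken from the parity drive reports of Nov 22 and Nov 23.
    nov22 = (
        "  1 Raw_Read_Error_Rate     0x002f   198   198   051    Pre-fail  Always       -       9667\n"
        "197 Current_Pending_Sector  0x0032   196   196   000    Old_age   Always       -       1419\n"
    )
    nov23 = (
        "  1 Raw_Read_Error_Rate     0x002f   171   171   051    Pre-fail  Always       -       105420\n"
        "197 Current_Pending_Sector  0x0032   196   196   000    Old_age   Always       -       1419\n"
    )
    for name, (old, new) in changed_attributes(nov22, nov23).items():
        print(f"{name}: {old} -> {new}")
```

Only the raw column is compared here; the normalized VALUE/WORST columns are ignored on purpose to keep the sketch short.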
  3. So don't the following errors indicate a drive going bad?
      1 Raw_Read_Error_Rate     0x000f   102   099   006    Pre-fail  Always       -       291896
      7 Seek_Error_Rate         0x000f   073   060   030    Pre-fail  Always       -       21171801
  4. Hello, I was doing a random check on my unRAID server and noticed 4306 errors for the parity drive (but the ball is still green). I then ran smartctl on all the drives and also found a high number of errors on one of the data disks. I'm going to buy two new drives (and switch parity to 3TB). Which drive do you recommend I swap first (parity or disk11)? Should I run a parity check before? Thanks
PARITY DRIVE
root@babylon:~# smartctl -a -A /dev/sdi
smartctl 5.40 2010-10-16 r3189 [i486-slackware-linux-gnu] (local build) Copyright (C) 2002-10 by Bruce Allen, http://smartmontools.sourceforge.net === START OF INFORMATION SECTION === Model Family: Western Digital Caviar Green (Adv. Format) family Device Model: WDC WD20EARS-00MVWB0 Serial Number: WD-WCAZA4474532 Firmware Version: 51.0AB51 User Capacity: 2,000,398,934,016 bytes Device is: In smartctl database [for details use: -P show] ATA Version is: 8 ATA Standard is: Exact ATA specification draft version not indicated Local Time is: Fri Nov 22 22:25:13 2013 CET SMART support is: Available - device has SMART capability. SMART support is: Enabled === START OF READ SMART DATA SECTION === SMART overall-health self-assessment test result: PASSED General SMART Values: Offline data collection status: (0x82) Offline data collection activity was completed without error. Auto Offline Data Collection: Enabled. Self-test execution status: ( 0) The previous self-test routine completed without error or no self-test has ever been run. Total time to complete Offline data collection: (36360) seconds. Offline data collection capabilities: (0x7b) SMART execute Offline immediate. Auto Offline data collection on/off support. Suspend Offline collection upon new command. Offline surface scan supported. Self-test supported. Conveyance Self-test supported. Selective Self-test supported. SMART capabilities: (0x0003) Saves SMART data before entering power-saving mode. Supports SMART auto save timer.
Error logging capability: (0x01) Error logging supported. General Purpose Logging supported. Short self-test routine recommended polling time: ( 2) minutes. Extended self-test routine recommended polling time: ( 255) minutes. Conveyance self-test routine recommended polling time: ( 5) minutes. SCT capabilities: (0x3035) SCT Status supported. SCT Feature Control supported. SCT Data Table supported. SMART Attributes Data Structure revision number: 16 Vendor Specific SMART Attributes with Thresholds:
    ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
      1 Raw_Read_Error_Rate     0x002f   198   198   051    Pre-fail  Always       -       9667
      3 Spin_Up_Time            0x0027   167   164   021    Pre-fail  Always       -       6650
      4 Start_Stop_Count        0x0032   099   099   000    Old_age   Always       -       1131
      5 Reallocated_Sector_Ct   0x0033   200   200   140    Pre-fail  Always       -       0
      7 Seek_Error_Rate         0x002e   200   200   000    Old_age   Always       -       0
      9 Power_On_Hours          0x0032   072   072   000    Old_age   Always       -       20899
     10 Spin_Retry_Count        0x0032   100   100   000    Old_age   Always       -       0
     11 Calibration_Retry_Count 0x0032   100   100   000    Old_age   Always       -       0
     12 Power_Cycle_Count       0x0032   100   100   000    Old_age   Always       -       140
    192 Power-Off_Retract_Count 0x0032   200   200   000    Old_age   Always       -       84
    193 Load_Cycle_Count        0x0032   152   152   000    Old_age   Always       -       146232
    194 Temperature_Celsius     0x0022   129   110   000    Old_age   Always       -       21
    196 Reallocated_Event_Count 0x0032   200   200   000    Old_age   Always       -       0
    197 Current_Pending_Sector  0x0032   196   196   000    Old_age   Always       -       1419
    198 Offline_Uncorrectable   0x0030   200   197   000    Old_age   Offline      -       30
    199 UDMA_CRC_Error_Count    0x0032   200   200   000    Old_age   Always       -       0
    200 Multi_Zone_Error_Rate   0x0008   001   001   000    Old_age   Offline      -       148883
SMART Error Log Version: 1 No Errors Logged SMART Self-test log structure revision number 1 Num Test_Description Status Remaining LifeTime(hours) LBA_of_first_error # 1 Short offline Completed without error 00% 20899 - SMART Selective self-test log data structure revision number 1 SPAN MIN_LBA MAX_LBA CURRENT_TEST_STATUS 1 0 0 Not_testing 2 0 0
Not_testing 3 0 0 Not_testing 4 0 0 Not_testing 5 0 0 Not_testing Selective self-test flags (0x0): After scanning selected spans, do NOT read-scan remainder of disk. If Selective self-test is pending on power-up, resume after 0 minute delay. DISK11 Offline data collection capabilities: (0x7b) SMART execute Offline immediate. Auto Offline data collection on/off support. Suspend Offline collection upon new command. Offline surface scan supported. Self-test supported. Conveyance Self-test supported. Selective Self-test supported. SMART capabilities: (0x0003) Saves SMART data before entering power-saving mode. Supports SMART auto save timer. Error logging capability: (0x01) Error logging supported. General Purpose Logging supported. Short self-test routine recommended polling time: ( 1) minutes. Extended self-test routine recommended polling time: ( 255) minutes. Conveyance self-test routine recommended polling time: ( 2) minutes. SCT capabilities: (0x30b7) SCT Status supported. SCT Feature Control supported. SCT Data Table supported. 
SMART Attributes Data Structure revision number: 10 Vendor Specific SMART Attributes with Thresholds:
    ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
      1 Raw_Read_Error_Rate     0x000f   102   099   006    Pre-fail  Always       -       291896
      3 Spin_Up_Time            0x0003   093   093   000    Pre-fail  Always       -       0
      4 Start_Stop_Count        0x0032   100   100   020    Old_age   Always       -       760
      5 Reallocated_Sector_Ct   0x0033   100   100   036    Pre-fail  Always       -       0
      7 Seek_Error_Rate         0x000f   073   060   030    Pre-fail  Always       -       21171801
      9 Power_On_Hours          0x0032   028   028   000    Old_age   Always       -       63694
     10 Spin_Retry_Count        0x0013   100   100   097    Pre-fail  Always       -       0
     12 Power_Cycle_Count       0x0032   100   100   020    Old_age   Always       -       30
    183 Runtime_Bad_Block       0x0032   100   100   000    Old_age   Always       -       0
    184 End-to-End_Error        0x0032   100   100   099    Old_age   Always       -       0
    187 Reported_Uncorrect      0x0032   100   100   000    Old_age   Always       -       0
    188 Command_Timeout         0x0032   100   100   000    Old_age   Always       -       0
    189 High_Fly_Writes         0x003a   100   100   000    Old_age   Always       -       0
    190 Airflow_Temperature_Cel 0x0022   079   058   045    Old_age   Always       -       21 (Min/Max 18/31)
    191 G-Sense_Error_Rate      0x0032   100   100   000    Old_age   Always       -       0
    192 Power-Off_Retract_Count 0x0032   100   100   000    Old_age   Always       -       6
    193 Load_Cycle_Count        0x0032   100   100   000    Old_age   Always       -       760
    194 Temperature_Celsius     0x0022   021   042   000    Old_age   Always       -       21 (0 16 0 0)
    195 Hardware_ECC_Recovered  0x001a   037   011   000    Old_age   Always       -       291896
    197 Current_Pending_Sector  0x0012   100   100   000    Old_age   Always       -       0
    198 Offline_Uncorrectable   0x0010   100   100   000    Old_age   Offline      -       0
    199 UDMA_CRC_Error_Count    0x003e   200   200   000    Old_age   Always       -       0
    240 Head_Flying_Hours       0x0000   100   253   000    Old_age   Offline      -       26164940768955
    241 Total_LBAs_Written      0x0000   100   253   000    Old_age   Offline      -       678912054
    242 Total_LBAs_Read         0x0000   100   253   000    Old_age   Offline      -       1438427421
SMART Error Log Version: 1 No Errors Logged SMART Self-test log structure revision number 1 No self-tests have been logged.
[To run self-tests, use: smartctl -t] SMART Selective self-test log data structure revision number 1 SPAN MIN_LBA MAX_LBA CURRENT_TEST_STATUS 1 0 0 Not_testing 2 0 0 Not_testing 3 0 0 Not_testing 4 0 0 Not_testing 5 0 0 Not_testing Selective self-test flags (0x0): After scanning selected spans, do NOT read-scan remainder of disk. If Selective self-test is pending on power-up, resume after 0 minute delay.
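Since the status ball stays green while the SMART report already shows trouble, a quick check of a few raw values can help when scanning many drives. Here is a minimal Python sketch. The choice of watched attributes (Reallocated_Sector_Ct, Current_Pending_Sector, Offline_Uncorrectable) and the "any non-zero raw value" rule are my own assumptions, based on a commonly used rule of thumb rather than on anything unRAID or smartmontools prescribes, and `flag_suspect_attributes` is a hypothetical name.

```python
# Attributes whose non-zero raw value is treated as a warning sign.
# This watch list is an assumption (a common heuristic), not an official rule.
WATCHED = {"Reallocated_Sector_Ct", "Current_Pending_Sector", "Offline_Uncorrectable"}


def flag_suspect_attributes(smart_text):
    """Return {attribute_name: raw_value} for watched attributes with raw > 0.

    Expects one attribute row per line, as printed by `smartctl -A`.
    """
    flagged = {}
    for line in smart_text.splitlines():
        parts = line.split()
        # An attribute row has at least 10 columns: an ID, then the name,
        # then a hexadecimal flag; the raw value is the 10th column.
        if len(parts) < 10 or not parts[0].isdigit() or not parts[2].startswith("0x"):
            continue
        name, raw = parts[1], parts[9]
        if name in WATCHED and raw.isdigit() and int(raw) > 0:
            flagged[name] = int(raw)
    return flagged


if __name__ == "__main__":
    # Rows taken from the parity drive report above.
    parity = (
        "  5 Reallocated_Sector_Ct   0x0033   200   200   140    Pre-fail  Always       -       0\n"
        "197 Current_Pending_Sector  0x0032   196   196   000    Old_age   Always       -       1419\n"
        "198 Offline_Uncorrectable   0x0030   200   197   000    Old_age   Offline      -       30\n"
    )
    print(flag_suspect_attributes(parity))
```

With the rows quoted above, the parity drive is flagged for pending and uncorrectable sectors, while the same check on the disk11 rows (all three raw values are 0) returns nothing.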
  5. From day one, this particular user share was set up to include only one physical drive and exclude the others. This has never been touched. I also checked each of the /mnt/diskX drives but couldn't find any trace of the missing files.
  6. I don't think it is related to rsync as my friend who has the same issue doesn't use rsync at all. It has many empty directories as well with lots of vanished files. In my case I have been using the same rsync script for many years using a source that still has the files. I also closely monitor rsync output to check what has been uploaded and deleted. FYI, I just ran a hashdeep between the known good source and the damaged user share and found about 3255 missing files out of 61565.
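For those who don't have hashdeep at hand, a rough equivalent of the "what is missing" part can be sketched in Python. This is only a presence check by relative path, an assumption on my side to keep it short: unlike hashdeep it does not hash file contents, so it will not detect corrupted files. The names `relative_files` and `missing_files` are made up for illustration.

```python
import os


def relative_files(root):
    """Return the set of file paths under root, relative to root."""
    found = set()
    for dirpath, _dirnames, filenames in os.walk(root):
        for name in filenames:
            full = os.path.join(dirpath, name)
            found.add(os.path.relpath(full, root))
    return found


def missing_files(source_root, share_root):
    """List files present under source_root but absent under share_root."""
    return sorted(relative_files(source_root) - relative_files(share_root))


if __name__ == "__main__":
    import sys

    # Usage: python missing.py <known-good source> <user share>
    if len(sys.argv) == 3:
        for path in missing_files(sys.argv[1], sys.argv[2]):
            print(path)
```

Running it with the known good source as the first argument and the user share as the second prints one missing file per line.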
  7. Hi, I have been running 5.0betaXX for many years and decided to switch to the release version. I started from a fresh installation disk using my old disk configuration. Before switching over to the new version I noticed that all my disks were 100% full. I found out that my backup script went crazy and filled up all my free space. I cleaned it up, ran a parity check and upgraded. Everything went very smoothly. Recently I decided to perform my regular rsync task from my external HDD to unRAID (photos & videos) and noticed that old files were uploading even though I hadn't touched them for ages. I interrupted the process, looked at the user share and found that I have more than twenty empty directories. I also looked at the physical drive (/mnt/user/disk1/PHOTOS) and found the same empty directories. For information, this particular share only uses one physical disk. I also don't use any cache drive. I have been able to sync the missing files but now I'm very worried about the other user shares. Have I lost other files? Could this be linked to the 100% usage prior to the upgrade to the 5.0 release? If not, how can I investigate it? I talked to a friend of mine who is running the same setup; he has experienced the exact same problem and his unRAID is about 99% full.
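One way to check the other user shares for the same symptom is to list every directory that contains neither files nor subdirectories. A minimal Python sketch follows; `empty_directories` is a hypothetical helper, and whether an empty directory really means lost files is an assumption that has to be verified against a backup.

```python
import os


def empty_directories(root):
    """Return a sorted list of directories under root with no entries at all.

    The root itself is included if it is empty.
    """
    empty = []
    for dirpath, dirnames, filenames in os.walk(root):
        if not dirnames and not filenames:
            empty.append(dirpath)
    return sorted(empty)


if __name__ == "__main__":
    import sys

    # Usage: python empty_dirs.py /mnt/user/<share>
    if len(sys.argv) == 2:
        for path in empty_directories(sys.argv[1]):
            print(path)
```

Directories that only contain other empty directories are not reported themselves; only the innermost empty ones are listed.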
  8. I wanted to share my notes from migrating to 5.0 Release (from 5.0beta14). Most of the tips and tweaks come from the forum, but I wanted a convenient place to share them with my friends who are going to update soon. You can read it here: http://is.gd/xFyGd8
  9. I just moved from a Chenbro ES34069 with 4 drives to Norco RPC-450B based hardware with 10 drives (max 15 drives). I wanted a device that could hold more drives with good air flow. I don't really care about hot-swap drive bays (too expensive and unRAID doesn't support hot-swap today) but also didn't want to spend too much time building my own drive bays. Furthermore I wanted something both heavy and not eye-catching so a potential "visitor" would be less tempted to take it. As this beast is going in my garage, noise is less of an issue. I looked around and couldn't find any nice industrial chassis that would fit many drives out of the box. Then I found the Norco RPC-450B, which holds 10 drives without screws and has two front 120mm fans. It even has room for additional drives as there is an empty 3x5.25" bay. Priced at $78, this sounded like too good a deal. Well, I did pay that amount, but living in Europe I also paid more than $100 for shipping, and guess what: the parcel got returned to the US the first time, so I ended up paying even more for a Fedex delivery. Assembly went well; build quality is OK but nothing like a Lian Li case. The first things I replaced were the front fans (way too loud even for a garage). The result is acceptable, but due to the two grids and the dust filter in front of each of them, the case is not quiet. Performance is up compared to the previous Atom based platform, but mostly on write speed. I guess at the moment read speed is limited by the client PC. Last thing, I do recommend those metal grip SATA cables. They make your life much easier (and safer) when playing around with drives and SATA controllers.
Norco RPC-450B (10x3.5" drive bays with screwless rails and 3x5.25" drive bay)
SuperMicro C2SEA motherboard
Intel Celeron 430
Stock heatsink (I initially planned for a full Ninja2 but it was too big for my taste and the 35W Celeron is cool)
Kingston HyperX KHX1333C7D3K2, 2x1GB memory
Corsair TX650W power supply
Adaptec 1430SA 4-port SATA controller card
2x Enermax UCMA-8 100080-101, 34CFM, 21dbA fans (rear)
2x Scythe SY1225SL12M, 68CFM, 24dbA fans (front)
10x Metal grip 50cm SATA cables http://www.pc-look.com/boutik/Prod_Sharkoon_SATA-Cable-Metal-Grip-050-cm-Red-__18998_en.html
2x SATA Power Supply Splitter http://www.pc-look.com/boutik/Prod_InLine_SATA-Power-Supply-Splitter-015-cm__37572_en.html
2x 500GB WD Caviar Green HDD
1x 500GB Samsung HD501LJ HDD
5x 1.5TB Samsung EcoGreen HDD
2x 1.5TB WD Caviar Green HDD
Pictures: Overview; Screwless drive bay; Random pictures
Full photo gallery here: http://bit.ly/cCxpMP