skyhawk

Members
  • Posts

    88
  • Joined

  • Last visited

Everything posted by skyhawk

  1. Success. Thanks. Physically unplugging it allowed it to mount. Files in tact. Time to add another backup cache. Wish I could have disabled the other drive completely through the gui but oh well
  2. forgot to mention, there are a few files on cache that id really like to restore. i just reinstalled win10 on my laptop, so i was using cache as a temp storage for a few things (nothing critical). just would be convenient to be able to get them from the cache
  3. System froze the other night. seems there is an issue with the Sabnzbd docker and when i try to kill the process or anything, everything freezes. Performed a hard reset. Rebooted and got 2 emails saying that my cache had an error - Current pending sector (2) Array started and cache is unmountable. I have 2 disks with my thought being that one would be a copy. So, I tried to unmount both cache and try mounting them using Unassigned Devices. No joy. Tried to mount only cache2 and start the array, thinking that its a copy. No joy. Cache 1 -- see this at the end (mount: wrong fs type, bad option, bad superblock on /dev/sde1) Jul 15 22:34:29 Tower kernel: ata5: SATA max UDMA/133 abar m2048@0xf7e16000 port 0xf7e16100 irq 26 Jul 15 22:34:29 Tower kernel: ata5: SATA link up 3.0 Gbps (SStatus 123 SControl 300) Jul 15 22:34:29 Tower kernel: ata5.00: ATA-8: Hitachi HDP725032GLA360, GEA330RC0RR62G, GM3OA52A, max UDMA/133 Jul 15 22:34:29 Tower kernel: ata5.00: 625142448 sectors, multi 16: LBA48 NCQ (depth 31/32), AA Jul 15 22:34:29 Tower kernel: ata5.00: configured for UDMA/133 Jul 15 22:34:29 Tower kernel: sd 5:0:0:0: [sde] 625142448 512-byte logical blocks: (320 GB/298 GiB) Jul 15 22:34:29 Tower kernel: sd 5:0:0:0: [sde] Write Protect is off Jul 15 22:34:29 Tower kernel: sd 5:0:0:0: [sde] Mode Sense: 00 3a 00 00 Jul 15 22:34:29 Tower kernel: sd 5:0:0:0: [sde] Write cache: enabled, read cache: enabled, doesn't support DPO or FUA Jul 15 22:34:29 Tower kernel: sde: sde1 Jul 15 22:34:29 Tower kernel: sd 5:0:0:0: [sde] Attached SCSI disk Jul 15 22:34:35 Tower emhttp: Hitachi_HDP725032GLA360_GEA330RC0RR62G (sde) 312571224 Jul 15 22:34:35 Tower emhttp: import 9 cache device: sde Jul 15 22:34:36 Tower emhttp: shcmd (6): /usr/sbin/hdparm -S0 /dev/sde &> /dev/null Jul 15 22:34:37 Tower emhttp: Hitachi_HDP725032GLA360_GEA330RC0RR62G (sde) 312571224 Jul 15 22:34:37 Tower emhttp: import 9 cache device: sde Jul 15 22:36:23 Tower emhttp: Hitachi_HDP725032GLA360_GEA330RC0RR62G (sde) 312571224 Jul 15 22:36:23 Tower emhttp: import 9 cache device: sde Jul 15 22:36:50 Tower emhttp: Hitachi_HDP725032GLA360_GEA330RC0RR62G (sde) 312571224 Jul 15 22:36:50 Tower emhttp: import 9 cache device: sde Jul 15 22:37:05 Tower emhttp: Hitachi_HDP725032GLA360_GEA330RC0RR62G (sde) 312571224 Jul 15 22:37:05 Tower emhttp: import 9 cache device: sde Jul 15 22:37:06 Tower emhttp: shcmd (30): /usr/sbin/hdparm -S0 /dev/sde &> /dev/null Jul 15 22:37:07 Tower kernel: BTRFS: device fsid 322fda74-7cc7-48c5-bc5b-c1a92e375fa0 devid 1 transid 965735 /dev/sde1 Jul 15 22:37:08 Tower logger: mount: wrong fs type, bad option, bad superblock on /dev/sde1, Cache2 ErrorWarningSystemArray Jul 15 22:34:29 Tower kernel: ata9: SATA max UDMA/133 abar m2048@0xf7e16000 port 0xf7e16300 irq 26 Jul 15 22:34:29 Tower kernel: ata9: SATA link up 3.0 Gbps (SStatus 123 SControl 300) Jul 15 22:34:29 Tower kernel: ata9.00: ATA-7: Hitachi HDT725040VLA360, VFH301R3CWGRXH, V5COA7BA, max UDMA/133 Jul 15 22:34:29 Tower kernel: ata9.00: 781422768 sectors, multi 16: LBA48 NCQ (depth 31/32), AA Jul 15 22:34:29 Tower kernel: ata9.00: configured for UDMA/133 Jul 15 22:34:29 Tower kernel: sd 9:0:0:0: [sdi] 781422768 512-byte logical blocks: (400 GB/373 GiB) Jul 15 22:34:29 Tower kernel: sd 9:0:0:0: [sdi] Write Protect is off Jul 15 22:34:29 Tower kernel: sd 9:0:0:0: [sdi] Mode Sense: 00 3a 00 00 Jul 15 22:34:29 Tower kernel: sd 9:0:0:0: [sdi] Write cache: enabled, read cache: enabled, doesn't support DPO or FUA Jul 15 22:34:29 Tower kernel: sdi: sdi1 Jul 15 22:34:29 Tower kernel: sd 9:0:0:0: [sdi] Attached SCSI disk Jul 15 22:34:35 Tower emhttp: Hitachi_HDT725040VLA360_VFH301R3CWGRXH (sdi) 390711384 Jul 15 22:34:35 Tower emhttp: import 10 cache device: sdi Jul 15 22:34:36 Tower emhttp: shcmd (7): /usr/sbin/hdparm -S0 /dev/sdi &> /dev/null Jul 15 22:34:37 Tower emhttp: Hitachi_HDT725040VLA360_VFH301R3CWGRXH (sdi) 390711384 Jul 15 22:34:37 Tower emhttp: import 10 cache device: sdi Jul 15 22:36:23 Tower emhttp: Hitachi_HDT725040VLA360_VFH301R3CWGRXH (sdi) 390711384 Jul 15 22:36:23 Tower emhttp: import 10 cache device: sdi Jul 15 22:36:50 Tower emhttp: Hitachi_HDT725040VLA360_VFH301R3CWGRXH (sdi) 390711384 Jul 15 22:36:50 Tower emhttp: import 10 cache device: sdi Jul 15 22:37:05 Tower emhttp: Hitachi_HDT725040VLA360_VFH301R3CWGRXH (sdi) 390711384 Jul 15 22:37:05 Tower emhttp: import 10 cache device: sdi Jul 15 22:37:06 Tower emhttp: shcmd (31): /usr/sbin/hdparm -S0 /dev/sdi &> /dev/null Jul 15 22:37:07 Tower kernel: BTRFS: device fsid 322fda74-7cc7-48c5-bc5b-c1a92e375fa0 devid 2 transid 993218 /dev/sdi1 Jul 15 22:37:08 Tower kernel: BTRFS info (device sdi1): disk space caching is enabled Jul 15 22:37:08 Tower kernel: BTRFS: failed to read the system array on sdi1 Syslog.txt
  4. Regarding the initial issue, I had the same problem last week. Ended up being a bad electric connector to the drive. Never had one fail before. I had swapped sata connectors, sata ports etc, and then my backup drive had the same failure. So, if smart is good but you are getting disabled, check your other hardware.
  5. Frys will have a 5TB WD for $98 tomorrow. NEed to sign up for their emails to get your unique code. Free shipping.. Yes.. run a whole lotta preclears.
  6. funny.. SAME thing happened to me tonight. I was only using the EXCLUDE function. Updated to use BOTH Include and Exclude and it seems to be working now (though i still have a bunch of files in queue). I didnt stop/start the array but that was my next step. I updated to the most recent version of unraid about 1 hour earlier, so its possible there is an issue in the OS with this. Edit.. nevermind..seems its all copying to cache..for whatever reason.
  7. I'll run a few more preclears, but will it possible fix the reallocated sectors... Or just confirm its not worse than that? Or I'm against the timeline for returning to Amazon, so should I just return it and call it a day... When I got the drive, the first preclears took almost 3 days. I had limited time so only got one done.
  8. Had an issue with my server and a bad power plug on a hard drive. This caused the drive to go offline and caused me a ton of frustration until I figured out that a 25 cent part was the source of my problems. Shortly thereafter or during the issues, the drive popped up with 240 reallocated sectors. This is a new drive (30 days) and I could possibly still return it for an exchange. Thoughts? It originally passed 1 preclear successfully and then I loaded it with about 1.5TB and had no issues for a week or two. Smart report below. smartctl 6.2 2013-07-26 r3841 [x86_64-linux-4.0.4-unRAID] (local build) Copyright (C) 2002-13, Bruce Allen, Christian Franke, www.smartmontools.org === START OF INFORMATION SECTION === Device Model: TOSHIBA MD04ACA500 Serial Number: 9549K5XIFS9A LU WWN Device Id: 5 000039 68bb00711 Firmware Version: FP2A User Capacity: 5,000,981,078,016 bytes [5.00 TB] Sector Sizes: 512 bytes logical, 4096 bytes physical Rotation Rate: 7200 rpm Device is: Not in smartctl database [for details use: -P showall] ATA Version is: ATA8-ACS (minor revision not indicated) SATA Version is: SATA 3.0, 6.0 Gb/s (current: 1.5 Gb/s) Local Time is: Wed Dec 2 23:26:27 2015 EST SMART support is: Available - device has SMART capability. SMART support is: Enabled === START OF READ SMART DATA SECTION === SMART overall-health self-assessment test result: PASSED General SMART Values: Offline data collection status: (0x82) Offline data collection activity was completed without error. Auto Offline Data Collection: Enabled. Self-test execution status: ( 0) The previous self-test routine completed without error or no self-test has ever been run. Total time to complete Offline data collection: ( 120) seconds. Offline data collection capabilities: (0x5b) SMART execute Offline immediate. Auto Offline data collection on/off support. Suspend Offline collection upon new command. Offline surface scan supported. Self-test supported. No Conveyance Self-test supported. Selective Self-test supported. SMART capabilities: (0x0003) Saves SMART data before entering power-saving mode. Supports SMART auto save timer. Error logging capability: (0x01) Error logging supported. General Purpose Logging supported. Short self-test routine recommended polling time: ( 2) minutes. Extended self-test routine recommended polling time: ( 552) minutes. SCT capabilities: (0x003d) SCT Status supported. SCT Error Recovery Control supported. SCT Feature Control supported. SCT Data Table supported. SMART Attributes Data Structure revision number: 16 Vendor Specific SMART Attributes with Thresholds: ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE 1 Raw_Read_Error_Rate 0x000b 100 100 050 Pre-fail Always - 0 2 Throughput_Performance 0x0005 100 100 050 Pre-fail Offline - 0 3 Spin_Up_Time 0x0027 100 100 001 Pre-fail Always - 8835 4 Start_Stop_Count 0x0032 100 100 000 Old_age Always - 54 5 Reallocated_Sector_Ct 0x0033 100 100 050 Pre-fail Always - 240 7 Seek_Error_Rate 0x000b 100 100 050 Pre-fail Always - 0 8 Seek_Time_Performance 0x0005 100 100 050 Pre-fail Offline - 0 9 Power_On_Hours 0x0032 099 099 000 Old_age Always - 412 10 Spin_Retry_Count 0x0033 101 100 030 Pre-fail Always - 0 12 Power_Cycle_Count 0x0032 100 100 000 Old_age Always - 23 191 G-Sense_Error_Rate 0x0032 100 100 000 Old_age Always - 49 192 Power-Off_Retract_Count 0x0032 100 100 000 Old_age Always - 17 193 Load_Cycle_Count 0x0032 100 100 000 Old_age Always - 114 194 Temperature_Celsius 0x0022 100 100 000 Old_age Always - 28 (Min/Max 19/41) 196 Reallocated_Event_Count 0x0032 100 100 000 Old_age Always - 27 197 Current_Pending_Sector 0x0032 100 100 000 Old_age Always - 0 198 Offline_Uncorrectable 0x0030 100 100 000 Old_age Offline - 0 199 UDMA_CRC_Error_Count 0x0032 200 200 000 Old_age Always - 0 220 Disk_Shift 0x0002 100 100 000 Old_age Always - 0 222 Loaded_Hours 0x0032 100 100 000 Old_age Always - 240 223 Load_Retry_Count 0x0032 100 100 000 Old_age Always - 0 224 Load_Friction 0x0022 100 100 000 Old_age Always - 0 226 Load-in_Time 0x0026 100 100 000 Old_age Always - 206 240 Head_Flying_Hours 0x0001 100 100 001 Pre-fail Offline - 0 SMART Error Log Version: 1 No Errors Logged SMART Self-test log structure revision number 1 Num Test_Description Status Remaining LifeTime(hours) LBA_of_first_error # 1 Short offline Completed without error 00% 411 - # 2 Extended offline Aborted by host 70% 409 - # 3 Short offline Completed without error 00% 218 - SMART Selective self-test log data structure revision number 1 SPAN MIN_LBA MAX_LBA CURRENT_TEST_STATUS 1 0 0 Not_testing 2 0 0 Not_testing 3 0 0 Not_testing 4 0 0 Not_testing 5 0 0 Not_testing Selective self-test flags (0x0): After scanning selected spans, do NOT read-scan remainder of disk. If Selective self-test is pending on power-up, resume after 0 minute delay.
  9. I had an almost identical error for the past few weeks. Ended up being a bad power connector. I had replaced the disk, sata cable and port. Drive came right back on after I replaced power. Rebuild on 5tb took 24 hours.
  10. Since this seems to be a pain in the butt to fix, I'm going with the following solution unless other advice is offered today. Backing up my 4.5 tb of data to a new 5tb external. Then I'm just going to force the invalid drive back to working. If I lose anything I have it all backed up. I'll then preclear the drives with issues and see how they are working and if the errors clear. Also buying a new 4 port sata card since I'm convinced one port is bad. After, I'll keep the new 5tb as a hot spare since I see the value in that now. I always have drives sitting around but didn't remember you can't replace a problem drive with a smaller one. Suggestions welcomes on sata card. Leaning towards this.SYBA SI-PEX40064 PCI-Express 2.0 Low Profile Ready SATA III (6.0 Gb/s) Controller Card. $22 at newegg. Hopefully I see faster speeds since my current card is sata i.
  11. Best buy has a 5tb external wd for $120. Same price point and better brand imo
  12. Can you send me the link to who had it at that price?... Assuming if they have more. I have a standard asus mobo...p8b75-m/csm ... Will this type card work? Need to replace a broken 4 port sata card. Sorry if a dumb question but never dealt with these before and would much rather have more drive capability if I'm replacing anyways. Thx.
  13. I had this issue last year.. Took the advice of several members and aren't $130 on an asus router. They have cheaper models but i spent more to get other features. The vpn setup took 5 mins and works flawlessly. They have an openvpn option. When I want to connect on my phone, I just open the openvpn app and connect and I'm now on my home network. I can access my server, see my security cameras, and steam through plex, though plex often takes a while to find a new server on a network.
  14. Generally would these be covered under warranty? Since it's a fault but working as expected, will most companies replace these?
  15. new diagnostics here: https://drive.google.com/file/d/0B1uaV-iMhp2lY2xBODgxbDVpVFU/view?usp=sharing the current disk 5 is the brand new 5tb. Passed preclear (1 pass, didnt have time for more). Assuming that it is not the disk (2 identical failures would be very unlikely), Im leaning towards the sata card as it was a cheap card and its 4 years old. And its the only common denominator between the 2. The new disk 5 was working fine (played a dozen movies and other files to test that it was working... all OK) until I moved it and switched it to the sata card (same port) as the old disk 5. The 2 other drives on the sata card are working fine (3rd isnt hooked up).
  16. Not yet. Disk 5 was up less than a day and I was packing the house. I was going to fix disk 6 once I moved. I'm here now with no furniture, but a server and internet lol. I did back up all of disk 6 to an external hard drive, so the data is safe.
  17. Unfortunately, another update So, after being out of town, I had the pleasure of moving and had 8 days to do so.. fun! Before I left, I ran preclear on the new disk (5tb... 72 hours later), took the old disk 5 out of the array and restored the data. Everything worked great. Since my tower was full, I had the new disk sitting on a flat surface with a fan on it. So, after everything was complete, I decided I needed to take the bad drive out of the computer, and put the new drive in its place. That way, I could deal with formatting and testing the 'bad' drive after the move, and the new drive was secure. Powered down, replaced the drive. Within 15 mins of restarting... Bam. New disk is Invalid, contents emulated. Im wondering if my 4 Port Sata card is bad. Thats the only thing that both drives have in common (I replaced the sata cable, and power cable is the same but that seems less likely). Here is the card that Im using and have been for the past 4 years in Unraid (http://www.monoprice.com/product?c_id=104&cp_id=10407&cs_id=1040702&p_id=2667&seq=1&format=2) Thoughts? If I should replace it, whats a good and budget suggestion. 4 port minimum. greater than 4 would be great. $100 max if possible Seeing as i had the exact same issue a few weeks ago, its unlikely that Im having a drive issue. Is there a way for me to get unraid to accept this disk as good again so that I dont have to preclear another drive (72 hours) and rebuild (36 hours). or would this be a bad idea. thanks
  18. not rebuild parity, rebuild disk5. lol thats what i meant... rebuild the 'missing' disk onto the new disk using the parity's data. sorry.. been a crazy week. just got home and needed a hobby for the afternoon!
  19. Thanks for the reply. I can easily get a second drive.. it would offer better fault tolerance anyways. I feel like Im wasting time and energy with smaller (500gb, 1.5tb, 1tb) disks when I can just get another 5tb, have a ton of extra storage and more fault tolerance. Whats the best route to rebuild disk 5? - preclear the new drive, remove disk 5, replace with new drive, rebuild parity? I wasnt aware that a preclear might resolve the pending sectors issue. Thats cool.
  20. UPDATE.. Im back in town new update at bottom OK, re-seated the cables and Disk 5 was visible. Tried to bring it online but got an Offline error. Power is working as the drive is spinning. I replaced the sata cable, restarted and I can access disk 5 again. See smart report (attached). However, there is still a red x next to the drive and it says Disabled, data emulated. When I boot the server, I can select the disk from the drop down list. Once I try to start the array, it goes offline. If i stop the array, it shows No Device and its missing from the pull down list until the next restart. So, i think the drive is working. Smart looks good to me (but I might be wrong). So, does this mean the drive has issues or do i have to do something to force unraid to see the disk again as valid? Keep in mind that Disk 6 is also throwing errors. A new 5tb drive arrived today, but it still needs to be precleared. But, I dont want to risk Disk 6 dying or Im losing some data. DISK 5 smartctl 6.2 2013-07-26 r3841 [x86_64-linux-4.0.4-unRAID] (local build) Copyright (C) 2002-13, Bruce Allen, Christian Franke, www.smartmontools.org === START OF INFORMATION SECTION === Model Family: Toshiba 3.5" HDD DT01ACA... Device Model: TOSHIBA DT01ACA200 Serial Number: Y4B5UKNTS LU WWN Device Id: 5 000039 ffaded4d9 Firmware Version: MX4OABB0 User Capacity: 2,000,398,934,016 bytes [2.00 TB] Sector Sizes: 512 bytes logical, 4096 bytes physical Rotation Rate: 7200 rpm Device is: In smartctl database [for details use: -P show] ATA Version is: ATA8-ACS T13/1699-D revision 4 SATA Version is: SATA 3.0, 6.0 Gb/s (current: 1.5 Gb/s) Local Time is: Sat Oct 31 18:08:40 2015 EDT SMART support is: Available - device has SMART capability. SMART support is: Enabled === START OF READ SMART DATA SECTION === SMART overall-health self-assessment test result: PASSED General SMART Values: Offline data collection status: (0x84) Offline data collection activity was suspended by an interrupting command from host. Auto Offline Data Collection: Enabled. Self-test execution status: ( 0) The previous self-test routine completed without error or no self-test has ever been run. Total time to complete Offline data collection: (14535) seconds. Offline data collection capabilities: (0x5b) SMART execute Offline immediate. Auto Offline data collection on/off support. Suspend Offline collection upon new command. Offline surface scan supported. Self-test supported. No Conveyance Self-test supported. Selective Self-test supported. SMART capabilities: (0x0003) Saves SMART data before entering power-saving mode. Supports SMART auto save timer. Error logging capability: (0x01) Error logging supported. General Purpose Logging supported. Short self-test routine recommended polling time: ( 1) minutes. Extended self-test routine recommended polling time: ( 243) minutes. SCT capabilities: (0x003d) SCT Status supported. SCT Error Recovery Control supported. SCT Feature Control supported. SCT Data Table supported. SMART Attributes Data Structure revision number: 16 Vendor Specific SMART Attributes with Thresholds: ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE 1 Raw_Read_Error_Rate 0x000b 100 100 016 Pre-fail Always - 0 2 Throughput_Performance 0x0005 139 139 054 Pre-fail Offline - 71 3 Spin_Up_Time 0x0007 253 253 024 Pre-fail Always - 96 (Average 131) 4 Start_Stop_Count 0x0012 100 100 000 Old_age Always - 698 5 Reallocated_Sector_Ct 0x0033 100 100 005 Pre-fail Always - 0 7 Seek_Error_Rate 0x000b 094 094 067 Pre-fail Always - 6 8 Seek_Time_Performance 0x0005 124 124 020 Pre-fail Offline - 33 9 Power_On_Hours 0x0012 100 100 000 Old_age Always - 5266 10 Spin_Retry_Count 0x0013 090 090 060 Pre-fail Always - 131072 12 Power_Cycle_Count 0x0032 100 100 000 Old_age Always - 12 192 Power-Off_Retract_Count 0x0032 100 100 000 Old_age Always - 699 193 Load_Cycle_Count 0x0012 100 100 000 Old_age Always - 701 194 Temperature_Celsius 0x0002 200 200 000 Old_age Always - 30 (Min/Max 24/35) 196 Reallocated_Event_Count 0x0032 100 100 000 Old_age Always - 0 197 Current_Pending_Sector 0x0022 100 100 000 Old_age Always - 0 198 Offline_Uncorrectable 0x0008 100 100 000 Old_age Offline - 0 199 UDMA_CRC_Error_Count 0x000a 200 200 000 Old_age Always - 0 SMART Error Log Version: 1 No Errors Logged SMART Self-test log structure revision number 1 No self-tests have been logged. [To run self-tests, use: smartctl -t] SMART Selective self-test log data structure revision number 1 SPAN MIN_LBA MAX_LBA CURRENT_TEST_STATUS 1 0 0 Not_testing 2 0 0 Not_testing 3 0 0 Not_testing 4 0 0 Not_testing 5 0 0 Not_testing Selective self-test flags (0x0): After scanning selected spans, do NOT read-scan remainder of disk. If Selective self-test is pending on power-up, resume after 0 minute delay. DISK 6 smartctl 6.2 2013-07-26 r3841 [x86_64-linux-4.0.4-unRAID] (local build) Copyright (C) 2002-13, Bruce Allen, Christian Franke, www.smartmontools.org === START OF INFORMATION SECTION === Model Family: Western Digital Caviar Green (AF) Device Model: WDC WD20EARS-00MVWB0 Serial Number: WD-WMAZA4709370 LU WWN Device Id: 5 0014ee 002c2a55a Firmware Version: 51.0AB51 User Capacity: 2,000,398,934,016 bytes [2.00 TB] Sector Size: 512 bytes logical/physical Device is: In smartctl database [for details use: -P show] ATA Version is: ATA8-ACS (minor revision not indicated) SATA Version is: SATA 2.6, 3.0 Gb/s Local Time is: Sat Oct 31 18:22:15 2015 EDT SMART support is: Available - device has SMART capability. SMART support is: Enabled === START OF READ SMART DATA SECTION === SMART overall-health self-assessment test result: PASSED General SMART Values: Offline data collection status: (0x82) Offline data collection activity was completed without error. Auto Offline Data Collection: Enabled. Self-test execution status: ( 0) The previous self-test routine completed without error or no self-test has ever been run. Total time to complete Offline data collection: (40860) seconds. Offline data collection capabilities: (0x7b) SMART execute Offline immediate. Auto Offline data collection on/off support. Suspend Offline collection upon new command. Offline surface scan supported. Self-test supported. Conveyance Self-test supported. Selective Self-test supported. SMART capabilities: (0x0003) Saves SMART data before entering power-saving mode. Supports SMART auto save timer. Error logging capability: (0x01) Error logging supported. General Purpose Logging supported. Short self-test routine recommended polling time: ( 2) minutes. Extended self-test routine recommended polling time: ( 394) minutes. Conveyance self-test routine recommended polling time: ( 5) minutes. SCT capabilities: (0x3035) SCT Status supported. SCT Feature Control supported. SCT Data Table supported. SMART Attributes Data Structure revision number: 16 Vendor Specific SMART Attributes with Thresholds: ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE 1 Raw_Read_Error_Rate 0x002f 199 199 051 Pre-fail Always - 3910 3 Spin_Up_Time 0x0027 203 164 021 Pre-fail Always - 4841 4 Start_Stop_Count 0x0032 097 097 000 Old_age Always - 3430 5 Reallocated_Sector_Ct 0x0033 200 200 140 Pre-fail Always - 0 7 Seek_Error_Rate 0x002e 200 200 000 Old_age Always - 0 9 Power_On_Hours 0x0032 051 051 000 Old_age Always - 35794 10 Spin_Retry_Count 0x0032 100 100 000 Old_age Always - 0 11 Calibration_Retry_Count 0x0032 100 100 000 Old_age Always - 0 12 Power_Cycle_Count 0x0032 100 100 000 Old_age Always - 114 192 Power-Off_Retract_Count 0x0032 200 200 000 Old_age Always - 57 193 Load_Cycle_Count 0x0032 119 119 000 Old_age Always - 243719 194 Temperature_Celsius 0x0022 115 096 000 Old_age Always - 35 196 Reallocated_Event_Count 0x0032 200 200 000 Old_age Always - 0 197 Current_Pending_Sector 0x0032 200 200 000 Old_age Always - 7 198 Offline_Uncorrectable 0x0030 200 200 000 Old_age Offline - 6 199 UDMA_CRC_Error_Count 0x0032 200 200 000 Old_age Always - 0 200 Multi_Zone_Error_Rate 0x0008 199 199 000 Old_age Offline - 488 SMART Error Log Version: 1 No Errors Logged SMART Self-test log structure revision number 1 Num Test_Description Status Remaining LifeTime(hours) LBA_of_first_error # 1 Extended offline Completed: read failure 90% 35631 346803864 # 2 Extended offline Completed: read failure 90% 35579 346803867 # 3 Extended offline Completed: read failure 90% 35297 346803865 # 4 Extended offline Completed without error 00% 30589 - # 5 Short offline Completed without error 00% 30538 - SMART Selective self-test log data structure revision number 1 SPAN MIN_LBA MAX_LBA CURRENT_TEST_STATUS 1 0 0 Not_testing 2 0 0 Not_testing 3 0 0 Not_testing 4 0 0 Not_testing 5 0 0 Not_testing Selective self-test flags (0x0): After scanning selected spans, do NOT read-scan remainder of disk. If Selective self-test is pending on power-up, resume after 0 minute delay.
  21. see below So, I'm going to shut down until I get home Any input on the best route for recovery? Check cables on disk 5, if drive dead, remove from array, rebuild parity with missing drive. Immediately thereafter replace disk 6 (or just migrate data as my replacement will be a 5 tb drive). If disk 5 was just a loose cable, run parity, and then replace disk 6. Or skip parity and remove 6 and rebuild. Thanks.
  22. I replaced the parity and had 3tb extra space and 3tb data, so I thought I was covered. 100% extra. But most of my drives are smaller and older, so I figured 2 2tb drives was good overhead. Never expected both could have issues simultaneously.. Figured a old small one would die first. But live and learn. I see most drives are built to handle 55 degrees c. So, I doubt a new drive would be too negatively effected if wifey put it in the fanless hot swap. But...opinions....I won't have a lot of free time when i get home...moving.
  23. Diagnostics attached here... too large to attach on the forum https://drive.google.com/file/d/0B1uaV-iMhp2lTjdiRHBRMVAxOEE/view?usp=sharing No other backups. Its 95% tv and movies, not essential but a pain in the ass to lose. I have docker running the usual suspects... SAB, CP, Sonarr, not much else really. Sometime in the next year or 2 I plan to build a second server for off-site redundancy, but its not in the budget yet. worst case I can shut down and wife will just buy episodes from amazon prime. Idiot tax for not getting a hot swap bay. ...and... THANKS!
  24. So, ive been away for 4 months for work. Luckily I return in 2 weeks. I tried my best to add redundancy to the server before I left, and it turns out that 1 more backup drive would have been a good option.. or adding a 5 in 3 hot swap cage Here's the rundown: Disk 6: 2 weeks ago I started to get emails about disk 6 having 1 Offline Uncorrectable error. Looked it up and the forum said 1 is ok, more is bad. As of Thursday night, this number is up to 6. Time to move. This is my original parity drive from 6-7 years ago when i started with unraid.. so shes old. Thurs night i took disk 6 out of my shared folders so the mover would stop adding to it and used MC to move the data over to disk 5. Everything was going great at first (moved about 300 GB successfully), but then I saw the errors adding up on Disk 5 (1536 read errors on the unraid dashboard). Then it said Disk offline, data emulated (or something to that effect). So, I took the array offline. Its still powered up but i stopped the array. I tried starting the array but now Disk 5 shows nothing and the hard drive isnt selectable. So... is it possible that a sata cable just happened to come loose after all this time? The drive has been in use for 6 months. Or, is Disk 5 possibly failing as well? I know this is hard to diagnose over the internet. Drive is only 6 months old and has been used successfully since then. 2? preclears before installing. no issues. What would you do in this situation... Options that I can think of: (NOTE: with disk 5 and Disk 6 BOTH out of the array, I only have 3TB between the other drives, which isnt enough to move both of these drives over to the remaining drives) -Shut down the server, wait 2 weeks, check it out then. (problem: wifes at home and she wont have access to her tv shows) -Turn off parity check and and just run with disk 5 offline, move disk 6 data to another disk. (turn off parity check so parity is still valid when i get there so i have more options in case the disk 5 is DOA). hhmmm on second thought this wont work if any changes are made in the meantime -order a new drive and have wifey install a new drive in the one and only hot swap bay that i have... but it has no fan and would run at 50-53 degrees C until i get home... then just rebuild parity with disk 5 missing and copy over disk 6. - Offer a south florida unraid member money/beer/etc to check it out or pay an IT guy to go out. - ? ? ? ? ? ? Disk 6 Smart attached. I cant access disk 5 so i dont have the smart report. Disk_6_Smart_Test_1016.txt syslog.zip