upthetoon

Members
  • Posts

    112
  • Joined

  • Last visited

Everything posted by upthetoon

  1. 1266 pending failed sectors. Based on that, I'd say take it out of the array. You can try running several preclears on it to see if the pending failed sectors are corrected, but I wouldn't trust the drive anymore. Thanks. I'll give the pre-clear a go but will probably just scrap it.
  2. I have a drive that has started generating errors on the UNRAID dashboard (accomapined by a noticable lag if its a video playing and is presumably being rebuilt on the fly). I've ordered a new drive to replace this one, is there any thing I can do to re-use this drive in my array or is it time to scrap it? smartctl 6.2 2013-07-26 r3841 [x86_64-linux-4.1.7-unRAID] (local build) Copyright (C) 2002-13, Bruce Allen, Christian Franke, www.smartmontools.org === START OF INFORMATION SECTION === Model Family: Western Digital Caviar Green (AF) Device Model: WDC WD20EARS-00MVWB0 Serial Number: WD-WMAZA0689789 LU WWN Device Id: 5 0014ee 600663089 Firmware Version: 51.0AB51 User Capacity: 2,000,398,934,016 bytes [2.00 TB] Sector Size: 512 bytes logical/physical Device is: In smartctl database [for details use: -P show] ATA Version is: ATA8-ACS (minor revision not indicated) SATA Version is: SATA 2.6, 3.0 Gb/s Local Time is: Tue Oct 20 17:47:21 2015 BST SMART support is: Available - device has SMART capability. SMART support is: Enabled === START OF READ SMART DATA SECTION === SMART overall-health self-assessment test result: PASSED General SMART Values: Offline data collection status: (0x82) Offline data collection activity was completed without error. Auto Offline Data Collection: Enabled. Self-test execution status: ( 0) The previous self-test routine completed without error or no self-test has ever been run. Total time to complete Offline data collection: (39180) seconds. Offline data collection capabilities: (0x7b) SMART execute Offline immediate. Auto Offline data collection on/off support. Suspend Offline collection upon new command. Offline surface scan supported. Self-test supported. Conveyance Self-test supported. Selective Self-test supported. SMART capabilities: (0x0003) Saves SMART data before entering power-saving mode. Supports SMART auto save timer. Error logging capability: (0x01) Error logging supported. General Purpose Logging supported. Short self-test routine recommended polling time: ( 2) minutes. Extended self-test routine recommended polling time: ( 378) minutes. Conveyance self-test routine recommended polling time: ( 5) minutes. SCT capabilities: (0x3035) SCT Status supported. SCT Feature Control supported. SCT Data Table supported. SMART Attributes Data Structure revision number: 16 Vendor Specific SMART Attributes with Thresholds: ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE 1 Raw_Read_Error_Rate 0x002f 200 200 051 Pre-fail Always - 0 3 Spin_Up_Time 0x0027 171 164 021 Pre-fail Always - 6433 4 Start_Stop_Count 0x0032 097 097 000 Old_age Always - 3905 5 Reallocated_Sector_Ct 0x0033 200 200 140 Pre-fail Always - 0 7 Seek_Error_Rate 0x002e 200 200 000 Old_age Always - 0 9 Power_On_Hours 0x0032 045 045 000 Old_age Always - 40347 10 Spin_Retry_Count 0x0032 100 100 000 Old_age Always - 0 11 Calibration_Retry_Count 0x0032 100 100 000 Old_age Always - 0 12 Power_Cycle_Count 0x0032 100 100 000 Old_age Always - 207 192 Power-Off_Retract_Count 0x0032 200 200 000 Old_age Always - 84 193 Load_Cycle_Count 0x0032 154 154 000 Old_age Always - 140393 194 Temperature_Celsius 0x0022 124 112 000 Old_age Always - 26 196 Reallocated_Event_Count 0x0032 200 200 000 Old_age Always - 0 197 Current_Pending_Sector 0x0032 197 197 000 Old_age Always - 1266 198 Offline_Uncorrectable 0x0030 198 198 000 Old_age Offline - 805 199 UDMA_CRC_Error_Count 0x0032 200 200 000 Old_age Always - 0 200 Multi_Zone_Error_Rate 0x0008 183 183 000 Old_age Offline - 4662 SMART Error Log Version: 1 No Errors Logged SMART Self-test log structure revision number 1 Num Test_Description Status Remaining LifeTime(hours) LBA_of_first_error # 1 Short offline Completed: read failure 90% 40241 1271113988 # 2 Short offline Completed: read failure 10% 39968 1322567359 SMART Selective self-test log data structure revision number 1 SPAN MIN_LBA MAX_LBA CURRENT_TEST_STATUS 1 0 0 Not_testing 2 0 0 Not_testing 3 0 0 Not_testing 4 0 0 Not_testing 5 0 0 Not_testing Selective self-test flags (0x0): After scanning selected spans, do NOT read-scan remainder of disk. If Selective self-test is pending on power-up, resume after 0 minute delay.
  3. Thanks! It's building the parity again now. Fingers crossed it won't happen again...
  4. Everything seems fine connection wise. No change when I booted back up. I can't actually see any errors or issues in the syslog.
  5. Report of the offending drive smartctl 6.2 2013-07-26 r3841 [x86_64-linux-4.1.7-unRAID] (local build) Copyright (C) 2002-13, Bruce Allen, Christian Franke, www.smartmontools.org === START OF INFORMATION SECTION === Device Model: HGST HDN724040ALE640 Serial Number: PK1334PBH0G3LS LU WWN Device Id: 5 000cca 23dce4c7e Firmware Version: MJAOA5E0 User Capacity: 4,000,787,030,016 bytes [4.00 TB] Sector Sizes: 512 bytes logical, 4096 bytes physical Rotation Rate: 7200 rpm Device is: Not in smartctl database [for details use: -P showall] ATA Version is: ATA8-ACS T13/1699-D revision 4 SATA Version is: SATA 3.0, 6.0 Gb/s (current: 6.0 Gb/s) Local Time is: Fri Oct 2 16:26:17 2015 BST SMART support is: Available - device has SMART capability. SMART support is: Enabled === START OF READ SMART DATA SECTION === SMART overall-health self-assessment test result: PASSED General SMART Values: Offline data collection status: (0x85) Offline data collection activity was aborted by an interrupting command from host. Auto Offline Data Collection: Enabled. Self-test execution status: ( 0) The previous self-test routine completed without error or no self-test has ever been run. Total time to complete Offline data collection: ( 24) seconds. Offline data collection capabilities: (0x5b) SMART execute Offline immediate. Auto Offline data collection on/off support. Suspend Offline collection upon new command. Offline surface scan supported. Self-test supported. No Conveyance Self-test supported. Selective Self-test supported. SMART capabilities: (0x0003) Saves SMART data before entering power-saving mode. Supports SMART auto save timer. Error logging capability: (0x01) Error logging supported. General Purpose Logging supported. Short self-test routine recommended polling time: ( 1) minutes. Extended self-test routine recommended polling time: ( 587) minutes. SCT capabilities: (0x003d) SCT Status supported. SCT Error Recovery Control supported. SCT Feature Control supported. SCT Data Table supported. SMART Attributes Data Structure revision number: 16 Vendor Specific SMART Attributes with Thresholds: ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE 1 Raw_Read_Error_Rate 0x000b 100 100 016 Pre-fail Always - 0 2 Throughput_Performance 0x0005 137 137 054 Pre-fail Offline - 77 3 Spin_Up_Time 0x0007 131 131 024 Pre-fail Always - 592 (Average 591) 4 Start_Stop_Count 0x0012 100 100 000 Old_age Always - 849 5 Reallocated_Sector_Ct 0x0033 100 100 005 Pre-fail Always - 0 7 Seek_Error_Rate 0x000b 100 100 067 Pre-fail Always - 0 8 Seek_Time_Performance 0x0005 121 121 020 Pre-fail Offline - 34 9 Power_On_Hours 0x0012 099 099 000 Old_age Always - 11209 10 Spin_Retry_Count 0x0013 100 100 060 Pre-fail Always - 0 12 Power_Cycle_Count 0x0032 100 100 000 Old_age Always - 34 192 Power-Off_Retract_Count 0x0032 100 100 000 Old_age Always - 855 193 Load_Cycle_Count 0x0012 100 100 000 Old_age Always - 855 194 Temperature_Celsius 0x0002 162 162 000 Old_age Always - 37 (Min/Max 13/44) 196 Reallocated_Event_Count 0x0032 100 100 000 Old_age Always - 0 197 Current_Pending_Sector 0x0022 100 100 000 Old_age Always - 0 198 Offline_Uncorrectable 0x0008 100 100 000 Old_age Offline - 0 199 UDMA_CRC_Error_Count 0x000a 200 200 000 Old_age Always - 0 SMART Error Log Version: 1 No Errors Logged SMART Self-test log structure revision number 1 Num Test_Description Status Remaining LifeTime(hours) LBA_of_first_error # 1 Short offline Completed without error 00% 11209 - # 2 Extended offline Aborted by host 50% 11203 - # 3 Short offline Aborted by host 90% 9964 - SMART Selective self-test log data structure revision number 1 SPAN MIN_LBA MAX_LBA CURRENT_TEST_STATUS 1 0 0 Not_testing 2 0 0 Not_testing 3 0 0 Not_testing 4 0 0 Not_testing 5 0 0 Not_testing Selective self-test flags (0x0): After scanning selected spans, do NOT read-scan remainder of disk. If Selective self-test is pending on power-up, resume after 0 minute delay.
  6. Hi, Aa few apps were wonky this morning so I rebooted by server. It's come back up with the partiy drive marked as Faulty. This is the yongest drive in my array so I hope its not goosed. I'm going to get the server out and re-seat the connections but can anyone see anything telling in the attached diagnostics? Cheers. ridcully-diagnostics-20151002-1626.zip
  7. Fantastic, thanks, I had a feeling it was something very basic about the process that had bypassed me Working great now. Thanks for the script and the help!
  8. This is exactly what I've been looking for too, thanks. I've ran into a little difficulty though... I think I have it all set up properly but it doesn't wake my server. I think it may be to do with the way I've mounted my shares on osx, I have just done it through go > connect to server. Do they have to be mounted so osx 'remembers' them somehow? My end goal is to use the shares in xbmc. I have a mac mini attached to my tv but I am not experienced in osx. Any help appreciated!
  9. That error is down to your usenet provider. The files you are trying to download are not available any more as they are outside the retention range that they have.
  10. Ah... I know what I've done. The first run was on the new version. When I changed the sata port I also snapped my USB stick in two. I then went back to a backup I had but I forgot to upgrade the preclear script again. Many thanks Joe, once I get my key sorted for the replacement USB stick I'll be back up and running!
  11. Hi again Joe, I changed SATA ports on that HDD and it completed this time but with an error... =========================================================================== = unRAID server Pre-Clear disk /dev/sdf = cycle 1 of 1 = Disk Pre-Clear-Read completed DONE = Step 1 of 10 - Copying zeros to first 2048k bytes DONE = Step 2 of 10 - Copying zeros to remainder of disk to clear it DONE = Step 3 of 10 - Disk is now cleared from MBR onward. DONE = Step 4 of 10 - Clearing MBR bytes for partition 2,3 & 4 DONE = Step 5 of 10 - Clearing MBR code area DONE = Step 6 of 10 - Setting MBR signature bytes DONE = Step 7 of 10 - Setting partition 1 to precleared state DONE = Step 8 of 10 - Notifying kernel we changed the partitioning DONE = Step 9 of 10 - Creating the /dev/disk/by* entries DONE = Step 10 of 10 - Testing if the clear has been successful. DONE = Disk Temperature: 32C, Elapsed Time: 12:19:26 ============================================================================ == == SORRY: Disk /dev/sdf MBR could NOT be precleared == == out4= 00092 == out5= 00092 ============================================================================ 1+0 records in 1+0 records out 0000000 0000 0000 0000 0000 0000 0000 0000 0000 512 bytes (512 B) copied* 0000700 0000 0000 0000 003f 0000 8871 e8e0 0000 , 0.0245598 s, 20.8 kB/s 0000720 0000 0000 0000 0000 0000 0000 0000 0000 * 0000760 0000 0000 0000 0000 0000 0000 0000 5c5c 0001000 I did a smart report too but it seems Ok? root@Ridcully:~# smartctl -a /dev/sdf smartctl 5.40 2010-10-16 r3189 [i486-slackware-linux-gnu] (local build) Copyright (C) 2002-10 by Bruce Allen, http://smartmontools.sourceforge.net === START OF INFORMATION SECTION === Device Model: Hitachi HDS5C3020ALA632 Serial Number: ML0220F30VB0JD Firmware Version: ML6OA580 User Capacity: 2,000,398,934,016 bytes Device is: Not in smartctl database [for details use: -P showall] ATA Version is: 8 ATA Standard is: ATA-8-ACS revision 4 Local Time is: Sat Jun 4 09:01:40 2011 BST SMART support is: Available - device has SMART capability. SMART support is: Enabled === START OF READ SMART DATA SECTION === SMART overall-health self-assessment test result: PASSED General SMART Values: Offline data collection status: (0x85) Offline data collection activity was aborted by an interrupting command from host. Auto Offline Data Collection: Enabled. Self-test execution status: ( 249) Self-test routine in progress... 90% of test remaining. Total time to complete Offline data collection: (23815) seconds. Offline data collection capabilities: (0x5b) SMART execute Offline immediate. Auto Offline data collection on/off support. Suspend Offline collection upon new command. Offline surface scan supported. Self-test supported. No Conveyance Self-test supported. Selective Self-test supported. SMART capabilities: (0x0003) Saves SMART data before entering power-saving mode. Supports SMART auto save timer. Error logging capability: (0x01) Error logging supported. General Purpose Logging supported. Short self-test routine recommended polling time: ( 1) minutes. Extended self-test routine recommended polling time: ( 255) minutes. SCT capabilities: (0x003d) SCT Status supported. SCT Error Recovery Control supported. SCT Feature Control supported. SCT Data Table supported. SMART Attributes Data Structure revision number: 16 Vendor Specific SMART Attributes with Thresholds: ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE 1 Raw_Read_Error_Rate 0x000b 100 100 016 Pre-fail Always - 0 2 Throughput_Performance 0x0005 134 134 054 Pre-fail Offline - 100 3 Spin_Up_Time 0x0007 167 167 024 Pre-fail Always - 359 (Average 298) 4 Start_Stop_Count 0x0012 100 100 000 Old_age Always - 10 5 Reallocated_Sector_Ct 0x0033 100 100 005 Pre-fail Always - 0 7 Seek_Error_Rate 0x000b 100 100 067 Pre-fail Always - 0 8 Seek_Time_Performance 0x0005 146 146 020 Pre-fail Offline - 29 9 Power_On_Hours 0x0012 100 100 000 Old_age Always - 58 10 Spin_Retry_Count 0x0013 100 100 060 Pre-fail Always - 0 12 Power_Cycle_Count 0x0032 100 100 000 Old_age Always - 10 192 Power-Off_Retract_Count 0x0032 100 100 000 Old_age Always - 10 193 Load_Cycle_Count 0x0012 100 100 000 Old_age Always - 10 194 Temperature_Celsius 0x0002 193 193 000 Old_age Always - 31 (Min/Max 22/34) 196 Reallocated_Event_Count 0x0032 100 100 000 Old_age Always - 0 197 Current_Pending_Sector 0x0022 100 100 000 Old_age Always - 0 198 Offline_Uncorrectable 0x0008 100 100 000 Old_age Offline - 0 199 UDMA_CRC_Error_Count 0x000a 200 200 000 Old_age Always - 0 SMART Error Log Version: 1 No Errors Logged SMART Self-test log structure revision number 1 Num Test_Description Status Remaining LifeTime(hours) LBA_of_first_error # 1 Short offline Completed without error 00% 34 - SMART Selective self-test log data structure revision number 1 SPAN MIN_LBA MAX_LBA CURRENT_TEST_STATUS 1 0 0 Not_testing 2 0 0 Not_testing 3 0 0 Not_testing 4 0 0 Not_testing 5 0 0 Not_testing Selective self-test flags (0x0): After scanning selected spans, do NOT read-scan remainder of disk. If Selective self-test is pending on power-up, resume after 0 minute delay. ...I've also managed to break my USB stick moving my server around but that's seperate issue...
  12. Ok, thanks Joe. I've sent an email to Tomm. Just thinking, the preclear said it had ran for 8 hrs when I got back from work so it lasted long after this crash in the log.
  13. Hi Joe, I restarted the machine but copied the log first. There doesn't seem to be much in there? Booted up and started the preclear went off to work. Got back, logged in again and crash by the looks of it? Jun 2 07:24:43 Ridcully in.telnetd[6260]: connect from 192.168.0.100 (192.168.0.100) Jun 2 07:24:48 Ridcully login[6261]: ROOT LOGIN on '/dev/pts/0' from '192.168.0.100' Jun 2 07:25:52 Ridcully in.telnetd[7029]: connect from 192.168.0.100 (192.168.0.100) Jun 2 07:25:57 Ridcully login[7030]: ROOT LOGIN on '/dev/pts/0' from '192.168.0.100' Jun 2 07:26:22 Ridcully kernel: NTFS driver 2.1.29 [Flags: R/W MODULE]. Jun 2 07:26:34 Ridcully kernel: sdg: unknown partition table Jun 2 07:27:04 Ridcully in.telnetd[7592]: connect from 192.168.0.100 (192.168.0.100) Jun 2 07:27:09 Ridcully login[7593]: ROOT LOGIN on '/dev/pts/0' from '192.168.0.100' Jun 2 13:09:32 Ridcully kernel: mdcmd (35): spindown 1 Jun 2 13:09:33 Ridcully kernel: mdcmd (36): spindown 2 Jun 2 14:21:46 Ridcully kernel: md: sync done. time=25215sec Jun 2 14:21:46 Ridcully kernel: md: recovery thread sync completion status: 0 Jun 2 15:21:53 Ridcully kernel: mdcmd (37): spindown 0 Jun 2 15:21:54 Ridcully kernel: mdcmd (38): spindown 3 Jun 2 16:33:25 Ridcully in.telnetd[24394]: connect from 192.168.0.100 (192.168.0.100) Jun 2 16:33:29 Ridcully login[24395]: ROOT LOGIN on '/dev/pts/0' from '192.168.0.100' syslog.txt
  14. I have tried preclearing the same drive twice now (2TB Hitachi Deskstar, 0F12117), both times it has crashed/became stuck in writing the zeros (second stage). The first time it got to 44%, this time 55%. I used screen to launch the script. If I do a ps I can see the script is still there... root 7161 7012 0 07:26 pts/1 00:00:05 /bin/bash ./preclear_disk.sh /dev/sdg but I don't think it's doing anything because this is what top shows... PID USER PR NI VIRT RES SHR S %CPU %MEM TIME+ COMMAND 3398 root 20 0 4112 2856 620 R 5 0.1 12:02.99 dd 3365 root 20 0 0 0 0 S 2 0.0 4:24.72 flush-8:96 437 root 20 0 0 0 0 S 2 0.0 6:38.60 kswapd0 6048 nobody 20 0 288m 85m 3520 S 0 2.2 1:14.43 python 1 root 20 0 828 308 264 S 0 0.0 0:02.56 init 2 root 20 0 0 0 0 S 0 0.0 0:00.00 kthreadd 3 root 20 0 0 0 0 S 0 0.0 0:00.13 ksoftirqd/0 Both time it has also taken down the unraid and unmenu interfaces with it and I'm not sure how to launch them again without a hard reset. My shares are also down but applications such as sabnzbd are still working. Is this a drive issue?
  15. As beta 4 is already on the roadmap as a feature release and will probably have been referred to elsewhere in the forums would it be easier to call it beta 3.1? Looking forward to giving this a go
  16. thanks pro but using /tmp/log doesn't appear to let the drive spindown. I've 777'd my /var/log directory and its working now. Nothing has blown up just yet but I'll keep an eye on it...
  17. I had been running sickbeard with the log location of /var/log/ which meant my cache drive would spin down. When running sickbeard as nobody I had to change the log directory to a folder on my cache drive otherwise it would generate permission errors with /var/log/. After googling the issue it seems only root has access to /var/log/. I have tried changing the permissions of /var/log/ to 775 but it still gives errors. Does anyone know if there is anyway to give nobody access to write to /var/log/ or some other solution to change the logging of sickbeard to allow the cache drive to spin down?
  18. Take a copy / printscreen of your drives page first. I made this change a while back and I think a drive was unassigned as a result. Just make sure you put any drives back in the right slot on the drives page and all should be fine.
  19. Yes, I came across this magic number after researching the error. When you see this do you want to put a note in the opening post Sacretangent?
  20. Wahey. Finally got this working. I was using port 88 for SABnzbd but it didn't like this running under nobody. I gave it a higher port number and now its working fine. Thanks again sacretagent.
  21. From the XBMC forums... "The atv2 is still, regrettably, hardware limited on output. It can decode 1080p, but it'll still output 720p."
  22. Thanks sacretagent, I will be trying this tonight! One related note... if you run couchpotato I don't think whatever drive you are running it from will sleep. The other two will after a few changes to the configs.
  23. a-ha... I wasn't giving nobody shell access. I will test this out tonight. Thanks for working through all this sacretagent. Just seen your new thread documenting it all, nice work!