twg

Members
  • Posts

    126
  • Joined

  • Last visited

Everything posted by twg

  1. ok, the 2nd preclear -V resulted in the following: root@Tower:/tmp# cat postread_errorssdg skip=1000 count=200 bs=8225280 returned 59505 instead of 00000 skip=106800 count=200 bs=8225280 returned 10405 instead of 00000 I'm going to try that command above to see what I get.
  2. ok, thanks. I'm running the post read only right now to see if I can get consistent results. I'll try some things to try to isolate whether it's the drive or something else. It's definitely not memory test, I ran it 5 days straight without any issues.
  3. I have a HD that will not preclear. root@Tower:/tmp# cat postread_errorssdg skip=1000 count=200 bs=8225280 returned 59505 instead of 00000 root@Tower:/boot/preclear_reports# cat preclear_start_sdg_2011-04-05 Disk: /dev/sdg smartctl 5.39.1 2010-01-28 r3054 [i486-slackware-linux-gnu] (local build) Copyright © 2002-10 by Bruce Allen, http://smartmontools.sourceforge.net === START OF INFORMATION SECTION === Model Family: Western Digital Caviar Green family Device Model: WDC WD15EADS-00S2B0 Serial Number: WD-WCAVY3345713 Firmware Version: 01.00A01 User Capacity: 1,500,301,910,016 bytes Device is: In smartctl database [for details use: -P show] ATA Version is: 8 ATA Standard is: Exact ATA specification draft version not indicated Local Time is: Mon Apr 4 17:14:01 2011 EDT SMART support is: Available - device has SMART capability. SMART support is: Enabled === START OF READ SMART DATA SECTION === SMART overall-health self-assessment test result: PASSED General SMART Values: Offline data collection status: (0x84) Offline data collection activity was suspended by an interrupting command from host. Auto Offline Data Collection: Enabled. Self-test execution status: ( 0) The previous self-test routine completed without error or no self-test has ever been run. Total time to complete Offline data collection: (32100) seconds. Offline data collection capabilities: (0x7b) SMART execute Offline immediate. Auto Offline data collection on/off support. Suspend Offline collection upon new command. Offline surface scan supported. Self-test supported. Conveyance Self-test supported. Selective Self-test supported. SMART capabilities: (0x0003) Saves SMART data before entering power-saving mode. Supports SMART auto save timer. Error logging capability: (0x01) Error logging supported. General Purpose Logging supported. Short self-test routine recommended polling time: ( 2) minutes. Extended self-test routine recommended polling time: ( 255) minutes. Conveyance self-test routine recommended polling time: ( 5) minutes. SCT capabilities: (0x303f) SCT Status supported. SCT Feature Control supported. SCT Data Table supported. SMART Attributes Data Structure revision number: 16 Vendor Specific SMART Attributes with Thresholds: ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE 1 Raw_Read_Error_Rate 0x002f 200 200 051 Pre-fail Always - 0 3 Spin_Up_Time 0x0027 171 152 021 Pre-fail Always - 8416 4 Start_Stop_Count 0x0032 100 100 000 Old_age Always - 200 5 Reallocated_Sector_Ct 0x0033 200 200 140 Pre-fail Always - 0 7 Seek_Error_Rate 0x002e 100 253 000 Old_age Always - 0 9 Power_On_Hours 0x0032 094 094 000 Old_age Always - 4628 10 Spin_Retry_Count 0x0032 100 100 000 Old_age Always - 0 11 Calibration_Retry_Count 0x0032 100 253 000 Old_age Always - 0 12 Power_Cycle_Count 0x0032 100 100 000 Old_age Always - 10 192 Power-Off_Retract_Count 0x0032 200 200 000 Old_age Always - 8 193 Load_Cycle_Count 0x0032 200 200 000 Old_age Always - 1052 194 Temperature_Celsius 0x0022 122 098 000 Old_age Always - 30 196 Reallocated_Event_Count 0x0032 200 200 000 Old_age Always - 0 197 Current_Pending_Sector 0x0032 200 200 000 Old_age Always - 0 198 Offline_Uncorrectable 0x0030 200 200 000 Old_age Offline - 0 199 UDMA_CRC_Error_Count 0x0032 200 200 000 Old_age Always - 0 200 Multi_Zone_Error_Rate 0x0008 200 200 000 Old_age Offline - 0 SMART Error Log Version: 1 No Errors Logged SMART Self-test log structure revision number 1 Num Test_Description Status Remaining LifeTime(hours) LBA_of_first_error # 1 Extended offline Completed without error 00% 4411 - # 2 Extended offline Completed without error 00% 4244 - # 3 Extended offline Completed without error 00% 4076 - # 4 Extended offline Completed without error 00% 3909 - # 5 Extended offline Completed without error 00% 3741 - # 6 Extended offline Completed without error 00% 3573 - # 7 Extended offline Completed without error 00% 3406 - # 8 Extended offline Completed without error 00% 3238 - # 9 Extended offline Completed without error 00% 3071 - #10 Extended offline Completed without error 00% 2904 - #11 Extended offline Completed without error 00% 2736 - #12 Extended offline Completed without error 00% 2568 - #13 Extended offline Completed without error 00% 2401 - #14 Extended offline Completed without error 00% 2233 - #15 Extended offline Completed without error 00% 2065 - #16 Extended offline Completed without error 00% 1897 - #17 Extended offline Completed without error 00% 1730 - #18 Extended offline Completed without error 00% 1562 - #19 Extended offline Completed without error 00% 1394 - #20 Extended offline Completed without error 00% 1226 - #21 Extended offline Completed without error 00% 1058 - SMART Selective self-test log data structure revision number 1 SPAN MIN_LBA MAX_LBA CURRENT_TEST_STATUS 1 0 0 Not_testing 2 0 0 Not_testing 3 0 0 Not_testing 4 0 0 Not_testing 5 0 0 Not_testing Selective self-test flags (0x0): After scanning selected spans, do NOT read-scan remainder of disk. If Selective self-test is pending on power-up, resume after 0 minute delay. root@Tower:/boot/preclear_reports# cat preclear_finish_sdg_2011-04-05 Disk: /dev/sdg smartctl 5.39.1 2010-01-28 r3054 [i486-slackware-linux-gnu] (local build) Copyright © 2002-10 by Bruce Allen, http://smartmontools.sourceforge.net === START OF INFORMATION SECTION === Model Family: Western Digital Caviar Green family Device Model: WDC WD15EADS-00S2B0 Serial Number: WD-WCAVY3345713 Firmware Version: 01.00A01 User Capacity: 1,500,301,910,016 bytes Device is: In smartctl database [for details use: -P show] ATA Version is: 8 ATA Standard is: Exact ATA specification draft version not indicated Local Time is: Tue Apr 5 09:20:40 2011 EDT SMART support is: Available - device has SMART capability. SMART support is: Enabled === START OF READ SMART DATA SECTION === SMART overall-health self-assessment test result: PASSED General SMART Values: Offline data collection status: (0x84) Offline data collection activity was suspended by an interrupting command from host. Auto Offline Data Collection: Enabled. Self-test execution status: ( 0) The previous self-test routine completed without error or no self-test has ever been run. Total time to complete Offline data collection: (32100) seconds. Offline data collection capabilities: (0x7b) SMART execute Offline immediate. Auto Offline data collection on/off support. Suspend Offline collection upon new command. Offline surface scan supported. Self-test supported. Conveyance Self-test supported. Selective Self-test supported. SMART capabilities: (0x0003) Saves SMART data before entering power-saving mode. Supports SMART auto save timer. Error logging capability: (0x01) Error logging supported. General Purpose Logging supported. Short self-test routine recommended polling time: ( 2) minutes. Extended self-test routine recommended polling time: ( 255) minutes. Conveyance self-test routine recommended polling time: ( 5) minutes. SCT capabilities: (0x303f) SCT Status supported. SCT Feature Control supported. SCT Data Table supported. SMART Attributes Data Structure revision number: 16 Vendor Specific SMART Attributes with Thresholds: ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE 1 Raw_Read_Error_Rate 0x002f 200 200 051 Pre-fail Always - 0 3 Spin_Up_Time 0x0027 171 152 021 Pre-fail Always - 8416 4 Start_Stop_Count 0x0032 100 100 000 Old_age Always - 200 5 Reallocated_Sector_Ct 0x0033 200 200 140 Pre-fail Always - 0 7 Seek_Error_Rate 0x002e 200 200 000 Old_age Always - 0 9 Power_On_Hours 0x0032 094 094 000 Old_age Always - 4644 10 Spin_Retry_Count 0x0032 100 100 000 Old_age Always - 0 11 Calibration_Retry_Count 0x0032 100 253 000 Old_age Always - 0 12 Power_Cycle_Count 0x0032 100 100 000 Old_age Always - 10 192 Power-Off_Retract_Count 0x0032 200 200 000 Old_age Always - 8 193 Load_Cycle_Count 0x0032 200 200 000 Old_age Always - 1052 194 Temperature_Celsius 0x0022 121 098 000 Old_age Always - 31 196 Reallocated_Event_Count 0x0032 200 200 000 Old_age Always - 0 197 Current_Pending_Sector 0x0032 200 200 000 Old_age Always - 0 198 Offline_Uncorrectable 0x0030 200 200 000 Old_age Offline - 0 199 UDMA_CRC_Error_Count 0x0032 200 200 000 Old_age Always - 0 200 Multi_Zone_Error_Rate 0x0008 200 200 000 Old_age Offline - 0 SMART Error Log Version: 1 No Errors Logged SMART Self-test log structure revision number 1 Num Test_Description Status Remaining LifeTime(hours) LBA_of_first_error # 1 Extended offline Completed without error 00% 4411 - # 2 Extended offline Completed without error 00% 4244 - # 3 Extended offline Completed without error 00% 4076 - # 4 Extended offline Completed without error 00% 3909 - # 5 Extended offline Completed without error 00% 3741 - # 6 Extended offline Completed without error 00% 3573 - # 7 Extended offline Completed without error 00% 3406 - # 8 Extended offline Completed without error 00% 3238 - # 9 Extended offline Completed without error 00% 3071 - #10 Extended offline Completed without error 00% 2904 - #11 Extended offline Completed without error 00% 2736 - #12 Extended offline Completed without error 00% 2568 - #13 Extended offline Completed without error 00% 2401 - #14 Extended offline Completed without error 00% 2233 - #15 Extended offline Completed without error 00% 2065 - #16 Extended offline Completed without error 00% 1897 - #17 Extended offline Completed without error 00% 1730 - #18 Extended offline Completed without error 00% 1562 - #19 Extended offline Completed without error 00% 1394 - #20 Extended offline Completed without error 00% 1226 - #21 Extended offline Completed without error 00% 1058 - SMART Selective self-test log data structure revision number 1 SPAN MIN_LBA MAX_LBA CURRENT_TEST_STATUS 1 0 0 Not_testing 2 0 0 Not_testing 3 0 0 Not_testing 4 0 0 Not_testing 5 0 0 Not_testing Selective self-test flags (0x0): After scanning selected spans, do NOT read-scan remainder of disk. If Selective self-test is pending on power-up, resume after 0 minute delay. root@Tower:/boot/preclear_reports# cat preclear_rpt_sdg_2011-04-05 ========================================================================1.9 == invoked as: ./preclear_disk.sh -W /dev/sdg == == Disk /dev/sdg has NOT been successfully precleared == Postread detected un-expected non-zero bytes on disk== == Ran 1 cycle == == Last Cycle's Zeroing time : 5:10:04 (80 MB/s) == Last Cycle's Total Time : 16:06:39 == == Total Elapsed Time 16:06:39 == == Disk Start Temperature: 30C == == Current Disk Temperature: 31C, == ============================================================================ ** Changed attributes in files: /tmp/smart_start_sdg /tmp/smart_finish_sdg ATTRIBUTE NEW_VAL OLD_VAL FAILURE_THRESHOLD STATUS RAW_VALUE Temperature_Celsius = 121 122 0 ok 31 No SMART attributes are FAILING_NOW 0 sectors were pending re-allocation before the start of the preclear. 0 sectors were pending re-allocation after zero of disk in cycle 1 of 1. 0 sectors are pending re-allocation at the end of the preclear, the number of sectors pending re-allocation did not change. 0 sectors had been re-allocated before the start of the preclear. 0 sectors are re-allocated at the end of the preclear, the number of sectors re-allocated did not change. ============================================================================ Also noticed this in file /tmp/zerosdg, this is at the end: 1500222267392 bytes (1.5 TB) copied, 18577.8 s, 80.8 MB/s dd: writing `/dev/sdg': No space left on device 715393+10 records in 715392+10 records out 1500299812864 bytes (1.5 TB) copied, 18580.5 s, 80.7 MB/s Basically it fails in the post read, it says parts of the disk should be zero but its not. I've tried preclearing twice now. It's connected to the motherboard SATA ports too... nothing at all in the syslog... no drive errors... anyone see this before ?
  4. I have the iStars... I choose them because they had the sweet spot in terms of price, size (depth) and also reliability/ease/features of use. I have no complaints with them. If I had to do it all over again, I'd seriously consider their trayless 5 in 3 cages though... just cuz i'm lazy. Oooo, yes please - I would love to hear opinions on the iStar BPN-350 before I go to the trouble of getting three of them shipped to Philippines!
  5. just food for thought... when i had my new drive fail, it caused the syslog to grow to beyond 2gigs... which ultimately crashed my system.. that is when I moved the syslog to a tmpfs instead to limit its impact on the server.
  6. i had a weird problem using 4.7 actually... I copied files to the server as a test, then tried to delete them several hours later... I couldn't... had to telnet in and "rm" them from the command line. weird.
  7. thanks for the laugh. Who said? They are lying. Don't believe them.
  8. hmm... sounds like the problem I had. out of 6 new drives, 1 was bad and would cause log explosion which ultimately crashed the system.
  9. well... the network speed is consistently high now. Not sure what originally caused the crash in the first place... will keep an eye out on it I guess. I also notiecd that my torrents are disconnected because the unRaid server is losing the link but re-establishing it very quickly... I know I read about this elsewhere so will have to go for a dig. Man, it's kinda frustrating getting this thing to work reliably...
  10. So when I first setup the unRaid server, I didn't think much about setting up multiple user shares. I wanted to keep it easy and have 1 large share. However, now I've decided that I want seperate user shares (largely because I want the cache drive only for some user shares and not others). I have about 4gigsTB worth of data under my main user share called "UR". Can I create another user share, then telnet into the server and use "mv" to move between user shares ? The user share "UR" spans 4 disks and I don't really care where the files ultimately end up. I also want to rename the user share "UR" to something else, like "Videos" or something, can I just change the name in the user share menu ? Will this affect my data ? thanks
  11. how about running top in another window to keep track of memory usage and processes when it crashes ? I assume you've tried to reseat all cables/interface cards ? Have you checked your network connections ethtool eth0, and checked the SMART values ? smartctl -a /dev/sdX You could also try testing the HD: hdparm -tT /dev/[hs]d? I noticed that I had a brand new drive that was flaky which caused weird system crashes that wasn't obvious thru the logs... Nothing at all was output to the syslog when the crashes occur. I'm stumped.
  12. Try reading the SMART values: Smartctl -a /dev/sdX, where X is your drive. It could be a failing disk where reads are ok but it is having trouble writing. Have you tried using hdparm -tT /dev/sdX as well to test the disk throughput? Do you have any other disk in your array that you can compare with? And I assumed you've rebooted the array with same results? Any errors in your syslog?
  13. ya, if the wires were reverse that means the polarity to the electronics were reversed. so instead of applying +12V and gnd, you applied gnd and +12V... this would have blown out the electronics on the board. There's a good chance that swapping out electronic boards could save the drive, unless if the motor is blown too but I doubt that. There's a good chance sending it to a data recovery service will work, but it will be $$$. I would suggest above, someone recommended trying to swap out electronics on a drive that the data isn't as critical to see how that fares first.
  14. I rebooted and I still got very low speeds... 170kb/s. As a diagnostic I checked the network link speed and it was at 100Mb... so I unplugged and replugged the cable into the server and the wall... bingo! back up to 1000Mb link. My reads from the server is now around 55-60Mb/s. I've restarted my torrents and will proceed to preclear my last drive as well as initiate a post-read of the precleared drive above that failed: preclear_disk.sh -V /dev/sdg. Will report tomorrow. Can't believe it could have been the result of a bad network connection!
  15. I'm using onboard SATA plus AOC-SASLP-MV8 plus the JMicron eBay special. The HDs that are being cleared are on the SAS and the MB. Oddly enough, the preclear that was on the MB finished the fastest but had errors /dev/sdg (above). The preclear using the SAS finished 2nd but was fine. The 3rd and last preclear is off the MB and finished fine too. no reallocated sectors no errors. All were WD15EADS drives (1.5TB WD Green non AF). I ran memtest for 7 days solid with no memory errors. I'm running a preclear_disk -V on /dev/sdg to perform the post read to see if the errors were a read error or the drive actually didn't get cleared properly. Now that the preclears are finished, the free memory is back up: root@Tower:/boot# free total used free shared buffers cached Mem: 4115664 834112 3281552 0 505024 219536 -/+ buffers/cache: 109552 4006112 Swap: 0 0 0 but copying from unRaid server is still slow... 170kb/s Latest syslog is attached. I'm going to reboot and see what happens. syslog-2011-04-04.zip
  16. sorry, what are HBAs ? I'm new around here... My 2nd preclear finished fine, added that to the array and it formatted and added fine. So now I have 1 preclear left that's running. Copying from the unRaid server is still really slow at 171kb/s... Trying to access unMenu I show this in the syslog: Apr 3 23:11:40 Tower unmenu-status: Exiting unmenu web-server, exit status code = 141 Apr 3 23:11:40 Tower unmenu-status: Starting unmenu web-server
  17. Weird... one of my preclears just finished and it failed, gave me the following: ================================================================== 1.9 = unRAID server Pre-Clear disk /dev/sdg = cycle 1 of 1, partition start on sector 64 = Disk Pre-Clear-Read completed DONE = Step 1 of 10 - Copying zeros to first 2048k bytes DONE = Step 2 of 10 - Copying zeros to remainder of disk to clear it DONE = Step 3 of 10 - Disk is now cleared from MBR onward. DONE = Step 4 of 10 - Clearing MBR bytes for partition 2,3 & 4 DONE = Step 5 of 10 - Clearing MBR code area DONE = Step 6 of 10 - Setting MBR signature bytes DONE = Step 7 of 10 - Setting partition 1 to precleared state DONE = Step 8 of 10 - Notifying kernel we changed the partitioning DONE = Step 9 of 10 - Creating the /dev/disk/by* entries DONE = Step 10 of 10 - Verifying if the MBR is cleared. DONE = Disk Post-Clear-Read completed DONE Disk Temperature: 33C, Elapsed Time: 22:22:06 ========================================================================1.9 == WDC WD15EADS-00S2B0 WD-WCAVY3345713 == Disk /dev/sdg has NOT been precleared successfully == skip=1000 count=200 bs=8225280 returned 04964 instead of 00000 skip=106800 count=200 bs=8225280 returned 06360 instead of 00000 ============================================================================ ** Changed attributes in files: /tmp/smart_start_sdg /tmp/smart_finish_sdg ATTRIBUTE NEW_VAL OLD_VAL FAILURE_THRESHOLD STATUS RAW_VALUE Temperature_Celsius = 119 117 0 ok 33 No SMART attributes are FAILING_NOW 0 sectors were pending re-allocation before the start of the preclear. 0 sectors were pending re-allocation after pre-read in cycle 1 of 1. 0 sectors were pending re-allocation after zero of disk in cycle 1 of 1. 0 sectors are pending re-allocation at the end of the preclear, the number of sectors pending re-allocation did not change. 0 sectors had been re-allocated before the start of the preclear. 0 sectors are re-allocated at the end of the preclear, the number of sectors re-allocated did not change. the full syslog zip file is in my post above. 4gig memory, amd athlon II x3 processor, biostar 785 board...
  18. in my limited experience setting up my server, hard server crashes are a result of running out of memory... this could be due to syslog explosion (syslog grows too large and hogs all the ramdisk) or something else that's chewing up the memory. Preclearing a drive shouldn't req. too much memory, do you have 4gig ? that's what I thought I read. I have 4gig and was able to preclear 5 drives at a time. To check available memory, you use the "free" command. Just type in in a telnet window. Other option is to run "top", it lists all the processes that's running and it updates itself. Keep this running in a telnet window until it crashes and see what it says. You can also have in another telnet window the following command: "tail -f -n 20 /var/log/syslog" it will show the contents of syslog and update it when new entries are added. When the unRaid server crashes, you can see what the last few entries are.
  19. if you don't have any parity errors anymore, they miht have been corrected in the first parity check. Were all of your HDs spinning or just the one that you were streaming content to ? HDs can survive large G forces when they're off... and fairly large when they're on too... a drop while writing is bad, drop while reading isn't as bad... i'd run a SMART test on the drive that had your music on it just to be sure.
  20. i don't see anything obvious... Can you try telneting into your unRaid server, then typing: tail -f -n 20 /var/log/syslog this way, whenever syslog is updated, it outputs it to your telnet screen... leave it running and when your server crashes, you will be able to see the last 20-30 lines in your syslog to see what happened.
  21. i'm not sure since i couldn't recover the syslog due to the crash... but right now my log file is tiny, around 75k... I've got 3 preclears going but the fastest I can read from the unRaid server is about 200kb/s... which is far shy of the 55Mb/s I was getting the other day...
  22. So I finally got the unRaid server setup a couple of days ago. Copied all my data over. Now I'm starting to use it as I did with my old DLINK 343 NAS which has served me pretty well, albeit a bit slow. Here's the problem: Last night, I'm streaming a TV show over network to my WDTV player. The show stops all of a sudden and I find out that the unRaid server crashed... hard. No console video output, would not even respond to pings... so I had to hard reboot. After rebooting, checked syslog, they looked fine. I do notice that free memory was slowly being used... mostly to the cache, which is not uncommon with linux. I finish watching rest of my TV show while monitoring the free memory and while it was slowly decreasing it hit 300Mb when I finished the show. This morning I check on the unRaid server and here is what I find: root@Tower:/boot/scripts# free total used free shared buffers cached Mem: 4115664 4097292 18372 0 731088 3221064 -/+ buffers/cache: 145140 3970524 Swap: 0 0 0 top - 06:55:46 up 8:10, 5 users, load average: 0.52, 0.43, 0.33 Tasks: 116 total, 1 running, 115 sleeping, 0 stopped, 0 zombie Cpu(s): 0.4%us, 10.2%sy, 0.0%ni, 42.3%id, 45.5%wa, 0.2%hi, 1.5%si, 0.0%st Mem: 4115664k total, 4097820k used, 17844k free, 730824k buffers Swap: 0k total, 0k used, 0k free, 3221116k cached PID USER PR NI VIRT RES SHR S %CPU %MEM TIME+ COMMAND 1030 root 20 0 0 0 0 S 9 0.0 7:42.63 flush-8:48 15617 root 20 0 0 0 0 S 8 0.0 5:24.99 flush-8:112 21082 root 20 0 0 0 0 S 7 0.0 8:34.90 flush-8:96 332 root 20 0 0 0 0 S 3 0.0 11:58.92 kswapd0 15785 root 20 0 3844 2708 568 S 2 0.1 1:03.34 dd 1191 root 20 0 3840 2704 568 S 1 0.1 1:12.32 dd 1623 root 20 0 4944 3900 892 S 1 0.1 0:19.40 awk 21215 root 20 0 3840 2704 568 S 1 0.1 1:30.20 dd 30241 root 20 0 2116 1032 788 R 1 0.0 0:00.02 top 1 root 20 0 700 304 264 S 0 0.0 0:01.85 init 2 root 20 0 0 0 0 S 0 0.0 0:00.00 kthreadd 3 root RT 0 0 0 0 S 0 0.0 0:00.29 migration/0 4 root 20 0 0 0 0 S 0 0.0 0:00.02 ksoftirqd/0 5 root RT 0 0 0 0 S 0 0.0 0:00.44 migration/1 6 root 20 0 0 0 0 S 0 0.0 0:00.00 ksoftirqd/1 7 root RT 0 0 0 0 S 0 0.0 0:03.58 migration/2 8 root 20 0 0 0 0 S 0 0.0 0:00.01 ksoftirqd/2 9 root 20 0 0 0 0 S 0 0.0 0:00.00 events/0 10 root 20 0 0 0 0 S 0 0.0 0:00.00 events/1 11 root 20 0 0 0 0 S 0 0.0 0:00.00 events/2 12 root 20 0 0 0 0 S 0 0.0 0:00.01 khelper 17 root 20 0 0 0 0 S 0 0.0 0:00.00 async/mgr 120 root 20 0 0 0 0 S 0 0.0 0:05.30 sync_supers 122 root 20 0 0 0 0 S 0 0.0 0:00.56 bdi-default 124 root 20 0 0 0 0 S 0 0.0 0:00.00 kblockd/0 125 root 20 0 0 0 0 S 0 0.0 0:00.03 kblockd/1 126 root 20 0 0 0 0 S 0 0.0 0:00.24 kblockd/2 127 root 20 0 0 0 0 S 0 0.0 0:00.00 kacpid 128 root 20 0 0 0 0 S 0 0.0 0:00.00 kacpi_notify 129 root 20 0 0 0 0 S 0 0.0 0:00.00 kacpi_hotplug 241 root 20 0 0 0 0 S 0 0.0 0:00.00 ata/0 242 root 20 0 0 0 0 S 0 0.0 0:00.00 ata/1 243 root 20 0 0 0 0 S 0 0.0 0:00.00 ata/2 244 root 20 0 0 0 0 S 0 0.0 0:00.00 ata_aux 248 root 20 0 0 0 0 S 0 0.0 0:00.00 ksuspend_usbd 253 root 20 0 0 0 0 S 0 0.0 0:00.00 khubd 256 root 20 0 0 0 0 S 0 0.0 0:00.00 kseriod 291 root 20 0 0 0 0 S 0 0.0 0:00.00 rpciod/0 292 root 20 0 0 0 0 S 0 0.0 0:00.00 rpciod/1 293 root 20 0 0 0 0 S 0 0.0 0:00.00 rpciod/2 My uTorrent client on my desktop PC which directly saves to unRaid is running but it claims the disk is overloaded and half of the torrents show big red X and that the directory is unavailable, seems like unRaid timed out on a few torrents. When I go to restart all of the torrents (10) it works fine for a bit but then speed drops to <30kb/s down from 600kb/s and again, the disk overloaded message appears on the bottom of uTorrent. On unRaid I killed cache_dirs wondering if that was causing issue, but no effect. I do have 3 preclears running on unRaid at the moment. (it was preclearing when I was streaming the TV show, had it going with torrents too) Could this be causing some of this ? Is it the cache that is using up all of the memory ? showing 18M free memory seems really really low. The syslog is attached and is fine, not big at all. I even have my log file on a tmp ram drive. root@Tower:/boot/scripts# df Filesystem 1K-blocks Used Available Use% Mounted on /dev/sda1 3907568 131212 3776356 4% /boot tmpfs 2057832 84 2057748 1% /tmp/log /dev/md4 488371640 246929796 241441844 51% /mnt/disk4 /dev/md2 1953454928 1831607096 121847832 94% /mnt/disk2 /dev/md3 488371640 341668268 146703372 70% /mnt/disk3 /dev/md1 1953454928 1831331836 122123092 94% /mnt/disk1 shfs 4883653136 4251536996 632116140 88% /mnt/user as you can see only very small part of memory is used to hold syslog. My problem right now is why uTorrent is showing disk overloaded, seems like the IO is very slow from unRaid so it has to throttle my torrents back to accomodate. I'm stopped uTorrent and even accessing files on the server is slow... I'm trying to copy a directory to my desktop PC and it's transferring at 173kb/s!!! syslog-2011-04-03.zip
  23. you might have hit the nail on the head... Although I didn't check explicitly, I did try copying a large 13gig file from my desktop PC to unRaid and got around 75Mb/s transfer rate... which is a lot faster than I expected. I tried this several times, sometimes it was a bit slower down at 60Mb/s but 75Mb/s seemed the norm. (Caveat: This is without parity on writing to a Hitachi 7k2000 drive). I'm pretty happy with the results. It must be a speed issue between the Dlink 343 and unRaid, which I don't care about since I'm retiring the Dlink once the data is copied over. Solved!
  24. added results of hdparm -tT command: root@Tower:/# hdparm -tT /dev/sdb /dev/sdb: Timing cached reads: 3872 MB in 2.00 seconds = 1936.67 MB/sec Timing buffered disk reads: 372 MB in 3.01 seconds = 123.73 MB/sec root@Tower:/# hdparm -tT /dev/sdc /dev/sdc: Timing cached reads: 3906 MB in 2.00 seconds = 1952.97 MB/sec Timing buffered disk reads: 372 MB in 3.01 seconds = 123.38 MB/sec root@Tower:/# hdparm -tT /dev/sdd /dev/sdd: Timing cached reads: 3760 MB in 2.00 seconds = 1880.64 MB/sec Timing buffered disk reads: 384 MB in 3.01 seconds = 127.50 MB/sec root@Tower:/# hdparm -tT /dev/sde /dev/sde: Timing cached reads: 3726 MB in 2.00 seconds = 1863.42 MB/sec Timing buffered disk reads: 326 MB in 3.02 seconds = 108.03 MB/sec root@Tower:/# hdparm -tT /dev/sdf /dev/sdf: Timing cached reads: 3726 MB in 2.00 seconds = 1863.00 MB/sec Timing buffered disk reads: 324 MB in 3.02 seconds = 107.35 MB/sec sdb = hitachi 7k2000 sdc = hitachi 5k3000 sdd = hitachi 7k2000 sde = seagate st3500320as sdf = seagate st3500320as Reads seem decent ?