Jump to content

[SOLVED BY UPGRADING to 5.0b6] Read speed drop every 1 to 2 mins !!!


mwpmo

Recommended Posts

I just found that the read speed of my unRAID dropped to zero for a very short period (less that a second) every 1 - 2 mins.

 

The result was that the playback of video freeze for a very short period every 1 - 2 mins.

 

Any hints for me to debug?

 

----

 

One founding was that even without any read/write access to my unRAID, the disk access light for all drive that spin-up will be on in sequence every 1 - 2 mins.

 

Is there any process /plugin will access the harddisk in sequence every 1 - 2 mins?

 

----

 

In addition, I got 2 GB Ram in unRAID, is it possible to use some memory as buffers, so that even my harddisk was busy every 1-2 mins by other process, I can still got some steady output.

 

----

 

My Setup

 

Motherboard - Gigabyte X48-DQ6

CPU - Intel C2D E8400

RAM - 2GB DDR2-800

 

Parity Disk - Hitachi 7K3000 7200rpm 2TB (on Channel 0 of 1st Sil3132 PCI-e 1x card)

Disk 1 to 8 - Hitachi 5K3000 5900rpm 2TB x 8 (on board ICH9 (AHCI) x 6 + on board JMicorn (AHCI) x 2)

Cache Disk - WD 7200rpm 1TB (on Channel 0 of 2nd Sil3132 PCI-e 1x card)

 

UnRaid 4.7 AIO

 

Link to comment

My Setup

 

Motherboard - Gigabyte X48-DQ6

CPU - Intel C2D E8400

RAM - 2GB DDR2-800

 

Parity Disk - Hitachi 7K3000 7200rpm 2TB (on Channel 0 of 1st Sil3132 PCI-e 1x card)

Disk 1 to 8 - Hitachi 5K3000 5900rpm 2TB x 8 (on board ICH9 (AHCI) x 6 + on board JMicorn (AHCI) x 2)

Cache Disk - WD 7200rpm 1TB (on Channel 0 of 2nd Sil3132 PCI-e 1x card)

 

UnRaid 4.7 AIO

 

-----

 

TOP result (when I copy file from unRAID)

 

top - 09:20:43 up  7:44,  1 user,  load average: 1.73, 1.82, 1.85

Tasks:  98 total,  2 running,  96 sleeping,  0 stopped,  0 zombie

Cpu(s):  0.8%us,  7.0%sy,  0.0%ni, 34.4%id, 50.0%wa,  5.0%hi,  2.8%si,  0.0%st

Mem:  2073808k total,  2025872k used,    47936k free,  346052k buffers

Swap:        0k total,        0k used,        0k free,  1478524k cached

 

  PID USER      PR  NI  VIRT  RES  SHR S %CPU %MEM    TIME+  COMMAND

4439 root      20  0 15808 5256 3772 R    8  0.3  7:20.97 smbd

2624 root      20  0 81344  11m  708 S    6  0.6  11:54.21 shfs

2134 root      20  0    0    0    0 S    2  0.0  2:34.03 unraidd

  322 root      20  0    0    0    0 S    1  0.0  1:06.00 kswapd0

    1 root      20  0  700  328  288 S    0  0.0  0:01.61 init

    2 root      20  0    0    0    0 S    0  0.0  0:00.00 kthreadd

    3 root      RT  0    0    0    0 S    0  0.0  0:00.07 migration/0

    4 root      20  0    0    0    0 S    0  0.0  0:00.00 ksoftirqd/0

    5 root      RT  0    0    0    0 S    0  0.0  0:00.04 migration/1

    6 root      20  0    0    0    0 S    0  0.0  0:00.00 ksoftirqd/1

    7 root      20  0    0    0    0 S    0  0.0  0:00.00 events/0

    8 root      20  0    0    0    0 S    0  0.0  0:00.00 events/1

    9 root      20  0    0    0    0 S    0  0.0  0:00.00 khelper

  14 root      20  0    0    0    0 S    0  0.0  0:00.00 async/mgr

  116 root      20  0    0    0    0 S    0  0.0  0:10.33 sync_supers

  118 root      20  0    0    0    0 S    0  0.0  0:00.00 bdi-default

  120 root      20  0    0    0    0 S    0  0.0  0:00.00 kblockd/0

 

----

 

TOP result (after the copy was completed)

 

top - 09:25:38 up  7:49,  1 user,  load average: 1.35, 1.52, 1.71

Tasks:  98 total,  1 running,  97 sleeping,  0 stopped,  0 zombie

Cpu(s):  0.0%us,  0.0%sy,  0.0%ni, 50.0%id, 50.0%wa,  0.0%hi,  0.0%si,  0.0%st

Mem:  2073808k total,  1524112k used,  549696k free,  414604k buffers

Swap:        0k total,        0k used,        0k free,  903780k cached

 

  PID USER      PR  NI  VIRT  RES  SHR S %CPU %MEM    TIME+  COMMAND

    1 root      20  0  700  328  288 S    0  0.0  0:01.61 init

    2 root      20  0    0    0    0 S    0  0.0  0:00.00 kthreadd

    3 root      RT  0    0    0    0 S    0  0.0  0:00.07 migration/0

    4 root      20  0    0    0    0 S    0  0.0  0:00.00 ksoftirqd/0

    5 root      RT  0    0    0    0 S    0  0.0  0:00.04 migration/1

    6 root      20  0    0    0    0 S    0  0.0  0:00.00 ksoftirqd/1

    7 root      20  0    0    0    0 S    0  0.0  0:00.00 events/0

    8 root      20  0    0    0    0 S    0  0.0  0:00.00 events/1

    9 root      20  0    0    0    0 S    0  0.0  0:00.00 khelper

  14 root      20  0    0    0    0 S    0  0.0  0:00.00 async/mgr

  116 root      20  0    0    0    0 S    0  0.0  0:10.33 sync_supers

  118 root      20  0    0    0    0 S    0  0.0  0:00.00 bdi-default

  120 root      20  0    0    0    0 S    0  0.0  0:00.00 kblockd/0

  121 root      20  0    0    0    0 S    0  0.0  0:00.00 kblockd/1

  122 root      20  0    0    0    0 S    0  0.0  0:00.00 kacpid

  123 root      20  0    0    0    0 S    0  0.0  0:00.00 kacpi_notify

  124 root      20  0    0    0    0 S    0  0.0  0:00.00 kacpi_hotplug

 

Link to comment

As I remember, Just a few for management only, no other plugin that related to video-transcode that need a lot of CPU was installed.

 

1) unMenu

2) socat

3) unraid_notify

4) powerdown

5) apcupsd

6) over-temp shutdown

7) cache_dirs

 

 

-----

I attache the GO script for your information

 

#!/bin/bash

# Start the Management Utility

/usr/local/sbin/emhttp &

installpkg /boot/package/socat-1.7.0.0-i486-2bj.tgz

installpkg /boot/package/unraid_notify-2.55-noarch-unRAID.tgz

unraid_notify start

cd /boot/packages && find . -name '*.auto_install' -type f -print | sort | xargs -n1 sh -c

/boot/unmenu/uu

sysctl -w kernel.poweroff_cmd="/sbin/powerdown"

/boot/cache_dirs  -B -w

Link to comment

Thanks a lot for your advice.

 

I tried disable cache_dirs by running "cache_dirs -q" in console to kill the job but it just the same.

OK. It looks like this is a real "who-done-it", and we've just ruled out the handyman.

 

Now, let's try to rule out an "inside job", in general. Disconnect the unRAID server's ethernet connection. Is the culrpit still active?

 

I'm suspecting not (but this is how us "detectives" do our job :)).

 

[re-establish the enet connection]

 

If it is an "outside job", let's see what part of town he's from. Individually (one by one), disconnect each client from your network and test for suspicious activity.

 

Report back, and we'll seek an indictment.

 

If, however, it is an inside job, we can call in CSI.

 

[While doing your sleuthing, try to get an exact timing of this activity--both the interval from one start to the next start, and the duration, in seconds (that "one or two minutes" won't hold up in court).]

 

-- UhClem  "Nick Danger, third eye"

 

Link to comment

Thanks a lot for your advice.

Actually I tried disconnect the network connection and the situation remain unchanged.

 

Without any lines related to this activity in the syslog, it is not easy to count the number of sec between each incident.

 

--------------------

 

Anyway I solved the problem by upgrading the system from 4.7 to 5.0b6.

 

After the upgrade, the problem disappear.

Link to comment

Archived

This topic is now archived and is closed to further replies.

×
×
  • Create New...