March 5, 2025

Hi all,

For the past few months my Unraid server has occasionally become inaccessible, both via the web GUI and via SSH, while all services (Docker containers and VMs) keep working. When SSH does work (it sometimes does, but logging in takes forever), I can see the CPU is under heavy load, with php-fpm at 100% CPU and the Docker daemon at 400%. Any Docker command, such as docker ps, is completely unresponsive.

I have a Netdata container installed to track everything, but it doesn't tell me which process is using all that CPU. Also, the CPU is "only" at 500% when this happens, and it's a 16-thread CPU, so that should be plenty for a few containers that idle most of the time.

I can't pin down what it is, so any suggestions are welcome. It all started after I upgraded to Unraid 7.0 and switched to ZFS pools at the same time, so I can't really downgrade to check whether the cause is the Unraid version or the ZFS pools.

I've attached my diagnostics. Thank you in advance for your time, and ask me if you need any more info.

thematrix-diagnostics-20250305-1407.zip
March 5, 2025 Community Expert

Mar 5 14:06:13 TheMatrix php-fpm[17548]: [WARNING] [pool www] child 2823009 exited on signal 9 (SIGKILL) after 70.022560 seconds from start
Mar 5 14:06:15 TheMatrix php-fpm[17548]: [WARNING] [pool www] child 2823033 exited on signal 9 (SIGKILL) after 70.005531 seconds from start
Mar 5 14:07:03 TheMatrix php-fpm[17548]: [WARNING] [pool www] child 2824185 exited on signal 9 (SIGKILL) after 78.037543 seconds from start
Mar 5 14:08:13 TheMatrix php-fpm[17548]: [WARNING] [pool www] child 2823114 exited on signal 9 (SIGKILL) after 183.891123 seconds from start

In my experience, these errors can result from the server coming close to exhausting its memory. The GUI can become extremely slow, taking something like a minute to open the dashboard. Try limiting the memory for VMs and Docker services, or add a little more RAM.

It could also be one or more containers hogging the CPU. Try pinning them to only some cores, leaving cores 0/1 available for Unraid.
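To see how long each php-fpm child ran before it was killed, the warnings can be pulled out of the syslog with a short script. This is only a sketch: the regular expression is written against the four lines quoted above, and the names `CHILD_EXIT` and `killed_children` are made up for this example, not part of php-fpm or Unraid.

```python
import re

# Pattern written against the php-fpm warnings quoted in this thread;
# other php-fpm versions may word the message differently.
CHILD_EXIT = re.compile(
    r"child (?P<pid>\d+) exited on signal (?P<signal>\d+) \((?P<name>\w+)\) "
    r"after (?P<seconds>\d+\.\d+) seconds from start"
)


def killed_children(log_text):
    """Return (pid, signal name, runtime in seconds) for each matching line."""
    results = []
    for line in log_text.splitlines():
        match = CHILD_EXIT.search(line)
        if match:
            results.append(
                (int(match["pid"]), match["name"], float(match["seconds"]))
            )
    return results


# The four syslog lines from the post above, copied verbatim.
SAMPLE = """\
Mar 5 14:06:13 TheMatrix php-fpm[17548]: [WARNING] [pool www] child 2823009 exited on signal 9 (SIGKILL) after 70.022560 seconds from start
Mar 5 14:06:15 TheMatrix php-fpm[17548]: [WARNING] [pool www] child 2823033 exited on signal 9 (SIGKILL) after 70.005531 seconds from start
Mar 5 14:07:03 TheMatrix php-fpm[17548]: [WARNING] [pool www] child 2824185 exited on signal 9 (SIGKILL) after 78.037543 seconds from start
Mar 5 14:08:13 TheMatrix php-fpm[17548]: [WARNING] [pool www] child 2823114 exited on signal 9 (SIGKILL) after 183.891123 seconds from start
"""

if __name__ == "__main__":
    for pid, name, seconds in killed_children(SAMPLE):
        print(f"child {pid} received {name} after {seconds:.1f} s")
```

Running the same function over the full syslog from the diagnostics archive would show whether the kills cluster around the times the GUI becomes unresponsive.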
March 5, 2025 Author

1 hour ago, JorgeB said:
In my experience, these errors can be the result of the server being close to exhausting the memory, GUI can become extremely slow, like 1 minute to open the dashboard, try limiting the memory for VMs/docker services, or adding a little more RAM. Could also be one or more containers hogging the CPU, try pinning only some cores to them, and leave cores 0/1 available for Unraid.

Thank you for responding. I'm not sure it's RAM, because I always have 10 GB or more free out of 48 GB total, and I'm not using anything fancy for ZFS like a special pool or metadata pool. I've also already pinned my Docker containers and VM and left cores 0/1 free for Unraid.
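One way to put a number on the "10 GB free out of 48 GB" observation is to read `MemTotal` and `MemAvailable` from `/proc/meminfo`. The sketch below parses text in that format; the helper names are hypothetical, and the sample values are invented to mirror the figures in the post rather than taken from the actual server.

```python
def parse_meminfo(text):
    """Parse /proc/meminfo-style text into {field name: value in kB}."""
    values = {}
    for line in text.splitlines():
        name, _, rest = line.partition(":")
        parts = rest.split()
        if parts and parts[0].isdigit():
            values[name.strip()] = int(parts[0])
    return values


def available_fraction(text):
    """Fraction of total memory that the kernel reports as available."""
    info = parse_meminfo(text)
    return info["MemAvailable"] / info["MemTotal"]


# Illustrative values only: 48 GiB total and 10 GiB available, in kB.
SAMPLE = """\
MemTotal:       50331648 kB
MemFree:         2097152 kB
MemAvailable:   10485760 kB
"""

if __name__ == "__main__":
    # On a live Linux system, pass open("/proc/meminfo").read() instead.
    print(f"{available_fraction(SAMPLE):.1%} of memory available")
```

Logging this value every few seconds during a slowdown would show whether memory really stays flat while the GUI hangs.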
March 10, 2025 Author

Quick update: I've tried docker system prune. It removed 16 GB from docker.img (which was almost 50 GB). Since it's always dockerd that is stuck in iowait, maybe this will fix it. I'll report back on whether it made any difference.

I have 30 Docker containers. I don't know if that's a lot for an i9-9900K and 48 GB of RAM, but I don't think so, because they're almost all idle most of the time anyway.
March 10, 2025 Author

I have found another potential hint in my research. When I run iostat -xm 1, the flash drive is the only drive shown in red while the slowdowns are happening. Maybe I need to replace my flash drive. But I thought Unraid runs from RAM, so why does it, at seemingly random times, tie up the whole CPU in iowait trying to read from the flash drive?
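The same check can be scripted so the busy device is recorded even when nobody is watching the terminal. Below is a minimal sketch that scans `iostat -x` style text for devices whose `%util` reaches a threshold. It finds the column by its header name, because the column layout differs between sysstat versions. The function name, the 90% threshold, and the sample report (trimmed to a few columns, with invented values and device names) are all assumptions for illustration.

```python
def busy_devices(iostat_text, util_threshold=90.0):
    """Return {device: highest %util seen} for devices at or above the threshold."""
    columns = None
    busy = {}
    for line in iostat_text.splitlines():
        fields = line.split()
        if not fields:
            columns = None  # a blank line ends the current device table
            continue
        if fields[0].rstrip(":") == "Device":
            columns = fields  # remember the header of this device table
            continue
        if columns is None or "%util" not in columns:
            continue  # not inside a device table (e.g. the avg-cpu block)
        if len(fields) != len(columns):
            continue  # skip rows that do not line up with the header
        # Some locales print a decimal comma, so normalise before parsing.
        util = float(fields[columns.index("%util")].replace(",", "."))
        if util >= util_threshold:
            busy[fields[0]] = max(util, busy.get(fields[0], 0.0))
    return busy


# Hypothetical, shortened report: real iostat -xm output has more columns.
SAMPLE = """\
avg-cpu:  %user   %nice %system %iowait  %steal   %idle
           2.10    0.00    1.50   31.00    0.00   65.40

Device            r/s     rMB/s     w/s     wMB/s   %util
sda             45.00      0.30    0.00      0.00   99.60
nvme0n1         12.00      1.20   30.00      2.50    4.10
"""

if __name__ == "__main__":
    for device, util in busy_devices(SAMPLE).items():
        print(f"{device} is at {util:.1f}% utilisation")
```

Feeding it the captured output of a few `iostat -xm 1` intervals during a slowdown would confirm whether the flash drive is consistently the only saturated device.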
March 10, 2025 Author Solution

I think I've solved it with the fix described in the Unraid 7.0.0 release notes: https://docs.unraid.net/unraid-os/release-notes/7.0.0/#excessive-flash-drive-activity-slows-the-system-down