Web UI unresponsive

January 19, 20251 yr

Hi all,

I recently updated to 7.0. Sometimes the web UI works fine, but then after I come back in about half a day, it's unresponsive. I can load the login screen and log in okay, but then the main page doesn't load. It just spins and nothing shows up, and then it times out.

I can ssh in, and my VMs and Docker seem to be running fine. If I run "shutdown -r now" it never completes.

Syslog attached. I'm not sure where to start debugging this. (I see an XFS metadata error, which I'll try to fix once I can access the UI.)s

Thanks!

syslog.txt

Quote

January 19, 20251 yr

Community Expert

Check filesystem for disk2, but that should be unrelated, enable the syslog server and post that after a crash.

Quote

January 20, 20251 yr

Author

Thanks, but it doesn't crash. Like I said, I can log in and get the syslog (which I posted). The web UI simply stops responding.

Quote

January 28, 20251 yr

Author

I repaired the file system on sdb1. Unfortunately, I couldn't mount the disk and there was a transaction log. I had to delete that before the repair could complete.

Sadly, the server was wedged again when I woke up this morning. I guess I'll uninstall plugins next in case one of them is interacting poorly with the new version.

I tried to run "ps" to see if mover was running, and it wedged. "top" run fine, until I pressed "c" to show full command lines, at which point it wedged too. Weird. Here's the output:

top - 08:47:30 up 12:45,  1 user,  load average: 32.18, 32.16, 32.13
Tasks: 1106 total,   3 running, 1103 sleeping,   0 stopped,   0 zombie
%Cpu(s):  3.2 us,  6.4 sy,  0.0 ni, 90.1 id,  0.0 wa,  0.0 hi,  0.2 si,  0.0 st
MiB Mem : 128585.4 total,  12450.2 free,  73395.6 used,  44329.3 buff/cache
MiB Swap:      0.0 total,      0.0 free,      0.0 used.  55189.8 avail Mem

    PID USER      PR  NI    VIRT    RES    SHR S  %CPU  %MEM     TIME+ COMMAND
  15201 root      20   0   26.9g  24.2g  62720 R  85.1  19.3      9,41 qemu-system-x86
  12396 root      20   0  718856  88532   1024 S  82.8   0.1      7,21 shfs
  15936 root      20   0   26.4g  24.1g  62976 S  45.0  19.2 251:33.63 qemu-system-x86
  10318 root      20   0       0      0      0 S  24.5   0.0 152:57.24 unraidd0
1900849 root      20   0    5012   3804   1792 R  20.9   0.0   0:02.14 find
   8409 root      20   0       0      0      0 D  10.3   0.0  62:38.12 mdrecoveryd
  17930 root      20   0   22968  11356   9564 S   2.6   0.0  16:56.66 openvpn
    651 root       0 -20       0      0      0 I   1.7   0.0  10:49.35 kworker/7:1H-kblockd
    180 root      20   0       0      0      0 S   0.7   0.0   2:24.67 ksoftirqd/7
   8395 root      20   0  279972   5120   4352 S   0.7   0.0   3:00.61 emhttpd
  26415 nobody    20   0 4405024   2.8g   2.8g S   0.7   2.2  12:58.13 qbittorrent-nox
1900251 root      20   0    9944   4352   2304 R   0.7   0.0   0:00.19 top
     16 root      20   0       0      0      0 S   0.3   0.0   0:04.73 ksoftirqd/0
     17 root      20   0       0      0      0 I   0.3   0.0   1:16.21 rcu_preempt
  24198 root      20   0 1239296  17544   9728 S   0.3   0.0   0:04.16 containerd-shim
  29782 root      20   0   38324  27876   9956 S   0.3   0.0   0:05.05 supervisord
 429290 root      20   0  103728  32912  25856 S   0.3   0.0   1:11.43 device_list

The two qemu processes are my VMs. "shfs", "find" and "unraidd0" are using a lot of CPU. Maybe the parity check is still running in the background...

Quote

January 28, 20251 yr

You should post the full diagnostics. And also update all the plugins

Quote

January 29, 20251 yr

Author

Here are the diagnostics. All plugins and docker containers are updated. Thanks Squid!

I wonder if despite repairing the disk, when the parity check hits a "bad part" of the disk it locks everything up.

storage-diagnostics-20250128-2300.zip

Quote

January 29, 20251 yr

Community Expert

There appears to be a container constantly restarting, see if you can find which by looking at their uptimes.

Quote

January 31, 20251 yr

Author

I stopped that container. It's weird, because it was working before but now its appdata subdirectories are empty.

It locked up during the night last night. Here's the syslog from while I was asleep:

Jan 30 22:36:49 storage emhttpd: read SMART /dev/sdg
Jan 30 23:14:21 storage kernel: python3[24845]: segfault at 0 ip 0000000000000000 sp 000014f23108cd98 error 14 likely on CPU 4 (core 8, socket 0)
Jan 30 23:14:21 storage kernel: Code: Unable to access opcode bytes at 0xffffffffffffffd6.
Jan 30 23:38:29 storage emhttpd: spinning down /dev/sdd
Jan 30 23:38:33 storage emhttpd: spinning down /dev/sdg
Jan 30 23:38:34 storage emhttpd: spinning down /dev/sdb
Jan 31 00:19:33 storage kernel: traps: sed[3214755] general protection fault ip:146dc212a427 sp:7ffcf5ce6270 error:0 in libc-2.40.so[146dc20ab000+17c000]
Jan 31 00:47:55 storage kernel: tdarr-ffmpeg[3298066]: segfault at 146e569f5000 ip 0000146e66c699c0 sp 0000146e569f2fc0 error 6 in libx265.so.199[146e66c2a000+e98000] likely on CPU 5 (core 8, socket 0)
Jan 31 00:47:55 storage kernel: Code: 83 c5 04 49 83 c4 08 53 48 8b bd 10 fe ff ff 48 83 c3 04 4c 89 95 18 fe ff ff e8 7b c6 ff ff 4c 8b 95 18 fe ff ff 48 63 53 fc <49> 01 12 59 5e 48 3b 9d 00 fe ff ff 74 12 48 8b 85 08 fe ff ff 48
Jan 31 02:04:32 storage emhttpd: read SMART /dev/sdd
Jan 31 02:04:42 storage emhttpd: read SMART /dev/sdg
Jan 31 03:12:10 storage emhttpd: spinning down /dev/sdg
Jan 31 03:12:28 storage emhttpd: spinning down /dev/sdd
Jan 31 04:43:30 storage rc.nginx: Reloading Nginx server daemon...
Jan 31 04:43:31 storage rc.nginx: Checking configuration for correct syntax and then trying to open files referenced in configuration...
Jan 31 04:43:31 storage rc.nginx: /usr/sbin/nginx -t -c /etc/nginx/nginx.conf
Jan 31 04:43:31 storage rc.nginx: Reloading Nginx server daemon configuration...
Jan 31 04:43:31 storage nginx: 2025/01/31 04:43:31 [alert] 14705#14705: *141820 open socket #4 left in connection 12
Jan 31 04:43:31 storage nginx: 2025/01/31 04:43:31 [alert] 14705#14705: aborting
Jan 31 05:00:01 storage Plugin Auto Update: Checking for available plugin updates
Jan 31 05:00:05 storage Plugin Auto Update: Community Applications Plugin Auto Update finished
Jan 31 07:19:55 storage emhttpd: spinning down /dev/sdc
Jan 31 08:30:13 storage emhttpd: read SMART /dev/sdc

I see several segfaults on core 8, socket 0. I've had stability issue before but they seemed to go away. It's an Intel i9-13900K. Looks like there's a new bios. I'll try that. I'll also try disabling that core in the bios if I can.

Quote

February 1, 20251 yr

Author

No luck with setting the CPU voltages as recommended by Intel. Or I should say I'm not seeing segfaults so far, but my web UI is still hanging.

There's this in the syslog:

Feb  1 00:37:53 storage nginx: 2025/02/01 00:37:53 [error] 15024#15024: *958 upstream timed out (110: Connection timed out) while reading response header from upstream, client: 10.22.33.101, server: , request: "GET /Main HTTP/1.1", subrequest: "/auth-request.php", upstream: "fastcgi://unix:/var/run/php-fpm.sock", host: "storage.coppit.org", referrer: "http://storage.coppit.org/login"
Feb  1 00:37:53 storage nginx: 2025/02/01 00:37:53 [error] 15024#15024: *958 auth request unexpected status: 504 while sending to client, client: 10.22.33.101, server: , request: "GET /Main HTTP/1.1", host: "storage.coppit.org", referrer: "http://storage.coppit.org/login"

Who is the upstream in this context? emhttp? Syslog attached.

syslog

Quote

February 1, 20251 yr

Community Expert

Those nginx errors should not be a big deal, but if you keep seeing segfaults those are usually RAM or CPU related.

Quote

February 1, 20251 yr

Author

Aren’t the NGINX errors exactly the problem? I log in, the main page tries to load for several minutes, and after a timeout I get an error in my browser and NGINX errors in the log.

I’m not sure I mentioned it, but earlier I had tried to “pskill nginx” and the command hanged.

Quote

February 1, 20251 yr

Community Expert

Could be, I was referring to these:

10 hours ago, coppit said:

Jan 31 04:43:31 storage nginx: 2025/01/31 04:43:31 [alert] 14705#14705: *141820 open socket #4 left in connection 12
Jan 31 04:43:31 storage nginx: 2025/01/31 04:43:31 [alert] 14705#14705: aborting

Does the issue also happen if you boot in safe mode?

Quote

February 7, 20251 yr

Author
Solution

On 2/1/2025 at 4:05 AM, JorgeB said:

Does the issue also happen if you boot in safe mode?

I'm not sure. Sometimes it takes nearly a day to show itself, and I didn't run it long enough in safe mode to know.

Here's what I think is happening... Hardware issues are causing segfaults, which cause some component of the web UI to fail, which then causes the browser timeouts. So basically not an UNRAID problem.

So given the instabilities of the recent Intel CPUs, yesterday I replaced my motherboard and CPU with an AMD setup. So far no GUI hangs, but I see these in the syslog:

Feb  7 00:19:32 storage kernel: ccextractor[528241]: segfault at 0 ip 000014d9771f2384 sp 00007fff1fd35308 error 4 in libc.so.6[14d977193000+195000] likely on CPU 9 (core 9, socket 0)
Feb  7 00:19:32 storage kernel: Code: 5d c3 0f 1f 40 00 8b 47 04 83 e2 cf 8d 48 01 89 4f 04 89 13 eb bb e8 bb 9e 00 00 eb 92 66 0f 1f 84 00 00 00 00 00 f3 0f 1e fa <8b> 17 f6 47 74 80 74 74 89 d0 25 00 80 00 00 75 6b 55 53 48 89 fb
Feb  7 00:22:55 storage kernel: ccextractor[543229]: segfault at 0 ip 000015540a56c384 sp 00007ffc97191298 error 4 in libc.so.6[15540a50d000+195000] likely on CPU 11 (core 11, socket 0)
Feb  7 00:22:55 storage kernel: Code: 5d c3 0f 1f 40 00 8b 47 04 83 e2 cf 8d 48 01 89 4f 04 89 13 eb bb e8 bb 9e 00 00 eb 92 66 0f 1f 84 00 00 00 00 00 f3 0f 1e fa <8b> 17 f6 47 74 80 74 74 89 d0 25 00 80 00 00 75 6b 55 53 48 89 fb

But it looks like a known issue with Plex or some other subtitle app.

We'll see how things hold up after a few days, but it looks like I'm in the clear. Thanks for all the help!

Quote

1

Web UI unresponsive

Featured Replies

Solved by coppit

Join the conversation

Account

Navigation

Search

Configure browser push notifications

Chrome (Android)

Chrome (Desktop)

Safari (iOS 16.4+)

Safari (macOS)

Edge (Android)

Edge (Desktop)

Firefox (Android)

Firefox (Desktop)