drsprite Posted April 25, 2021 Share Posted April 25, 2021 Hi all, I'm running the trial version of unraid to get a feel for it. So far so good except every 2 days or so, the system locks up entirely. The web ui is functional to a certain degree but this is my indication that the underlying system is locked up. I don't know what's going on - hopefully these diagnostics after my forced reboot captured something? unraid-diagnostics-20210425-1033.zip Quote Link to comment
JorgeB Posted April 26, 2021 Share Posted April 26, 2021 Syslog starts over after every reboot, enable this then post that log after a crash. 1 Quote Link to comment
drsprite Posted April 27, 2021 Author Share Posted April 27, 2021 Today I performed a clean reboot, and the lockup happened just after 14 minutes of uptime. My ssh session, sftp session were locked up, so I went to the web ui and saw the CPU utilization panel details weren't moving. I had to press the power button which started a graceful shutdown, however it got stuck at this screen below. I gave it about 10 minutes before I just unplugged the server. Things to note: Today I rebooted because I added 2 new disks. 1 for a storage pool, and 1 to expand the array. However this type of lockup has happened previously when I wasn't expanding the disks. What's interesting is the SFTP attempted to re-connect well after the runlevel 0 happened. Attached are the syslog reports. graylog-search-result-relative-0.csv Quote Link to comment
JorgeB Posted April 27, 2021 Share Posted April 27, 2021 I do see some nginx issues, but those are outside my wheelhouse: 2021-04-27T11:21:19.000Z,"unraid nginx: 2021/04/27 11:21:19 [error] 11092#11092: *4947 upstream timed out (110: Connection timed out) while reading response header from upstream, client: 192.168.0.13, server: , request: ""POST /webGui/include/Boot.php HTTP/1.1"", upstream: ""fastcgi://unix:/var/run/php5-fpm.sock"", host: ""unraid:7000"", referrer: ""http://unraid:7000/Dashboard""" 2021-04-27T11:21:19.000Z,unraid nginx: 2021/04/27 11:21:19 [alert] 11092#11092: *4809 open socket #25 left in connection 10 2021-04-27T11:21:19.000Z,unraid nginx: 2021/04/27 11:21:19 [alert] 11092#11092: *5276 open socket #19 left in connection 9 2021-04-27T11:21:19.000Z,unraid nginx: 2021/04/27 11:21:19 [alert] 11092#11092: *5280 open socket #31 left in connection 13 2021-04-27T11:21:19.000Z,unraid nginx: 2021/04/27 11:21:19 [alert] 11092#11092: *3413 open socket #24 left in connection 8 2021-04-27T11:21:19.000Z,unraid nginx: 2021/04/27 11:21:19 [alert] 11092#11092: *3407 open socket #26 left in connection 18 2021-04-27T11:21:19.000Z,unraid nginx: 2021/04/27 11:21:19 [alert] 11092#11092: aborting Make sure you only have one browser windows open in the GUI, hopefully somebody else has more ideas. Quote Link to comment
drsprite Posted April 27, 2021 Author Share Posted April 27, 2021 (edited) Just happened again. Exact same behavior. Had to press the power button. It got stuck. Then I unplugged it. This time with an uptime of 20 minutes. graylog-search-result-relative-0 (1).csv Edited April 27, 2021 by drsprite Quote Link to comment
drsprite Posted April 27, 2021 Author Share Posted April 27, 2021 (edited) 1 hour ago, JorgeB said: I do see some nginx issues, but those are outside my wheelhouse: 2021-04-27T11:21:19.000Z,"unraid nginx: 2021/04/27 11:21:19 [error] 11092#11092: *4947 upstream timed out (110: Connection timed out) while reading response header from upstream, client: 192.168.0.13, server: , request: ""POST /webGui/include/Boot.php HTTP/1.1"", upstream: ""fastcgi://unix:/var/run/php5-fpm.sock"", host: ""unraid:7000"", referrer: ""http://unraid:7000/Dashboard""" 2021-04-27T11:21:19.000Z,unraid nginx: 2021/04/27 11:21:19 [alert] 11092#11092: *4809 open socket #25 left in connection 10 2021-04-27T11:21:19.000Z,unraid nginx: 2021/04/27 11:21:19 [alert] 11092#11092: *5276 open socket #19 left in connection 9 2021-04-27T11:21:19.000Z,unraid nginx: 2021/04/27 11:21:19 [alert] 11092#11092: *5280 open socket #31 left in connection 13 2021-04-27T11:21:19.000Z,unraid nginx: 2021/04/27 11:21:19 [alert] 11092#11092: *3413 open socket #24 left in connection 8 2021-04-27T11:21:19.000Z,unraid nginx: 2021/04/27 11:21:19 [alert] 11092#11092: *3407 open socket #26 left in connection 18 2021-04-27T11:21:19.000Z,unraid nginx: 2021/04/27 11:21:19 [alert] 11092#11092: aborting Make sure you only have one browser windows open in the GUI, hopefully somebody else has more ideas. It's happened again after about an hour. My docker containers appear to still be working. So 3 times today Yep, only 1 browser open. I'll be happy if it's just an nginx issue. How do I restart the web ui? Edit: ok not all containers are working, but some are. Could it be a disk issue? Edit again: I see this in the syslog right before the nginx timeouts occur. Apr 27 12:39:49 unraid kernel: sd 7:0:4:0: Power-on or device reset occurred Edited April 27, 2021 by drsprite Quote Link to comment
drsprite Posted April 29, 2021 Author Share Posted April 29, 2021 I think I had a bad P840 card. I replaced it and so far the last 2 days have been good. I will keep monitoring it. Quote Link to comment
Recommended Posts
Join the conversation
You can post now and register later. If you have an account, sign in now to post with your account.
Note: Your post will require moderator approval before it will be visible.