July 4, 2025
Unraid system that's been running since the early 6.x days; I apply upgrades within a month or so of release. Currently running 7.1.4.

Recently the system has started going offline: no web GUI and no access to Docker containers or SMB shares. It still responds to ping and allows SSH, and local console access works too. I tried restarting nginx, no change. I have to reboot from the local console, because rebooting over SSH seems to hang the SSH session without actually rebooting.

I enabled an external syslog collector, and I don't see anything. I rebooted it at 8:22 local time in the attached logs.

Anyone have any clue?

messages.zip tower-diagnostics-20250704-0835.zip

Edited July 4, 2025 by Ozbarge
July 4, 2025
Reboot was between these lines, correct?

2025-07-04T06:14:17.000+00:00 172.17.0.1 HOST=172.17.0.1 HOST_FROM=172.17.0.1 LEGACY_MSGHDR="Docker " MESSAGE="Auto Update: No updates will be installed" MSGFORMAT=rfc3164 PROGRAM=Docker SOURCE=s_network_udp TRANSPORT=rfc5426
2025-07-04T08:21:48.000+00:00 172.17.0.1 HOST=172.17.0.1 HOST_FROM=172.17.0.1 LEGACY_MSGHDR="login: " MESSAGE="pam_unix(login:session): session opened for user root(uid=0) by LOGIN(uid=0)" MSGFORMAT=rfc3164 PROGRAM=login SOURCE=s_network_udp TRANSPORT=rfc5426
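For anyone doing the same check on their own collector output, here is a minimal Python sketch that finds the longest silent stretch in a log, which is usually where the hang and reboot sit. It assumes every line starts with an ISO 8601 timestamp followed by a space, as in the two lines quoted above. The function name `largest_gap` is made up for illustration and is not part of any Unraid or syslog tooling.

```python
from datetime import datetime


def largest_gap(lines):
    """Return (gap_seconds, line_before, line_after) for the longest
    pause between consecutive timestamped lines, or None if fewer than
    two lines carry a parseable timestamp."""
    stamped = []
    for line in lines:
        # Assumption: the timestamp is the first space-separated token.
        token = line.split(" ", 1)[0]
        try:
            ts = datetime.fromisoformat(token)
        except ValueError:
            continue  # skip lines without a leading ISO timestamp
        stamped.append((ts, line))

    best = None
    for (t0, l0), (t1, l1) in zip(stamped, stamped[1:]):
        gap = (t1 - t0).total_seconds()
        if best is None or gap > best[0]:
            best = (gap, l0, l1)
    return best


if __name__ == "__main__":
    sample = [
        "2025-07-04T06:14:17.000+00:00 172.17.0.1 PROGRAM=Docker",
        "2025-07-04T08:21:48.000+00:00 172.17.0.1 PROGRAM=login",
    ]
    print(largest_gap(sample))
```

To run it against a real file, pass the file's lines instead of `sample`, for example `largest_gap(open(path))` with `path` pointing at the extracted log.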
July 4, 2025 Author
Yes, that is correct. It just… drops off, it seems, but I could still ping the server before the reboot; no other services would respond.
July 5, 2025
Unfortunately, there's nothing relevant logged. This can also be a hardware issue. One thing you can try is to boot the server in safe mode with all Docker containers and VMs disabled, and let it run as a basic NAS for a few days. If it still crashes, it's likely a hardware problem. If it doesn't, start turning the other services back on one by one, including the individual Docker containers.

Additionally, look in the BIOS for a "Global C-States" or similar setting and disable it to retest. It's been known to be a problem with some boards, with both Intel and AMD CPUs.
July 18, 2025 Author
Disabled C-States and disabled most Docker containers, and it still locked up overnight after ~3 days. The local CLI was still responsive, so I was able to save the syslog to another location before a graceful reboot via the local CLI.

Could I bother you to take another look before I decide to buy new hardware, re-install a fresh copy of Unraid, or just turn it off and walk away for a while?

syslog tower-diagnostics-20250718-0819.zip
July 18, 2025
Other than some NUT UPS errors that are not relevant here, I'm afraid there's still nothing out of the ordinary logged that would explain the server crashing.

My recommendation would still be to see if it crashes with services disabled. If it does, it's almost certainly hardware. If it doesn't, you can check whether one of the containers is causing the issue; some containers have been known to crash servers.
July 18, 2025
6 hours ago, Ozbarge said:
Disabled C-States, disabled most Docker containers, and it still locked up overnight after ~3 days. Local CLI was still responsive. Was able to save syslog to another location before graceful reboot via local CLI. Could I bother you to take another look before I decide to buy new hardware, re-install a fresh copy of UNRAID, or just turn it off and walk away for a while? syslog tower-diagnostics-20250718-0819.zip

My previous thread had the same symptoms, but mine usually happened within a few hours. The only thing that worked for me was to rebuild the flash drive and copy over only the bare minimum of files needed to preserve disk assignments and Docker containers. I don't have VMs, so I'm not sure what is required to keep those.
July 19, 2025 Author
5 hours ago, jmztaylor said:
My previous thread had the same symptoms. But mine was usually within a few hours. The only thing that worked for me was to rebuild the flash drive and only copying the bare minimum files over needed to preserve disk assignments and docker containers. I don't have VMs so not sure what is required to keep those.

Can you tell me what I need to copy over to a fresh install of Unraid to keep disk assignments and Docker containers? I don't have VMs either, at least none that I care about.
July 19, 2025 Author
Disregard. I think... I think it's the Realtek 8125B. I recently moved my server's uplink to a 2.5 Gbps SFP-to-Ethernet adapter. When I couldn't reach the web page or Docker containers but could still ping the server, it obviously couldn't be the network... except that when I moved it back to a 1 Gbps port, the web page and Docker services came back online instantly from my phone and desktop. I just installed the Realtek 8125B driver from Community Apps, so fingers crossed. Now to wait for anywhere from 1 hour to 4.5 days...
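In case it helps anyone chasing the same "pings but nothing else answers" symptom, here is a rough Python sketch for checking from another machine which TCP services still accept connections while the server is in the bad state. It is only a sketch under assumptions: the host name `tower` and the port list (22 for SSH, 80 and 443 for the web GUI, 445 for SMB) are common defaults and may not match a given setup, and `probe` is just an illustrative helper name.

```python
import socket


def probe(host, ports, timeout=2.0):
    """Try a plain TCP connect to each port and report which ones accept.

    Returns a dict mapping port -> True (connected) or False
    (refused, timed out, or otherwise unreachable)."""
    results = {}
    for port in ports:
        try:
            with socket.create_connection((host, port), timeout=timeout):
                results[port] = True
        except OSError:
            results[port] = False
    return results


if __name__ == "__main__":
    # Assumed host name and ports; adjust to the actual server.
    for port, ok in probe("tower", [22, 80, 443, 445]).items():
        print(f"port {port}: {'open' if ok else 'no response'}")
```

A successful connect only shows that something is listening on the port, not that the service behind it is healthy. Still, running it once over each network path (the 2.5 Gbps link and the 1 Gbps port) gives a quick record of what was reachable at the time of the hang.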