OwenT Posted March 13

This is my third time coming to the forums for this issue; fingers crossed I can find a solution this time.

My specs are:
AMD Ryzen 5 1400
ASRock A320M-DGS
16 GiB RAM

After between 24 hours and 2 weeks of uptime, my server will fully hang. The web UI stops responding, all docker containers stop, and there is no output if I have a monitor plugged in (this CPU has no onboard display output; I used to have a GPU in the system, but I removed it to eliminate it as a cause of the issues). The only way to recover from this crash is to hold the power button until the mobo turns off, then reboot.

Previous investigation led to it being connected to ipvlan/macvlan, but I've tried both with no change. In the syslog right before the crash I'm seeing an error, but I'm not sure what it could be.

Mar 12 03:07:07 xephyr smbd[25757]: #25 /lib64/libc.so.6(__libc_start_main+0x85) [0x1481d7602775]
Mar 12 03:07:07 xephyr smbd[25757]: #26 /usr/sbin/smbd(_start+0x21) [0x5576b5babb31]
Mar 12 03:07:07 xephyr smbd[25757]: [2024/03/12 03:07:07.456711, 0] ../../source3/lib/dumpcore.c:315(dump_core)
Mar 12 03:07:07 xephyr smbd[25757]: dumping core in /var/log/samba/cores/smbd
Mar 12 03:07:07 xephyr smbd[25757]:
Mar 12 04:00:39 xephyr kernel: md: sync done. time=57557sec
Mar 12 04:00:39 xephyr kernel: md: recovery thread: exit status: 0
Mar 12 04:36:30 xephyr kernel: TCP: request_sock_TCP: Possible SYN flooding on port 39519. Sending cookies. Check SNMP counters.
Mar 12 04:40:02 xephyr root: Fix Common Problems Version 2024.02.29
Mar 12 04:40:03 xephyr root: Fix Common Problems: Warning: Unraid OS not up to date
Mar 12 05:31:41 xephyr kernel: TCP: request_sock_TCP: Possible SYN flooding on port 20683. Sending cookies. Check SNMP counters.
Mar 12 07:08:28 xephyr kernel: TCP: request_sock_TCP: Possible SYN flooding on port 57702. Sending cookies. Check SNMP counters.
Mar 12 13:13:52 xephyr kernel: TCP: request_sock_TCP: Possible SYN flooding on port 56787. Sending cookies. Check SNMP counters.
Mar 12 13:28:48 xephyr kernel: TCP: request_sock_TCP: Possible SYN flooding on port 52846. Sending cookies. Check SNMP counters.
Mar 12 16:29:29 xephyr kernel: traps: smartctl[32392] general protection fault ip:154c136d28e4 sp:7ffc40f06998 error:0 in libc-2.37.so[154c13610000+169000]
Mar 12 17:03:04 xephyr kernel: docker0: port 1(veth91a2379) entered disabled state
Mar 12 17:03:04 xephyr kernel: veth56ef535: renamed from eth0
Mar 12 17:03:05 xephyr kernel: docker0: port 1(veth91a2379) entered disabled state
Mar 12 17:03:05 xephyr kernel: device veth91a2379 left promiscuous mode
Mar 12 17:03:05 xephyr kernel: docker0: port 1(veth91a2379) entered disabled state
Mar 12 17:03:06 xephyr kernel: docker0: port 1(veth573e246) entered blocking state
Mar 12 17:03:06 xephyr kernel: docker0: port 1(veth573e246) entered disabled state
Mar 12 17:03:06 xephyr kernel: device veth573e246 entered promiscuous mode
Mar 12 17:03:10 xephyr kernel: eth0: renamed from vethd75fbd0
Mar 12 17:03:10 xephyr kernel: IPv6: ADDRCONF(NETDEV_CHANGE): veth573e246: link becomes ready
Mar 12 17:03:10 xephyr kernel: docker0: port 1(veth573e246) entered blocking state
Mar 12 17:03:10 xephyr kernel: docker0: port 1(veth573e246) entered forwarding state
Mar 12 22:13:26 xephyr root: Delaying execution of fix common problems scan for 10 minutes

Specifically this line:

Mar 12 16:29:29 xephyr kernel: traps: smartctl[32392] general protection fault ip:154c136d28e4 sp:7ffc40f06998 error:0 in libc-2.37.so[154c13610000+169000]

My best guess would be a hardware failure; however, this exact hardware was running fine as a gaming PC for several years with no issues. Every single component has been reseated, new thermal compound applied, etc. I have done multiple Unraid version upgrades and docker updates, and meters found zero problems.

I've attached the diagnostics. I will be trying a new USB drive, but I just wanted to get this thread going while I wait for that to arrive. Any ideas before I just buy a new server?

xephyr-diagnostics-20240313-0026.zip
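
For anyone wanting to pull lines like the smartctl fault out of a long syslog, here is a minimal Python sketch. It is only an illustration: the regular expression is modelled on the single `traps:` line quoted in this post, so it may miss other kernel message formats, and the names `TRAP_RE` and `find_traps` are made up for this example. It also works out where the faulting instruction sits inside the mapped library by subtracting the mapping base from the `ip` value.

```python
import re

# Pattern modelled on the kernel "traps:" line quoted in the post.
# Assumption: other kernel versions may format this message differently.
TRAP_RE = re.compile(
    r"kernel: traps: (?P<proc>[^\[\s]+)\[(?P<pid>\d+)\] "
    r"(?P<fault>.+?) ip:(?P<ip>[0-9a-f]+) sp:(?P<sp>[0-9a-f]+) "
    r"error:(?P<error>[0-9a-f]+)"
    r"(?: in (?P<module>[^\[\s]+)\[(?P<base>[0-9a-f]+)\+(?P<size>[0-9a-f]+)\])?"
)


def find_traps(lines):
    """Yield one dict per kernel trap line found in an iterable of syslog lines."""
    for line in lines:
        match = TRAP_RE.search(line)
        if match is None:
            continue
        info = match.groupdict()
        # Offset of the faulting instruction inside the mapped module, if reported.
        if info["base"] is not None:
            info["offset"] = hex(int(info["ip"], 16) - int(info["base"], 16))
        yield info


# Two lines copied from the syslog excerpt in the post.
sample = [
    "Mar 12 13:28:48 xephyr kernel: TCP: request_sock_TCP: Possible SYN flooding "
    "on port 52846. Sending cookies. Check SNMP counters.",
    "Mar 12 16:29:29 xephyr kernel: traps: smartctl[32392] general protection fault "
    "ip:154c136d28e4 sp:7ffc40f06998 error:0 in libc-2.37.so[154c13610000+169000]",
]

traps = list(find_traps(sample))
for trap in traps:
    print(trap["proc"], trap["pid"], trap["fault"], trap["module"], trap.get("offset"))
```

To scan a real log, the same function can be fed an open file object instead of the `sample` list. Faults that show up in unrelated processes tend to point towards hardware or platform instability rather than a bug in one program, though that is a rule of thumb and not a diagnosis.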
JonathanM Posted March 13 (Solution)

Have you worked through the steps here?
https://forums.unraid.net/topic/46802-faq-for-unraid-v6/page/2/#comment-819173
Hoopster Posted March 13

25 minutes ago, OwenT said:
My specs are: AMD Ryzen 5 1400

First generation Ryzen was particularly susceptible to freezing/crashing in Linux because of the issues addressed in the link provided above by JonathanM.
OwenT (Author) Posted March 13

Thanks both. I've disabled C-states, and I'll update here if it crashes again.
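
As a rough cross-check from the OS side, the idle states the Linux kernel currently exposes can be listed from sysfs. The sketch below reads the standard `cpuidle` directory for one CPU; the function name `list_idle_states` is made up for this example. It is an assumption that a BIOS-level C-state setting will be reflected in this list on any given board, and the `disable` flag shown here is the kernel's own per-state toggle, not the BIOS option, so treat the output as a hint only.

```python
from pathlib import Path


def list_idle_states(cpu_dir):
    """Return (name, disabled) pairs for each cpuidle state under one CPU's sysfs directory."""
    state_dirs = sorted(
        Path(cpu_dir, "cpuidle").glob("state[0-9]*"),
        key=lambda p: int(p.name[len("state"):]),  # numeric order: state2 before state10
    )
    states = []
    for state in state_dirs:
        name = (state / "name").read_text().strip()
        # The kernel writes "0" when the state is usable and a non-zero value when it is disabled.
        disabled = (state / "disable").read_text().strip() != "0"
        states.append((name, disabled))
    return states


if __name__ == "__main__":
    # cpu0 is used as a representative core; the list is empty if cpuidle is not exposed.
    for name, disabled in list_idle_states("/sys/devices/system/cpu/cpu0"):
        print(f"{name}: {'disabled' if disabled else 'enabled'}")
```

Running it before and after changing the BIOS setting and comparing the two lists is probably more informative than a single run.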
OwenT (Author) Posted March 27

Two weeks with no crashes, so I'm going to mark that as the solution. Thanks for the help!