July 21, 2024

I recently upgraded my old server from a Supermicro with a Xeon E3 to a Gigabyte Z790 with an i5-13500K. Other than adding more memory and some additional NVMe drives for cache, all the other components (hard drives, flash drives, Docker containers, and plugins) remain the same.

The Supermicro server was very stable and rarely crashed. Unfortunately, the new hardware has been experiencing random reboots. The longest it has lasted without crashing is about a day, but sometimes it crashes after just 30 minutes or a few hours.

I ran a memtest for over 24 hours on both memory sticks without any errors. My next step is to run with one memory stick at a time to see if that resolves the issue.

Thankfully, no data is being written to the array, since everything is stored on the cache drives for now. However, I'm worried about crashes occurring when the mover eventually writes to the array.

There doesn't seem to be any clear cause of the reboots in the syslog. I've set up a remote syslog server because the local logs are not saved when the system crashes. Even there, the system appears to be operating normally right up to the time of the crash, and the log simply picks up again after the reboot.

I would appreciate it if some experts could review the diagnostics and suggest further troubleshooting steps. Thank you!

tower-diagnostics-20240720-2122.zip

Edited July 21, 2024 by arcane

July 21, 2024 (Community Expert)

The power supply would be my first suspect; a bent CPU LGA pin would be my second.