June 15, 2025
I keep having trouble with my server randomly rebooting. It can run anywhere from 3 days to 1-2 months between unclean shutdowns. It can happen at any time of day, with no apparent pattern in terms of load or usage.
I got a UPS to see if there was an issue with the power, as we have occasional blackouts and brownouts, but the issue remains. Almost always when I return to the server after such a reboot, two disks in the array appear to be missing, but when I reboot again they are back to normal.
I set the syslog to mirror to flash and have attached what I had in syslog and syslog-previous after the most recent reboot.
Could someone literate in these logs help me see if there's any obvious culprit? Or give any guidance on which subsequent tests I can run on hardware or settings?
Thanks in advance!
Hardware:
Gigabyte Z790 D DDR4
i7 13700k
2x32gb Crucial RAM
Attachments: syslog.txt, syslog-previous.txt, syslog-previous(5)
June 15, 2025
A server rebooting by itself is almost always a hardware issue. If you have multiple sticks of RAM, try running the server with just one; if it still reboots, try a different one. That will basically rule out bad RAM. It could also be a bad CPU affected by the Intel 13th/14th gen issue. Make sure the BIOS is up to date; it can sometimes help, unless the CPU is already too far gone.
June 15, 2025 Author
Thanks, @JorgeB - I appreciate your insight.
There was nothing you could spot in the logs that would indicate anything iffy with my unraid config, right?
In that case I will sequence the hardware troubleshooting by first updating the BIOS. It was last updated in September, which as far as I understand was after the supposed 13th gen fix was rolled out, so I will go from F11 to F13c in the hope that it might help.
For the RAM, would removing one stick, and then another, be preferable to letting the Live Memory Tester plugin run to see if it can diagnose anything?
The last resort seems to be seeing how willing Intel would be to honour their extended warranty!
June 16, 2025
16 hours ago, deathprod said:
For the RAM, would removing one stick, and then another, be preferable over letting the Live Memory Tester plugin run to see if it can diagnostize anything?
You can try memtest or similar first, but remember that it's only definitive if errors are found.
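For anyone who wants to pre-screen a mirrored syslog before posting it, here is a minimal sketch in Python. The pattern list is an assumption: it contains fragments that commonly show up in Linux kernel messages about machine checks, memory errors, and SATA link resets, and it is not exhaustive. The function name `find_hardware_hints` and the file name in the usage comment are only illustrative. As with memtest, a result is only meaningful if something is found; an abrupt reset often leaves nothing in the log at all.

```python
import re

# Assumed, non-exhaustive fragments of kernel messages that often accompany
# hardware trouble. Adjust the list to whatever your own logs contain.
PATTERNS = [
    r"machine check",
    r"\bmce\b",
    r"hardware error",
    r"\bedac\b",
    r"hard resetting link",
    r"i/o error",
]
HINT_RE = re.compile("|".join(PATTERNS), re.IGNORECASE)


def find_hardware_hints(lines):
    """Return (line_number, text) pairs for lines matching any pattern."""
    return [
        (number, line.rstrip("\n"))
        for number, line in enumerate(lines, start=1)
        if HINT_RE.search(line)
    ]


# Usage (file name is an example; point it at your own mirrored syslog):
#
#   with open("syslog-previous.txt", errors="replace") as fh:
#       for number, text in find_hardware_hints(fh):
#           print(f"{number}: {text}")
```

An empty result does not clear the hardware; it only means none of the assumed patterns appeared before the reboot.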