February 24, 20251 yr Hello I have an issue where my server running 6.12.15 is freezing and hard rebooting. The hard reboots have been occasionally happening for a little while, and I thought it was something to do with the parity checks - as it always seemed to happen while these were running - but now I'm not sure if it's just because it would crash during a parity check, and then the system would attempt to restart the check after the reboot. Now the server is just freezing and rebooting again and again. I have tried making a new USB with a trial key and just letting it sit there, but it wont crash. I also ran a memtest for a couple hours which came back with a pass. I am about to start the server in safe mode, and see if I can remove components one-by-one to see if it's a hardware issue. I have an array of 4 disks, with a couple of pools of HDD, SATA SSD, and NVMe. I have docker images, mainly the arr's plus pihole, Immich, etc I do have an nVidia GPU installed. Thanks in advance tower-diagnostics-20250223-0823.zip
February 24, 20251 yr Author Dang... Server freezes after about 10mins even in safe mode with array off, doing nothing. I've put the temp UNRAID USB back in, and it stayed up for an hour. I've attached the syslog file which covers several days, as I had shipped that to another Ubuntu server for safekeeping. syslog.log
February 24, 20251 yr Server rebooting by itself is almost always a hardware problem, since you have multiple RAM sticks try using the server with just one, if the same try with a different one, that will basically rule out bad RAM.
February 25, 20251 yr Author Solution I tried removing the RAM sticks one at a time, but the server froze after 10 mins in each configuration. I've removed everything from the motherboard apart from CPU and RAM and it still crashes, so next bet is to try another mobo with the original PSU. I plugged in a spare SSD and tried installing Windows on it, and it got to 33% then froze, so definitely not a USB/UNRAID thing. I'll update this thread if I do get any answers, just for xkcd.com/979/ but it does look like this will be a hardware fault to track down. Thanks JorgeB
Join the conversation
You can post now and register later. If you have an account, sign in now to post with your account.
Note: Your post will require moderator approval before it will be visible.