Hello everyone, I'm hoping to get some help with my random reboot problem. I just built this system in November and I’ve been struggling to solve this on my own. My server seems to crash or reboot itself after anywhere from 5 – 72 hours of uptime. I have the array set to not start itself on boot, so I will come back to my server and see it just sitting, waiting for me to start the array, with notifications about unclean shutdowns. It seems more likely to occur when I’m doing tasks with a lot of disk activity like a Parity Sync or running Mover (with many TBs to move)
The server is plugged into an APC UPS with fresh batteries which report good health and I have NUT setup with a shutdown method.
So far, I’ve done to following:
Changed the CPU from an i5 13500 to an i7 13700k (upgrade unrelated to this issue).
Removed an old disk with questionable SMART data, without alleviation.
Replaced the TIM on my HBA.
Reseated the RAM.
I'm currently running memtest, but do not have results yet.
I mirrored the syslog to flash and will attach it.
System Specs:
Intel Core i5-13500 > Intel Core i7-13700K.
ASRock Z690 Extreme ATX LGA1700 Motherboard
G.Skill Ripjaws V 64 GB (4 x 16 GB) DDR4-3200 CL16 Memory
Appdata Storage:
Western Digital Black SN850X 2 TB M.2-2280 PCIe 4.0 X4 NVME Solid State Drive
Download Cache:
2x SAMSUNG 870 QVO 2.5"
1x SAMSUNG 850 EVO 2.5”
PSU: SeaSonic FOCUS Plus Platinum 750 W 80+ Platinum
HBA: LSI 9300-16i SAS in IT Mode.
Please let me know if I’ve missed anything important or if more info is needed. Thank you.
syslog