Hello,
My server has been crashing and I would appreciate help diagnosing it. I've noticed it crashes more often while copying files or working with Docker containers, so I replaced the SATA cables with brand new ones, and it then let me copy TBs of files with no issues. The next day it crashed again. Sometimes it just hard reboots at random, with no stack trace and no error at all. Most of the time when I go to check on it, there is a stack trace on the screen, but there are never any errors in the log at the time of the crash.
Things I have done so far:
I tried mirroring the syslog to flash, and even across several crashes, nothing is captured in the log at the time of the crash.
Ran the built-in memtest (the one that comes with Unraid) for 24 hours - no reported issues
Ran a separate memtest (newest version) from a separate USB drive, one RAM stick at a time until completion (a bit over 4 hours each)
Tried different network cables, different NICs on the server, and connecting to a different network switch.
Today I got a new USB drive and, from a different PC, wrote a fresh copy of Unraid (6.8.0) to it using the Unraid tool - same problems
Please let me know what I should try.
Specs:
CPU - Intel Xeon E3-1270 V2
Mainboard - Supermicro X9SCM-F
RAM - Kingston 32GB (4x8GB) 240-Pin DDR3 1600 ECC Unbuffered Server Memory
PSU - Corsair CX 600
Samsung SSDs and WD drives (all new and precleared)