I've had amazing stability running my server on version 6.8.3 but when I upgrade to 6.9.2 or even 6.10.0-rc2 I get random lock-ups that require a reboot to fix. I've not seen anything that seems obvious in the logs as the the cause so I was hoping I could get some outside help.
Hardware Config:
Supermicro X9DRL-iF
2x Xeon E5-2650v2
32gb (8x4gb) Hynix DDR3-1333
3x Seagate Ironwolf 6TB + 1x Ironwolf 6TB for parity
1TB Crucial MX500 cache disk
I'm using only the onboard SATA controller and networking so there are no separate PCIe cards installed.
Some things that I noticed after upgrading:
1. Started seeing MCE messages about memory read errors that I had never seen before when using 6.8.3. Hoping this is a non-issue like I saw in this thread. Ran memtest for an hour but never encountered any errors.
2. The lock-up or crash causes the server to become unresponsive but does not initiate a restart on its own. I have to log into the remote admin interface of the motherboard to trigger a reboot or actually hit the power button to bring the server back.
3. I can't clearly identify any one trigger to the crash, it has happened when updating a minecraft docker, clicking on the "WebUI" button for my jellyfin docker, just watching videos using jellyfin, or even when the server is sitting idle.
I'm currently booted into safe mode with 35 minutes of uptime as a test to see if perhaps there is a problem with a plugin or if it makes any difference at all.
Thanks in advance for any help you guys can provide,
blkjack410
beowulf-diagnostics-20220108-1829.zip
syslog-192.168.101.88.log