December 20, 20232 yr Hi All, Hoping for some help, as I'm experiencing server crashes every 24 hours or so. I initially thought it was RAM issues, but memtest came up clean. - Currently at a loss outside of major hardware rebuilds. - Can't really downgrade to 6.11 as it breaks most apps. Basic specs Unraid: 6.12.6 CPU: 6700K (stock speeds) RAM: 2 x 16GB (3200 stock speeds) What I have tried so far: 1) Run the latest memtest86 overnight with 6 passes succeed (image attached) - I can attempt a single stick of ram but I am assuming this is a good enough test. 2) Syslog server setup, with no apparent errors (attached) SYSLOG All-Messages-search-result.csv 3) Diagnostics are attached diagnostics-20231220-1455.zip 4) Booted up in CLI mode with a monitor attach to monitor standard out 5) Switched to ipvlan in Docker settings and forced upgrade/rebuilt all docker images Any advice would be appreciated, let me know if I have missed something. I do recall it was rock solid before 6.12, but it might be some bias there, the ipvlan fixes havent solved it. Thanks, Memtest results Docker - ipvlan settings Standard output on the monitor attached to the unraid server: - Apologies for the blur diagnostics-20231220-1455.zip SYSLOG All-Messages-search-result.csv Edited December 20, 20232 yr by crunchynuts Formatting easier to read
December 20, 20232 yr Community Expert Unfortunately there's nothing relevant logged, this usually points to a hardware issue, one thing you can try is to boot the server in safe mode with all docker/VMs disabled, let it run as a basic NAS for a few days, if it still crashes it's likely a hardware problem, if it doesn't start turning on the other services one by one.
December 20, 20232 yr Author Server crashed again around 2:40am in the logs, I've removed all RAM and left 1 stick in the default slot. I have attached the diagnostics and syslog from the crash below to see if anything is visible. Diagnostics beast-diagnostics-20231221-0916.zip Syslog All-Messages-search-result-crash0240-20231221.csv
February 7, 20242 yr Author Solution Following this one up incase it helps someone in the future. - nothing useful from memtest/syslog/diagnostics - did all the recommended 6.12.x tweaks. - Removed all but 1 stick of RAM, seems to last longer but still crashes after about a week. Noticed that when it did crash I could still SSH in and access the CLI, although it did not register and commands. - I believe that when there were writes or reads to the flash under circumstances (unsure when unraid does this specifically). It would cause various system components to halt/and not respond (UI/docker/etc).. CLI was strangely accessible. - Hard pressing the power button initiated a hard shutdown, and I noticed that Docker would take a long time to respond to a stop (docker is stored on flash) - After a while the flash would stop detecting in the BIOS POST as a bootable drive. - plugged into a windows machine and it was fine however this pointed to issues with the flash USB. Decided to buy a brand new USB flash and reinstall latest 6.12.6 copied over the config folder. - Chose not to restore backup incase there were other corrupt files Reinstalled all RAM sticks and system is somewhat stable now for 3 days, use to crash daily. If it makes it past 10 days then will consider it stable. TLDR, check your flash if its more than a few years old, it's cheaper than checking other hardware components.
Join the conversation
You can post now and register later. If you have an account, sign in now to post with your account.
Note: Your post will require moderator approval before it will be visible.