Late september ish I started having unraid (6.9 latest beta, currently beta30) lock up and it requires a hard reboot to fix. Initially, docker shuts down and most CPU cores go to 100%. Within 1-5 minutes, the Unraid ui stops responding and the server no longer responds to pings or ssh. Attempting to reboot/shutdown from the UI while it's still responsive does not work and just enters the unresponsive state. A hard reset is the only way to fix this.
I've determined it is extremely likely that it only happens while the organizr docker is running. Possibly only happens while a browser has organizr open but I'm not 100% sure about that. I was having near daily unraid crashes so I spent the last week with organizr not running crash free and two nights ago turned it back on (although wasn't using it) and yesterday when I started using it almost immediately I had another crash. In the syslog, crashes always start with the following message or something very similar:
Oct 29 10:25:30 Mercury kernel: BUG: kernel NULL pointer dereference, address: 0000000000000402
Oct 29 10:25:30 Mercury kernel: #PF: supervisor read access in kernel mode
Oct 29 10:25:30 Mercury kernel: #PF: error_code(0x0000) - not-present page
Oct 29 10:25:30 Mercury kernel: PGD 0 P4D 0
Oct 29 10:25:30 Mercury kernel: Oops: 0000 [#1] SMP NOPTI
Oct 29 10:25:30 Mercury kernel: CPU: 6 PID: 118105 Comm: php-fpm7 Tainted: P O 5.8.13-Unraid #1
Oct 29 10:25:30 Mercury kernel: Hardware name: Gigabyte Technology Co., Ltd. X399 AORUS Gaming 7/X399 AORUS Gaming 7, BIOS F12 12/11/2019
Oct 29 10:25:30 Mercury kernel: RIP: 0010:fuse_readahead+0x124/0x352
Does anyone have any ideas what could be causing this and any suggestions for how I could fix this so I can keep using organizr? It's possible the issue is something to do with one of my other dockers being in an iframe but I don't know why that would be an issue. Yesterday the crash happened while I was looking at nzbget, nzbhydra, and radarr v3.
I posted this on the Organizr discord but they seem to think it's an unraid issue since there are no other reports of similar behaviour. I have a number of unraid plugins and other dockers running although I've managed to trigger a crash with most dockers and some plugins disabled. I've confirmed it's not the unraid Nvidia build (crashes happen on stock). I've also disabled the cachedir plugin which may have been causing some other issues but crashes still happen.
If it is Organizr causing the crashes, how can I prevent a docker from taking down my whole system? Is there perhaps some obscure conflict I'm not aware of?
I appreciate any suggestions and can provide any addition info I missed. Thanks so much for any help!
I've attached diagnostics and the full syslog for yesterday. I've also run several memtests without error. Also attached a list of hardware and plugins.
Tagging per request from organizr discord: @Roxedus @tronyx
mercury-diagnostics-20201030-1740.zip
syslog2020-10-29 copy.txt
hardware.txt
plugins.txt