SteveZ59 Posted April 20, 2023 Share Posted April 20, 2023 Been chilling back on 6.9.2 for quite a while because any attempt to upgrade to 6.10 or now 6.11 results in random reboots of the system. A lot of people had issues with 6.10 initially, so I figured whatever the issue was would be fixed in a later rev and I just needed to be patient. When 6.11 1st came out, I tried it with similar results. Recently tried again with 6.11.5 and am still getting the same results. So whatever the issue is, clearly it's not going to be resolved by just waiting for newer versions. But I'm pretty much at my wits end figuring it out, so I'm hoping someone will have some insight/suggestion. On 6.9.2 it is rock solid. Will run forever, and have never had any reboot issues with this hardware prior to trying to upgrade to 6.10. If I downgrade back to 6.9.2 it is back to being rock solid. I have recently run Memtest with no errors found. I'm baffled, because if I go back to 6.9.2 it will run forever, but anything newer than 6.9 and I will experience spontaneous reboots anywhere from a couple days to two weeks apart. Best I can tell, there are no log entries indicating a problem, right up until the machine spontaneously reboots. This last time I did a telnet session with tails to capture the log in case something wasn't making it to the flash, that is the attachment labled Putty Log. The hardware is a Dell R930 with (4) E7-8867 and 480GB of ram. Prior to the last reboot I had upgraded the bios to the latest rev, not really expecting that to fix it, but you never know. All of my main array disks are in a Supermicro expansion chassis off of a LSI 9302-16e card. The two cache drives are M.2 drives on PCIE cards. The two NVME drives for my VM share are Dell NVME drives off of Dell GY1TD NVME expansion cards. Also have a Quadro P2000 used for Plex transcodes. In my iDrac, some of the times I will get an "OEM software event." in the log. But not every time. Many times, the iDrac never logs anything when it occurs. On the software side, I'm not doing anything special, all standard apps/plugins off the app store. Two Win10 VM's, and One Ubuntu. The Ubuntu is my primary file sharing machine, so it does see a significant amount of disk I/O to the NFS shares. Dockers: Krusader, Plex (binhex container for GPU transcoding), CrashplanPRO, DiskSpeed, FileZXilla, SpeedTest-By-OpenSpeedTest, tautulli, and Virt-Manager. All up to date with latest revs, save a new Plex update that just showed up. Plugins: Unassigned Devices Plus, Unassigned Devices Preclear, CA Auto Turbo Write Mode, CA Cleanup Appdata, Community Applicatinons, docker patch, Dynamix Cache Directories, Dynamix Schedules, Dynamix System Statistics, Enhanced Log Viewer, Fix Common Problems, IPMI support, NerdTools, Nvidea Driver, Tips and Tweaks, Unassigned Devices. All up to date to latest revs. I'm grateful for any suggestions anyone might have because I'm pretty much at the end of my rope and nothing I've tried so far has seen any success. I've considered downgrading back to 6.9, but at this point its pretty clear that would just be kicking the can down the road unless I'm willing to stay there forever. tower-diagnostics-20230414-0855.zip 2023-04-16 Putty Log.txt 2023-04-14 Full Current syslog.txt Quote Link to comment
Frank1940 Posted April 20, 2023 Share Posted April 20, 2023 (edited) Try the syslog server as it continuously records each syslog entry and will bridge the reboot. https://forums.unraid.net/topic/46802-faq-for-unraid-v6/page/2/?tab=comments#comment-781601 Edited April 20, 2023 by Frank1940 Quote Link to comment
SteveZ59 Posted April 21, 2023 Author Share Posted April 21, 2023 (edited) Ok, got a syslog server active now. Will see if it picks anything up that the tail session I'm running in telnet may miss on the next crash. Edited April 21, 2023 by SteveZ59 Quote Link to comment
SteveZ59 Posted April 27, 2023 Author Share Posted April 27, 2023 Ran 6 days and spontaneously rebooted again last night. This time the only things I had running were a Windows 10 VM, an Ubuntu VM, and my Plex Docker. Left everything else shut off. No log entries whatsoever, even in the syslog server I set up. If I roll back to 6.9.2, it will run forever without an issue. Anything newer and I get these random reboots At a loss for what to try next. Anyone have any suggestions? 2023-04-27 Syslog Server.zip tower-diagnostics-20230427-1608.zip 2023-04-27 Full USB syslog.txt Quote Link to comment
Recommended Posts
Join the conversation
You can post now and register later. If you have an account, sign in now to post with your account.
Note: Your post will require moderator approval before it will be visible.