dudly0 Posted September 22, 2023 Share Posted September 22, 2023 I've searched other Unraid forums and saw two similar items, but none of the topics covered described my specific scenario or fixed the issues I'm seeing. I'm trying to debug an issue with my Unraid server where it won't stay online and active for longer than 5-10 minutes without hitting the error that constantly repeats over and over on the plugged in monitor (See attached screenshot), "get_swap_device: Bad swap file entry 3ffffffffffff." When it happens, I can no longer login to the Unraid web GUI, I cannot reach the SMB shares, and I cannot do literally anything on the terminal (I've pressed CTRL+C & other various key combinations, to no avail). The message is spamming the screen very rapidly and I'm unable to use the keyboard for anything once it shows up. Here's what I've done so far to diagnose the best I can: I replaced the USB device as it wasn't happy when I plugged it in to my Windows machine. Doesn't seem to have made much of a stability difference other than my ability to now see the error message in the terminal window described above. I've started the array in safe mode with no plugins enabled, waited overnight for the parity process to run, but the server was frozen by morning. If I had to guess it was about 8 hours in before it froze and was unresponsive. **Note** This was done with the old USB device I've updated the Unraid OS from version 6.12.3 to 6.12.4. Still seeing the freeze. **Note** This was using the new USB device The only other thing of note that I saw prior to the freeze was in the Dashboard menu, I saw under the System widget the LOG was taking up 100% of the system. I also haven't downloaded/installed anything new for at least 5 or so months, just randomly appeared today so I highly doubt I have a new application that's causing the mess. I saw in other forums the Network Statistics app causing issues, but I don't have that installed. I am also unable to run memtest+ prior to Unraid booting. Every time I use the keyboard to select that option (both on old USB device and new USB device) the server reboots to the main screen and starts the boot process over again. Not sure what to make of this yet, but I don't believe it's related yet. Any help in other diagnostic attempts is greatly appreciated. Quote Link to comment
JorgeB Posted September 22, 2023 Share Posted September 22, 2023 Enable the syslog server and post that after a crash together with the diagnostics, diags can be after rebooting. Quote Link to comment
itimpi Posted September 22, 2023 Share Posted September 22, 2023 Do you have the swap file plugin installed? Just asking as by default Unraid does not use a swapfile. 43 minutes ago, dudly0 said: I am also unable to run memtest+ prior to Unraid booting. Every time I use the keyboard to select that option (both on old USB device and new USB device) the server reboots to the main screen and starts the boot process over again. Not sure what to make of this yet, but I don't believe it's related yet. This is normal if you boot in UEFI mode. You need to either boot in legacy mode to get the Unraid supplied version to run, or download a version from memtest86.com that can boot in UEFI mode. Quote Link to comment
dudly0 Posted September 22, 2023 Author Share Posted September 22, 2023 Thanks JorgeB and itimpi, I've enabled the syslog server to write to USB and will post when it crashes again with the logs. I do not have the swap file plugin installed either. I do however boot using UEFI mode so I will likely need to run in legacy mode or download a version from memtest86.com if we need to go that route (I'm still not convinced it's RAM related just yet). Quote Link to comment
itimpi Posted September 22, 2023 Share Posted September 22, 2023 10 minutes ago, dudly0 said: so I will likely need to run in legacy mode or download a version from memtest86.com if we need to go that route I would get the memtest86.com version anyway. It is a much more recent version so probably does any test better. It unfortunately is not included as standard with Unraid for Licencing reasons. Quote Link to comment
dudly0 Posted September 22, 2023 Author Share Posted September 22, 2023 Attached is my syslog file for debugging. I've also attached the diagnostics following the crash and reboot. syslog anonymized-threadripper-diagnostics-20230922-1346.zip Quote Link to comment
Solution JorgeB Posted September 23, 2023 Solution Share Posted September 23, 2023 Btrfs is detecting data corruption and there are several apps segfaulting, start by running memtest. Quote Link to comment
dudly0 Posted September 23, 2023 Author Share Posted September 23, 2023 Thanks JorgeB, I was able to download the latest memtest86 as itimpi recommended. I ran it and wouldn't you know that it failed miserably with 10,000+ errors. I had some spare RAM lying around and replaced what was installed in the server. Things are back up and running smoothly. Thank you for your help! 1 Quote Link to comment
Recommended Posts
Join the conversation
You can post now and register later. If you have an account, sign in now to post with your account.
Note: Your post will require moderator approval before it will be visible.