September 10, 2025Sep 10 Trying to debug why UNRAID crashes approximately once per monthUNRAID Version: 7.1.4Processor: Intel Core i7-14700KMotherboard and Bios:ASUSTeK COMPUTER INC. Pro WS W680-ACE , Version Rev 1.xxAmerican Megatrends Inc., Version 4302BIOS dated: Mon 16 Jun 2025 12:00 AMSteps previously takenRan 2 passes of Memtest86+ v6.20 : PassRemoved usage of Intel Graphics SR-IOVDuring this last crash which happened onSep 9 between03:42:02 : last log written to log server06:43:08 : when I power cycledThe following errors were shown on the monitor of the serverDoing some ChatGPT questioning of lines in my log file. ChatGPT believes the following lines in the log file are the likely root cause of the crashThese lines occurred during mover running (Starts at 3:40:00). I am running a ZFS cache.Sep 8 03:40:01 Tower shfs: /usr/sbin/zfs unmount 'cache_nvme/Downloads' 2>&1Sep 8 03:40:01 Tower shfs: /usr/sbin/zfs destroy 'cache_nvme/Downloads' 2>&1Sep 8 03:40:01 Tower shfs: /usr/sbin/zfs mount 'cache_nvme/Downloads' 2>&1Sep 8 03:40:01 Tower shfs: /usr/sbin/zfs unmount 'cache_nvme/Downloads' 2>&1Sep 8 03:40:01 Tower shfs: /usr/sbin/zfs destroy 'cache_nvme/Downloads' 2>&1Sep 8 03:40:01 Tower shfs: /usr/sbin/zfs mount 'cache_nvme/Downloads' 2>&1Is the unmount, destroy, mount in rapid successions normal when running mover on a ZFS cache? I didn't find any other mention of this in other places.Other items on my listBad USB boot driveI've been using the same drive for many years. I have no other reason to think this but it's easy to try a new oneBad CPUI do have one of the known problem Intel CPUs but I have kept up with the newest BIOS and only have ever run stock voltage and frequency. Thanks for any help you can provide.tower-diagnostics-20250909-2148.zip Edited September 10, 2025Sep 10 by mwasserman
September 10, 2025Sep 10 4 hours ago, mwasserman said:Is the unmount, destroy, mount in rapid successions normal when running mover on a ZFS cache?That's not a problem.Enable the syslog server and post that after a crash, but plenty of confirmed cases of those CPUs being the problem, so it's a good suspect, especially if there's nothing relevant logged when it crashes.
Join the conversation
You can post now and register later. If you have an account, sign in now to post with your account.
Note: Your post will require moderator approval before it will be visible.