October 17, 2025Oct 17 Looking to resolve a potential hardware issue, just not sure which device... I'm struggling between symptoms and root cause. I've had a few lockups over the past 2-3 weeks, created a diagnostics file below, but inside the zip, I also included the full log since Oct 11th.On Oct 11th, clearly something was locking up and having an issue. It ran smooth for a day or two and then this morning (10/17/2025) I woke up to a system freeze. I definitely have a faulty USB backup enclosure that I decided to reformat and pre-clear, and the lockup happened during pre-clear. I'm ready to abandon the USB, but wanted to find root cause. The "old" USB drive is indicated as STATUS_Backup_Old (spelling error for name of server STRATUS). Theres definitely suspicious kernel errors last night and this morning.Also noted, last week, I had numerous BTRFS errors with my Docker image so I moved to directory on the ZFS pool (SSDs) and switched to Overlay2. I don't think I have the logs go back that far as they weren't properly being written to a directory for sustainment. stratus-diagnostics-20251017-0712.zip
October 17, 2025Oct 17 Community Expert Solution Start by running memtest, also check/replace cables for sdi
October 18, 2025Oct 18 Author Thank you. I have 4 sticks of 16GB DDR 3600 ram = 64GB.Just started memtest and within 10 minutes have 5000+ errors. Is it worth narrowing down which DIMM or should I just replace them all? I'll also replace all the SAS breakout cables at the same time.
October 18, 2025Oct 18 Community Expert 5 hours ago, birdsofprey02 said:Is it worth narrowing down which DIMM or should I just replace them all?I think it's worth to try and find the bad one, most likely there's only one, but I would test both pairs first to make sure.
November 12, 2025Nov 12 Author Closing this out and marking resolved for archive.I had thousands and thousands of memtest errors. Unfortunately, my memory banks are under the CPU heatsink/fan, so its a pain to narrow down the culprit. Ended up buying 4 new banks. Also bought all new breakout cables and added two nvme drives for fun. Not sure if this was a normal course of action, but my 9211-8i HBA card had really old firmware/bios installed. Upgraded that to the latest I could find.
Join the conversation
You can post now and register later. If you have an account, sign in now to post with your account.
Note: Your post will require moderator approval before it will be visible.