jazzysmooth Posted August 11, 2023 Share Posted August 11, 2023 (edited) Been running this system for years, with a few upgrades here and there. Been very stable until recently, where every 6+ days I'd notice the Docker service was no longer running. Restarting would fix the issue until another 6+ days would pass. Finally bothered to look at the diagnostics, and the crashing seems to be related to: WARNING: CPU: 0 PID: 16956 at fs/btrfs/extent-tree.c:3061 __btrfs_free_extent+0x466/0xc02 ... Workqueue: events_unbound btrfs_preempt_reclaim_metadata_space ... BTRFS error (device sdh1): unable to find ref byte nr 2845564928 parent 0 root 5 owner 40359587 offset 0 Jul 13 07:04:47 Storage kernel: ------------[ cut here ]------------ Jul 13 07:04:47 Storage kernel: BTRFS: Transaction aborted (error -2) etc. I see this in 6.12.2 and 6.12.3 logs (attached) I'm going to try a BTRFS file system check next; the 2 SSDs that make up the cache drive are definitely old, but never had an issue until 6.12.2 Thoughts? storage-diagnostics-20230715-1605.zip storage-diagnostics-20230810-1055.zip Edited August 11, 2023 by jazzysmooth wrong diagnostics file Quote Link to comment
Solution JorgeB Posted August 11, 2023 Solution Share Posted August 11, 2023 7 hours ago, jazzysmooth said: Thoughts? Newer kernel may be finding some previously undetected corruption, if no errors are detected by the filesystem check, or if it cannot be repaired (and don't try to repair before backing up the pool), recommend backup and re-format. Quote Link to comment
jazzysmooth Posted August 11, 2023 Author Share Posted August 11, 2023 File system check did find and apparently fix issues. Will see what happens in 6 days... Thanks! 1 Quote Link to comment
Recommended Posts
Join the conversation
You can post now and register later. If you have an account, sign in now to post with your account.
Note: Your post will require moderator approval before it will be visible.