Weekly btrfs_preempt_reclaim_metadata_space appears to be crashing the cache drives since 6.12.2?


Solved by JorgeB

I've been running this system for years, with a few upgrades here and there. It was very stable until recently; now, every 6+ days, I notice that the Docker service is no longer running. Restarting fixes the issue until another 6+ days pass. I finally looked at the diagnostics, and the crashes seem to be related to:
WARNING: CPU: 0 PID: 16956 at fs/btrfs/extent-tree.c:3061 __btrfs_free_extent+0x466/0xc02

...

Workqueue: events_unbound btrfs_preempt_reclaim_metadata_space

...

BTRFS error (device sdh1): unable to find ref byte nr 2845564928 parent 0 root 5  owner 40359587 offset 0
Jul 13 07:04:47 Storage kernel: ------------[ cut here ]------------
Jul 13 07:04:47 Storage kernel: BTRFS: Transaction aborted (error -2)

etc.
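
For anyone who wants to check their own syslog for the same signatures, here is a minimal Python sketch. It assumes the three message formats quoted above are the ones that matter; the pattern names, the `scan_syslog` helper, and the `SAMPLE` list are made up for illustration and are not part of any Unraid or btrfs tooling.

```python
import re

# Patterns modelled on the log lines quoted in this post (illustrative, not exhaustive).
PATTERNS = {
    "warning": re.compile(r"WARNING: CPU: \d+ PID: \d+ at (?P<where>\S+)"),
    "missing_ref": re.compile(
        r"BTRFS error \(device (?P<device>\w+)\): "
        r"unable to find ref byte nr (?P<bytenr>\d+)"
    ),
    "abort": re.compile(r"BTRFS: Transaction aborted \(error (?P<errno>-?\d+)\)"),
}


def scan_syslog(lines):
    """Return (kind, fields) tuples for every line matching a known pattern."""
    hits = []
    for line in lines:
        for kind, pattern in PATTERNS.items():
            match = pattern.search(line)
            if match:
                hits.append((kind, match.groupdict()))
    return hits


# Sample input copied from the excerpts above.
SAMPLE = [
    "WARNING: CPU: 0 PID: 16956 at fs/btrfs/extent-tree.c:3061 __btrfs_free_extent+0x466/0xc02",
    "BTRFS error (device sdh1): unable to find ref byte nr 2845564928 parent 0 root 5  owner 40359587 offset 0",
    "Jul 13 07:04:47 Storage kernel: ------------[ cut here ]------------",
    "Jul 13 07:04:47 Storage kernel: BTRFS: Transaction aborted (error -2)",
]

if __name__ == "__main__":
    for kind, fields in scan_syslog(SAMPLE):
        print(kind, fields)
```

To run it against a real log, pass an open file instead of `SAMPLE`, for example `scan_syslog(open("syslog.txt"))`, where `syslog.txt` is a placeholder for wherever the syslog from the diagnostics was extracted. The `-2` in the abort message is the kernel's `-ENOENT`, which matches the "unable to find ref" error logged just before it.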

 

I see this in both the 6.12.2 and 6.12.3 logs (attached).

 

I'm going to try a BTRFS file system check next. The 2 SSDs that make up the cache drive are definitely old, but I never had an issue with them until 6.12.2.

Thoughts?
 

storage-diagnostics-20230715-1605.zip

storage-diagnostics-20230810-1055.zip

Edited by jazzysmooth
wrong diagnostics file