BTRFS pool corrupt



Hi Guys,

 

My BTRFS pool just went read-only.

 

Aug 10 22:28:49 Vortex kernel: BTRFS error (device nvme0n1p1): bdev /dev/nvme1n1p1 errs: wr 12, rd 11354, flush 1, corrupt 1, gen 0
Aug 10 22:28:49 Vortex kernel: BTRFS error (device nvme0n1p1): bdev /dev/nvme1n1p1 errs: wr 12, rd 11355, flush 1, corrupt 1, gen 0
Aug 10 22:28:50 Vortex kernel: BTRFS error (device nvme0n1p1): bdev /dev/nvme1n1p1 errs: wr 12, rd 11356, flush 1, corrupt 1, gen 0
Aug 10 22:28:50 Vortex kernel: BTRFS error (device nvme0n1p1): bdev /dev/nvme1n1p1 errs: wr 12, rd 11357, flush 1, corrupt 1, gen 0
Aug 10 22:28:50 Vortex kernel: BTRFS error (device nvme0n1p1): bdev /dev/nvme1n1p1 errs: wr 12, rd 11358, flush 1, corrupt 1, gen 0
Aug 10 22:28:50 Vortex kernel: BTRFS error (device nvme0n1p1): bdev /dev/nvme1n1p1 errs: wr 12, rd 11359, flush 1, corrupt 1, gen 0
Aug 10 22:28:51 Vortex kernel: BTRFS error (device nvme0n1p1): bdev /dev/nvme1n1p1 errs: wr 12, rd 11360, flush 1, corrupt 1, gen 0
Aug 10 22:28:51 Vortex kernel: BTRFS error (device nvme0n1p1): bdev /dev/nvme1n1p1 errs: wr 12, rd 11361, flush 1, corrupt 1, gen 0
Aug 10 22:28:51 Vortex kernel: BTRFS error (device nvme0n1p1): bdev /dev/nvme1n1p1 errs: wr 12, rd 11362, flush 1, corrupt 1, gen 0
Aug 10 22:28:51 Vortex kernel: BTRFS error (device nvme0n1p1): bdev /dev/nvme1n1p1 errs: wr 12, rd 11363, flush 1, corrupt 1, gen 0

 

Scrub does not seem to do anything.

 

Checking with btrfs dev stats -c /mnt/cache returns:

 

[/dev/nvme0n1p1].write_io_errs 0
[/dev/nvme0n1p1].read_io_errs 0
[/dev/nvme0n1p1].flush_io_errs 0
[/dev/nvme0n1p1].corruption_errs 0
[/dev/nvme0n1p1].generation_errs 0
[/dev/nvme1n1p1].write_io_errs 12
[/dev/nvme1n1p1].read_io_errs 12152
[/dev/nvme1n1p1].flush_io_errs 1
[/dev/nvme1n1p1].corruption_errs 1
[/dev/nvme1n1p1].generation_errs 0
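To make the counters above easier to read at a glance, here is a minimal Python sketch that parses this output and lists the devices with any non-zero counter. The function names (`parse_dev_stats`, `devices_with_errors`) are made up for illustration, and the parser assumes the `[device].counter value` layout shown in this post.

```python
import re
from collections import defaultdict

# Matches lines such as "[/dev/nvme1n1p1].read_io_errs 12152"
STAT_LINE = re.compile(r"^\[(?P<dev>[^\]]+)\]\.(?P<counter>\w+)\s+(?P<value>\d+)$")


def parse_dev_stats(text):
    """Return {device: {counter: value}} parsed from `btrfs dev stats` output."""
    stats = defaultdict(dict)
    for line in text.splitlines():
        match = STAT_LINE.match(line.strip())
        if match:
            stats[match["dev"]][match["counter"]] = int(match["value"])
    return dict(stats)


def devices_with_errors(stats):
    """Return the sorted list of devices with at least one non-zero counter."""
    return sorted(dev for dev, counters in stats.items() if any(counters.values()))


# Sample copied from the output posted above.
SAMPLE = """\
[/dev/nvme0n1p1].write_io_errs 0
[/dev/nvme0n1p1].read_io_errs 0
[/dev/nvme0n1p1].flush_io_errs 0
[/dev/nvme0n1p1].corruption_errs 0
[/dev/nvme0n1p1].generation_errs 0
[/dev/nvme1n1p1].write_io_errs 12
[/dev/nvme1n1p1].read_io_errs 12152
[/dev/nvme1n1p1].flush_io_errs 1
[/dev/nvme1n1p1].corruption_errs 1
[/dev/nvme1n1p1].generation_errs 0
"""

if __name__ == "__main__":
    parsed = parse_dev_stats(SAMPLE)
    for dev in devices_with_errors(parsed):
        print(dev, parsed[dev])
```

On the sample, only /dev/nvme1n1p1 is reported, which matches the device named in the kernel errors.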

 

SOS please :)

 

 

vortex-diagnostics-20210810-2230.zip

vortex-syslog-20210810-2033.zip

Edited by Jaster
20 hours ago, Jaster said:

Please do NOT say "memtest" - the memory is fine, I tested it.

We ask for a memtest when checksum errors are detected, and that's not your issue. Your problem is that one of the NVMe devices dropped offline:

 

Aug 10 21:44:19 Vortex kernel: nvme nvme1: I/O 130 QID 1 timeout, aborting
Aug 10 21:44:19 Vortex kernel: nvme nvme1: Abort status: 0x0
Aug 10 21:44:26 Vortex kernel: nvme nvme1: I/O 183 QID 1 timeout, reset controller
Aug 10 21:45:33 Vortex kernel: nvme nvme1: I/O 15 QID 0 timeout, reset controller
Aug 10 21:46:27 Vortex kernel: nvme nvme1: Device not ready; aborting reset, CSTS=0x1
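If you want to spot this pattern in a long syslog yourself, a rough Python sketch like the one below counts timeout messages per NVMe controller and notes any controller whose reset was aborted. The function name `summarize_nvme_events` is hypothetical, and the matched strings ("timeout", "aborting reset") are taken only from the lines above; other kernel versions may word these messages differently.

```python
import re
from collections import Counter

# Matches kernel lines such as
# "Aug 10 21:44:19 Vortex kernel: nvme nvme1: I/O 130 QID 1 timeout, aborting"
NVME_EVENT = re.compile(r"kernel: nvme (?P<ctrl>nvme\d+): (?P<msg>.*)$")


def summarize_nvme_events(log_text):
    """Return (timeouts per controller, controllers whose reset was aborted)."""
    timeouts = Counter()
    failed_reset = set()
    for line in log_text.splitlines():
        match = NVME_EVENT.search(line)
        if not match:
            continue
        msg = match["msg"]
        if "timeout" in msg:
            timeouts[match["ctrl"]] += 1
        if "aborting reset" in msg:
            failed_reset.add(match["ctrl"])
    return timeouts, failed_reset


# Sample copied from the syslog excerpt above.
SAMPLE_LOG = """\
Aug 10 21:44:19 Vortex kernel: nvme nvme1: I/O 130 QID 1 timeout, aborting
Aug 10 21:44:19 Vortex kernel: nvme nvme1: Abort status: 0x0
Aug 10 21:44:26 Vortex kernel: nvme nvme1: I/O 183 QID 1 timeout, reset controller
Aug 10 21:45:33 Vortex kernel: nvme nvme1: I/O 15 QID 0 timeout, reset controller
Aug 10 21:46:27 Vortex kernel: nvme nvme1: Device not ready; aborting reset, CSTS=0x1
"""

if __name__ == "__main__":
    timeouts, failed_reset = summarize_nvme_events(SAMPLE_LOG)
    print("timeouts:", dict(timeouts))
    print("reset aborted:", sorted(failed_reset))
```

A controller that shows up in the second set never came back after the reset attempt, which is when btrfs starts logging read and write errors against it.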

 

Reboot or power cycle to see if it comes back online, then run a scrub. Also see here for better pool monitoring.
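As one possible way to monitor the pool, here is a small Python sketch built on the `--check` option of `btrfs device stats`, which makes the command exit non-zero when any counter is non-zero. This is only an illustration and not necessarily what the linked post describes. The function name `pool_has_errors` and its `run` parameter (there so the command can be swapped out for testing) are assumptions of this sketch.

```python
import subprocess


def pool_has_errors(mount_point, run=subprocess.run):
    """Return (has_errors, raw_output) for the btrfs pool at mount_point.

    Relies on `btrfs device stats --check` exiting non-zero when any error
    counter is non-zero. A non-zero exit can also mean the stats could not
    be read, so treat True as "needs a look" rather than proof of errors.
    """
    result = run(
        ["btrfs", "device", "stats", "--check", mount_point],
        capture_output=True,
        text=True,
    )
    return result.returncode != 0, result.stdout


# Example use (needs root and a mounted btrfs pool):
#   has_errors, output = pool_has_errors("/mnt/cache")
#   if has_errors:
#       print("Pool needs attention:")
#       print(output)
```

Run from a scheduled job, a check like this would flag the dropped device as soon as the first error was counted. The counters are persistent, so once the pool is healthy again they need to be reset with `btrfs device stats -z` or the check keeps firing.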
