May 18, 20233 yr Saw a bunch of 'BTRFS error' today, mostly "BTRFS warning (device nvme1n1p1): direct IO failed ino 189208 rw 0,0 sector 0x443da678 len 4096 err no 10". Not sure why but seemed like the drive disconnected randomly. Tried rebooting the server and my 'nvme0n1' drive in my cache pool was shown as 'Missing', NOT the nvme1n1 I saw all the errors for. Rebooted into the BIOS, drive was detected okay, showed no errors, booted back into Unraid and it was showing again, and no smart errors. Started array in maintenance mode and ran a Check Filesystem Status and had pages and pages of "BTRFS ERROR (DEVICE LOOP2): PARENT TRANSID VERIFY FAILED ON X WANTED Y FOUND Z" At this point I don't know if I can trust either drive. Any advice would be appreciated for how to resolve this. Scared to start the array and make a backup of everything in case it screws up the data on there. **In theory** it should all be backed up already, but I'd rather not need to put that to the test... Should I even trust this device again? Or just RMA and get a new one, or both? Thanks in advance uri-diagnostics-20230518-1551.zip Edited May 18, 20233 yr by zipt fixing details
May 18, 20233 yr Community Expert Solution Device dropped offline, this is usually not a device problem, try this, on the main GUI page click on the flash drive, scroll down to "Syslinux Configuration", make sure it's set to "menu view" (top right) and add this to your default boot option, after "append initrd=/bzroot" nvme_core.default_ps_max_latency_us=0 pcie_aspm=off e.g.: append initrd=/bzroot nvme_core.default_ps_max_latency_us=0 pcie_aspm=off Reboot and see if it makes a difference.
May 18, 20233 yr Author @JorgeB Thanks for the helpful advice, really appreciate it. I've added that in, but I'm not sure that would be the cause. It's been running fine for almost a year now so I'd be surprised if that was the issue, but I guess I'll see how it goes I've started the array and everything seems to be working correctly. Figured I'd run a scrub as well for good measure but looks okay
May 18, 20233 yr Author Ran a second scrub - found zero errors, then rebooted. No more errors in logs. Looks to be solved!
May 19, 20233 yr Community Expert You can also see here for how to reset the errors and better pool monitoring.
Join the conversation
You can post now and register later. If you have an account, sign in now to post with your account.
Note: Your post will require moderator approval before it will be visible.