zipt Posted May 18, 2023 (edited)

Saw a bunch of BTRFS errors today, mostly "BTRFS warning (device nvme1n1p1): direct IO failed ino 189208 rw 0,0 sector 0x443da678 len 4096 err no 10". Not sure why, but it looked like the drive disconnected randomly.

I rebooted the server and the 'nvme0n1' drive in my cache pool was shown as 'Missing', NOT the nvme1n1 I saw all the errors for. I rebooted into the BIOS, where the drive was detected okay and showed no errors, then booted back into Unraid, where it was showing again with no SMART errors.

I started the array in maintenance mode and ran a Check Filesystem Status, which gave pages and pages of "BTRFS ERROR (DEVICE LOOP2): PARENT TRANSID VERIFY FAILED ON X WANTED Y FOUND Z".

At this point I don't know if I can trust either drive. Any advice on how to resolve this would be appreciated. I'm scared to start the array to make a backup of everything in case that screws up the data on there. **In theory** it should all be backed up already, but I'd rather not need to put that to the test...

Should I even trust this device again? Or just RMA it and get a new one, or both?

Thanks in advance

uri-diagnostics-20230518-1551.zip

Edited May 18, 2023 by zipt: fixing details
JorgeB Posted May 18, 2023 (Solution)

The device dropped offline; this is usually not a device problem. Try this: on the main GUI page click on the flash drive, scroll down to "Syslinux Configuration", make sure it's set to "menu view" (top right), and add this to your default boot option, after "append initrd=/bzroot":

nvme_core.default_ps_max_latency_us=0 pcie_aspm=off

e.g.:

append initrd=/bzroot nvme_core.default_ps_max_latency_us=0 pcie_aspm=off

Reboot and see if it makes a difference.
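For anyone who would rather script the edit than click through the GUI, here is a minimal Python sketch of the same change. It is only an illustration: the function name `add_kernel_params` and the sample config text are made up for this example, and it assumes the default boot option is the `label` block that contains a `menu default` line. It works on a string, so it never touches the real file on the flash drive.

```python
# Hypothetical sample of a Syslinux configuration; the real file may differ.
SAMPLE_CFG = (
    "default menu.c32\n"
    "label Unraid OS\n"
    "  menu default\n"
    "  kernel /bzimage\n"
    "  append initrd=/bzroot\n"
    "label Unraid OS GUI Mode\n"
    "  kernel /bzimage\n"
    "  append initrd=/bzroot,/bzroot-gui\n"
)

# The two kernel parameters suggested in the post above.
PARAMS = ["nvme_core.default_ps_max_latency_us=0", "pcie_aspm=off"]


def add_kernel_params(cfg_text, params):
    """Return cfg_text with any missing params added to the default entry's append line."""
    lines = cfg_text.splitlines()

    # Assumption: the default boot option is marked with a "menu default" line.
    default_idx = next(
        (i for i, line in enumerate(lines) if line.strip().lower() == "menu default"),
        None,
    )
    if default_idx is None:
        raise ValueError("no 'menu default' entry found")

    # Walk back to the 'label' line that opens this boot entry.
    start = default_idx
    while start > 0 and not lines[start].strip().lower().startswith("label "):
        start -= 1

    # Walk forward to the next 'label' line (or the end of the file).
    end = default_idx + 1
    while end < len(lines) and not lines[end].strip().lower().startswith("label "):
        end += 1

    for i in range(start, end):
        stripped = lines[i].strip()
        if stripped.lower().startswith("append "):
            present = stripped.split()
            missing = [p for p in params if p not in present]
            if missing:
                lines[i] = lines[i].rstrip() + " " + " ".join(missing)
            return "\n".join(lines) + "\n"

    raise ValueError("default entry has no 'append' line")


if __name__ == "__main__":
    print(add_kernel_params(SAMPLE_CFG, PARAMS))
```

Only the default entry's `append` line is changed, and parameters that are already present are not added twice, so running it again on its own output leaves the text unchanged.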
zipt (Author) Posted May 18, 2023

@JorgeB Thanks for the helpful advice, really appreciate it. I've added that in, but I'm not sure that would be the cause. It's been running fine for almost a year now, so I'd be surprised if that was the issue, but I guess I'll see how it goes.

I've started the array and everything seems to be working correctly. Figured I'd run a scrub as well for good measure, but it looks okay.
zipt (Author) Posted May 18, 2023

Return of the wall of errors, looks like there's still an issue.
zipt (Author) Posted May 18, 2023

Ran a second scrub, which found zero errors, then rebooted. No more errors in the logs. Looks to be solved!
JorgeB Posted May 19, 2023

You can also see here for how to reset the errors and for better pool monitoring.
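The linked post is not reproduced in this thread, so the following is an assumption about what such monitoring could look like rather than a summary of it. btrfs keeps per-device error counters that `btrfs device stats <mountpoint>` prints, and adding `-z` resets them to zero after printing. A minimal Python sketch for parsing that output and flagging devices with non-zero counters is below. The function names and the sample output (including its counter values) are hypothetical.

```python
import re

# Made-up sample in the "[device].counter   value" layout printed by
# `btrfs device stats`; these numbers are NOT from this thread.
SAMPLE_OUTPUT = """\
[/dev/nvme0n1p1].write_io_errs    12
[/dev/nvme0n1p1].read_io_errs     3
[/dev/nvme0n1p1].flush_io_errs    0
[/dev/nvme0n1p1].corruption_errs  0
[/dev/nvme0n1p1].generation_errs  0
[/dev/nvme1n1p1].write_io_errs    0
[/dev/nvme1n1p1].read_io_errs     0
[/dev/nvme1n1p1].flush_io_errs    0
[/dev/nvme1n1p1].corruption_errs  0
[/dev/nvme1n1p1].generation_errs  0
"""

STAT_LINE = re.compile(r"^\[(?P<device>[^\]]+)\]\.(?P<counter>\w+)\s+(?P<value>\d+)$")


def parse_dev_stats(output):
    """Turn the stats text into {device: {counter: value}}."""
    stats = {}
    for line in output.splitlines():
        match = STAT_LINE.match(line.strip())
        if match:
            counters = stats.setdefault(match["device"], {})
            counters[match["counter"]] = int(match["value"])
    return stats


def devices_with_errors(stats):
    """Return the sorted list of devices that have any non-zero counter."""
    return sorted(
        device
        for device, counters in stats.items()
        if any(value > 0 for value in counters.values())
    )


if __name__ == "__main__":
    print(devices_with_errors(parse_dev_stats(SAMPLE_OUTPUT)))
```

In practice the text would come from running the command against the pool's mount point (on Unraid presumably something like /mnt/cache, which is also an assumption) on a schedule, with a notification sent whenever the returned list is not empty.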