Jump to content

BTRFS error


Go to solution Solved by JorgeB,

Recommended Posts

Was about to add a new 8tb disk then noticed an error pop up when logging in saying cache error

not first time i had this so its annoying :( but i went to have a look at disks and one shown failed but unriad wasn't responding to any changes that i was trying to do so i ended up forcing a reboot and now both cache disks appear to run smart scans and say no issues

 

 Logs before the reboot

vault101-diagnostics-20240308-1515.zip 
 

vault101-syslog-20240308-1514.zip

 

logs after reboot

 

vault101-diagnostics-20240308-1534.zip

vault101-syslog-20240308-1534.zip

 

also checked disk logs and see the below

cache disk sde

1407616583_cachediskSDE.PNG.25a05128d251f4a2396563513dfc7854.PNG

 

cache 2 disk sdk

671702695_cache2diskSDK.PNG.3e1d45aa85617f32d7ffb823e4848b40.PNG

 

 

 

Also i have tried a different sata cable on the disks to see if it helps but to early to tell esscailly with the current errors parity check in process cause i rebooted it to get it to respond

 

this is the scipt results for the cache check

Script Starting Mar 08, 2024 15:47.01


Full logs for this script are available at /tmp/user.scripts/tmpScripts/checkcachedrives/log.txt

[/dev/sde1].write_io_errs 0
[/dev/sde1].read_io_errs 0
[/dev/sde1].flush_io_errs 0
[/dev/sde1].corruption_errs 0
[/dev/sde1].generation_errs 0
[/dev/sdk1].write_io_errs 2668196
[/dev/sdk1].read_io_errs 3936
[/dev/sdk1].flush_io_errs 0
[/dev/sdk1].corruption_errs 57
[/dev/sdk1].generation_errs 0
Script Finished Mar 08, 2024 15:47.03

So i have ran a scrub and it has gave me the following

 

UUID: 5f5d37ad-1efa-46da-b974-d876dfdb375a

Scrub started: Fri Mar 8 16:05:10 2024

Status: finished Duration: 0:43:16

Total to scrub: 142.90GiB

Rate: 56.34MiB/s

Error summary: verify=2592

csum=1678144

Corrected: 1680736

Uncorrectable: 0

Unverified: 0

 

 

is there an indication to why my cache has corrupted ?, i am am happy to move everything off my cache again to see how that goes cause it leaves corrupted stuff behide but looking for input before i do this incase there is a suggestion

 

any help would be great ? thank you

 

is there an indication to why my cache has corrupted ?, i am am happy to move everything off my cache again to see how that goes cause it leaves corrupted stuff behide but looking for input before i do this incase there is a suggestion

 

any help would be great ? thank you

 

EDIT after the scrub it appears the errors have stopped will update a post later on but would still like input please

 

image.thumb.png.784eedff9b0cb241d590c79ba4f57069.png

Edited by LoyalScotsman
Link to comment

Those errors suggest one of the ache devices dropped offline in the past, scrub brought it up to date, and since there aren't any uncorrectable errors it should be fine now, assuming no NOCOW shares.

 

It would be a good idea to check/replace cables for that device, the one that was /dev/sdk on those screenshots.

Link to comment
2 hours ago, JorgeB said:

Those errors suggest one of the ache devices dropped offline in the past, scrub brought it up to date, and since there aren't any uncorrectable errors it should be fine now, assuming no NOCOW shares.

 

It would be a good idea to check/replace cables for that device, the one that was /dev/sdk on those screenshots.

there is no NOCOW shares

 

i have replaced the cable for /dev/sdk i will montior just thought put a post but so far it is looking good so as you say that disk must have dropped offline 

 

Link to comment

the script appears to still be picking up errors is there a command to clear this ? 

 

Script Starting Mar 08, 2024 19:47.01

Full logs for this script are available at /tmp/user.scripts/tmpScripts/checkcachedrives/log.txt

[/dev/sde1].write_io_errs 0
[/dev/sde1].read_io_errs 0
[/dev/sde1].flush_io_errs 0
[/dev/sde1].corruption_errs 0
[/dev/sde1].generation_errs 0
[/dev/sdk1].write_io_errs 2668196
[/dev/sdk1].read_io_errs 3936
[/dev/sdk1].flush_io_errs 0
[/dev/sdk1].corruption_errs 1779689
[/dev/sdk1].generation_errs 2592
Script Finished Mar 08, 2024 19:47.03

Link to comment
  • Solution
13 hours ago, LoyalScotsman said:

the script appears to still be picking up errors is there a command to clear this ? 

You need to reset them, see here:

https://forums.unraid.net/topic/46802-faq-for-unraid-v6/?do=findComment&comment=700582

 

 

13 hours ago, LoyalScotsman said:

there is no NOCOW shares

Suggest converting those to COW, btrfs cannot correct NOCOW data, since it's not checksummed.

Link to comment

So i have done the above and errors cleared hopefully it also stops the pop up when logging in that occurs ever hour image.thumb.png.ae458a0be6d322b170c51ee8ca6db32d.png

 

 

with regards to COW

 

my appdata share is set to auto 

shares system and domains is set to no how do i change this cause it appears greyed out 

 

I assume this COW option is only for shares directly saved on the cache ? 

Link to comment

Join the conversation

You can post now and register later. If you have an account, sign in now to post with your account.
Note: Your post will require moderator approval before it will be visible.

Guest
Reply to this topic...

×   Pasted as rich text.   Restore formatting

  Only 75 emoji are allowed.

×   Your link has been automatically embedded.   Display as a link instead

×   Your previous content has been restored.   Clear editor

×   You cannot paste images directly. Upload or insert images from URL.

×
×
  • Create New...