cache pool FS corruption


5 posts in this topic Last Reply

Recommended Posts

hi there, 

 

Just realized my cache pool is read only due to what looks like a file system corruption. I was hoping to run a file system check so I stopped the array but the server got stuck at unmounting the disks. 

 

What's my best course of action here with the server stuck at "unmounting" disks? I'd like to get some advise before causing unnecessary damage.

 

thanks for reading. 

 

 

tower-diagnostics-20210615-1448.zip

Link to post
Posted (edited)

so I rebooted the server, started the array in maintenance mode and ran the btrfs check and scrub, resulting in:

 

Status:           finished
Duration:         0:05:52
Total to scrub:   305.64GiB
Rate:             889.13MiB/s
Error summary:    csum=14
  Corrected:      0
  Uncorrectable:  14
  Unverified:     0

 

SMART coms back clean but I do see those: 

 

181	Program fail count total	0x0022	100	100	000	Old age	Always	Never	47244705802

 

what's the best option here? Replace the drives? 

Edited by daan_SVK
Link to post
7 hours ago, JorgeB said:

It's not a device problem, btrfs is detecting data corruption, most likely the result of you having overclocked RAM (or just bad RAM), see here first:

https://forums.unraid.net/topic/46802-faq-for-unraid-v6/?do=findComment&comment=819173

 

 

thanks for your response, 

 

the RAM was MEMtest stress tested before it was installed in this server but I believe it has XMP enabled, does that count as overclocked under these circumstances? 

 

this might be a BTRFS specific question but how sever is the corruption? Can it be repaired, given the cache is in Raid1? Wouldn't rebuilding the cache from scratch be a way to rectify the corrupted blocks or are my Appdata backups also corrupted? 

 

 

 

 

 

Link to post
9 minutes ago, daan_SVK said:

does that count as overclocked under these circumstances? 

It counts as an overclock every time it's above AMD's max officially supported speeds, and it's a known source of data corruption with Ryzen.

 

10 minutes ago, daan_SVK said:

Can it be repaired, given the cache is in Raid1?

No because usually both copies are corrupted, if it could be a scrub would repair it, this means it can't:

 

14 hours ago, daan_SVK said:

Uncorrectable: 14

 

Link to post

Join the conversation

You can post now and register later. If you have an account, sign in now to post with your account.
Note: Your post will require moderator approval before it will be visible.

Guest
Reply to this topic...

×   Pasted as rich text.   Restore formatting

  Only 75 emoji are allowed.

×   Your link has been automatically embedded.   Display as a link instead

×   Your previous content has been restored.   Clear editor

×   You cannot paste images directly. Upload or insert images from URL.