cache disk keeps going read only


uek2wooF


I keep getting btrfs corruption on the NVMe cache disk after about 3 days of uptime. dmesg shows lots of btrfs errors. I also get 30-50 errors on parity checks (two 4 TB WD Reds: one formatted xfs as the array disk, one as parity). Memtest has been running for 18 hours with no errors. Any ideas?

 

The first time the cache disk crashed, I removed it as cache and rsynced it to another SSD I had lying around (a handful of unimportant files wouldn't copy). I reformatted the NVMe as btrfs, rsynced the data back over, and added it back as cache.

 

Ryzen 9 3900X, 64 GB RAM, 1 TB NVMe, 2x 4 TB WD Red
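
Since a handful of files would not copy during the rsync round trip, it may help to confirm that everything else came back intact. The following is a minimal sketch, not something from this thread: it compares two directory trees by SHA-256. The function names `sha256_of` and `compare_trees` and the paths in the usage comment are hypothetical.

```python
import hashlib
from pathlib import Path


def sha256_of(path: Path, chunk_size: int = 1 << 20) -> str:
    """Hash one file in chunks so large files do not need to fit in RAM."""
    digest = hashlib.sha256()
    with path.open("rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def compare_trees(source: Path, copy: Path) -> dict:
    """Return relative paths that are missing from, or differ in, the copy."""
    missing, different = [], []
    for src_file in sorted(p for p in source.rglob("*") if p.is_file()):
        rel = src_file.relative_to(source)
        dst_file = copy / rel
        if not dst_file.is_file():
            missing.append(str(rel))
        elif sha256_of(src_file) != sha256_of(dst_file):
            different.append(str(rel))
    return {"missing": missing, "different": different}


# Hypothetical usage; both paths are assumptions, adjust to your own mounts:
# report = compare_trees(Path("/mnt/disks/backup_ssd"), Path("/mnt/cache"))
# print(report)
```

On a machine with unstable memory, a mismatch can come from the hashing run itself, so a file that differs on one pass and matches on the next points at the hardware rather than the copy.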

Link to comment

I had this with my cache, which was 2 x 512 GB Toshiba SSDs on my 3900X, when I first set it up and was testing. I had my RAM running at 3600 MHz, and the system was not stable. What do you have your RAM running at? Try backing it off to 2667 and see if things settle down. Also turn off PBO (Precision Boost Overdrive) in the BIOS and see if that helps.

 

I was able to get my system stable at 3200 MHz RAM speed, but 3600 just never worked: system lockups, and my cache kept getting corrupted.

 

Also, update your BIOS if it is not already on the latest. Pull your diags and post them up so we can take a look.
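
While gathering diagnostics, the per-device error counters from `btrfs device stats` are a quick way to see whether corruption is still accumulating after a settings change. Here is a rough sketch, assuming the usual `[device].counter value` output layout of that command. The helper names, the sample device `/dev/nvme0n1p1`, and the `/mnt/cache` mount point are assumptions for illustration, not details from this thread.

```python
import re

# One line of `btrfs device stats` output looks like:
#   [/dev/nvme0n1p1].corruption_errs  12
STAT_LINE = re.compile(r"^\[(?P<device>[^\]]+)\]\.(?P<counter>\w+)\s+(?P<value>\d+)$")


def parse_device_stats(text: str) -> dict:
    """Turn the command output into {device: {counter: value}}."""
    stats = {}
    for line in text.splitlines():
        match = STAT_LINE.match(line.strip())
        if match:
            counters = stats.setdefault(match["device"], {})
            counters[match["counter"]] = int(match["value"])
    return stats


def nonzero_counters(stats: dict) -> dict:
    """Keep only the counters that have recorded at least one error."""
    return {
        device: {name: value for name, value in counters.items() if value}
        for device, counters in stats.items()
        if any(counters.values())
    }


# Hypothetical usage (mount point is an assumption):
# import subprocess
# out = subprocess.run(["btrfs", "device", "stats", "/mnt/cache"],
#                      capture_output=True, text=True, check=True).stdout
# print(nonzero_counters(parse_device_stats(out)))
```

If the counters stay flat for longer than the roughly three days it previously took for the cache to go read-only, that is a reasonable sign the change helped.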

Link to comment

I couldn't get my 3600 RAM to run at 3600 at all; I had to drop to 3200. I will drop it further, but shouldn't I be seeing errors from memtest? 21 hours now with no errors.

 

I will upload some configs when I am done memtesting, going to let it go a little longer.  I will try to find PBO too before booting back up.

 

Thanks for the reply.  This is so frustrating.

 

(btw, the mobo is an ASRock X570 Taichi)

 

 

Link to comment
4 hours ago, uek2wooF said:

I didn't realize that. I have dropped the RAM speed to 2666, and since then I've gotten a clean parity check for the first time. I even copied 40 GB to the array first to make sure there was some activity. No cache errors yet. I am keeping an eye on it.

 

Thanks!

If you don't need 64 GB of RAM, you could run at a higher RAM speed with just 2 DIMMs, but at the end of the day the difference in performance is really not that great, especially for a server.
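
To put rough numbers on the speed trade-off, here is a back-of-the-envelope sketch of theoretical peak memory bandwidth. It assumes the speeds quoted in this thread are DDR4 transfer rates in MT/s and that the 3900X runs dual-channel with a 64-bit (8-byte) bus per channel. The function name is hypothetical, and these are ceilings, not measured results.

```python
def peak_bandwidth_gb_s(mega_transfers_per_s: int,
                        channels: int = 2,
                        bus_bytes: int = 8) -> float:
    """Theoretical peak bandwidth in GB/s: transfers/s * bytes per transfer * channels."""
    return mega_transfers_per_s * 1_000_000 * bus_bytes * channels / 1e9


# Compare the three speeds mentioned in the thread.
for speed in (2666, 3200, 3600):
    print(f"DDR4-{speed}: {peak_bandwidth_gb_s(speed):.1f} GB/s peak")
```

That works out to roughly 42.7, 51.2, and 57.6 GB/s. How much of that gap shows up in practice depends on whether the workload is memory-bandwidth-bound, which this arithmetic cannot tell you.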
