
High Cache IO Error Count - Cache Drives Failing?


Hi all,

 

I'm running into some significant cache disk IO errors, and am trying to determine if one (or both) cache drives are failing and need replacement.

 

I rebooted my server this morning, only to find that my two cache NVMe drives (2 x Inland Premium 1TB NVMe in Raid 1 cache pool) were not recognized at all. After rebooting again, the system was able to identify and mount both cache drives. However, after array startup, there was a significant amount of cache disk IO errors which kept appearing in the syslog. Additionally, Docker would not start.

 

I balanced and scrubbed the cache pool (with "repair corrupted blocks" enabled) via the GUI and then rebooted, which appears to have stabilized things. Everything appears to be working fine as of now. That said, the SMART data for the two cache drives shows a high error count, and running "btrfs device stats /mnt/cache" shows a very large number of IO errors on one of the cache disks.

 

I've tried to run a SMART check, but neither the quick nor the extended SMART check will run on either cache drive.

 

Does the high number of disk IO errors indicate cache corruption, or is at least one of the cache disks failing and in need of replacement?

 

 

Screen Shot 2020-05-27 at 10.28.02 AM.png

Screen Shot 2020-05-27 at 10.28.20 AM.png

Screen Shot 2020-05-27 at 10.28.46 AM.png

 

Edit: Small update for clarity

tower-diagnostics-20200527-0935.zip tower-diagnostics-20200527-0834.zip

Edited by Giggity_Grant
Updated scrub description for more clarity

  • Author

Also, these NVMe drives have been in service for about 1 year. They were in Raid0 config for about 6 months. I moved back to Raid1 a few weeks ago.

 

Granted, I've downloaded a lot of Linux ISOs (download and media folder cache use set to "YES"), but 127TB of writes seems excessive. I currently only have 39.8TB total stored on my array.
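For a rough sense of scale, the figures quoted above (127TB written, about 1 year in service, 39.8TB stored on the array) can be turned into an average daily write rate. This is only a back-of-the-envelope sketch: the function name is made up for illustration, and it assumes exactly 365 days of service and decimal units (1TB = 1000GB).

```python
def write_rate(total_tb_written, days_in_service, stored_tb):
    """Return the average write rate and the ratio of writes to stored data.

    Hypothetical helper for illustration; assumes decimal units (1 TB = 1000 GB).
    """
    tb_per_day = total_tb_written / days_in_service
    return {
        "gb_per_day": tb_per_day * 1000,
        "writes_vs_stored": total_tb_written / stored_tb,
    }


# Figures from the post: 127 TB written in roughly one year, 39.8 TB on the array.
rate = write_rate(total_tb_written=127, days_in_service=365, stored_tb=39.8)
print(f"Average writes: {rate['gb_per_day']:.0f} GB/day")
print(f"Total writes are {rate['writes_vs_stored']:.1f}x the data stored on the array")
```

The ratio only compares cache writes with what currently sits on the array; it does not say where the extra writes came from.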

The btrfs device errors are most likely the result of one NVMe device dropping offline. See here for how to reset them and set up monitoring so you'll be warned if it happens again. If/when it does, grab the diagnostics before rebooting and post them here.
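As a rough illustration of the monitoring idea, the per-device counters printed by "btrfs device stats /mnt/cache" can be parsed and checked for non-zero values. This is a minimal sketch, not the procedure from the linked post: the sample text, device names, and counter values below are hypothetical, and the function names are made up for the example. The counters themselves can be reset with "btrfs device stats -z /mnt/cache".

```python
import re
from collections import defaultdict

# Hypothetical sample in the format printed by `btrfs device stats <path>`.
# Device names and values are invented for illustration.
SAMPLE = """\
[/dev/nvme0n1p1].write_io_errs    0
[/dev/nvme0n1p1].read_io_errs     0
[/dev/nvme0n1p1].flush_io_errs    0
[/dev/nvme0n1p1].corruption_errs  0
[/dev/nvme0n1p1].generation_errs  0
[/dev/nvme1n1p1].write_io_errs    1432
[/dev/nvme1n1p1].read_io_errs     87
[/dev/nvme1n1p1].flush_io_errs    0
[/dev/nvme1n1p1].corruption_errs  0
[/dev/nvme1n1p1].generation_errs  0
"""

# One line looks like: [<device>].<counter_name>   <value>
LINE_RE = re.compile(r"^\[(?P<dev>[^\]]+)\]\.(?P<counter>\w+)\s+(?P<value>\d+)$")


def parse_device_stats(text):
    """Parse stats output into {device: {counter: value}}."""
    stats = defaultdict(dict)
    for line in text.splitlines():
        match = LINE_RE.match(line.strip())
        if match:
            stats[match["dev"]][match["counter"]] = int(match["value"])
    return dict(stats)


def devices_with_errors(stats):
    """Keep only devices that have at least one non-zero counter."""
    return {
        dev: {name: value for name, value in counters.items() if value}
        for dev, counters in stats.items()
        if any(counters.values())
    }


failing = devices_with_errors(parse_device_stats(SAMPLE))
for dev, counters in failing.items():
    print(dev, counters)
```

On a live system, the text could come from running the command with subprocess.run(["btrfs", "device", "stats", "/mnt/cache"], capture_output=True, text=True) instead of the sample string. A script like this could be run on a schedule to raise a warning when any counter becomes non-zero.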

  • Author

Thank you @johnnie.black!! 

 

What are your thoughts on the total TBW over one year?

1 minute ago, Giggity_Grant said:

What are your thoughts on the total TBW over one year?

On the low side, this is mine after about 3 years:

imagem.png.6033b1cdfc95ff1ba42981b82ad65a92.png

 

Archived

This topic is now archived and is closed to further replies.
