[6.8.3] docker image huge amount of unnecessary writes on cache

TexasUnraid · July 1, 2021

Ok, one more journal entry then going to let things settle for a few days and if everything checks out I will post a writeup on this process.

Where would I post this that might attract a plugin dev that might be interested in turning this process into a plugin?

I have most of the commands and setup figured out, I just have no clue where to even start with a plugin.

Ok, the final step was to figure out how to handle the appdata ramdrive and I am currently doing this and after several reboots and other tests, it seems to be working good and seamlessly.

The first step after figuring out which appdata needs to move to the ramdisk is to create the ramdisk itself and then copy the appdata into it from the SSD.

First create a folder in /mnt/cache/appdata/, Very important to create the folder on the drive itself and NOT in /user.

mkdir /mnt/cache/appdata/appramdisk
chmod 777 /mnt/cache/appdata/appramdisk

after this I use a very basic user script that is set to "run at array start":

echo ---------------------------------Create ramdisk for appdata----------------------------------
mount -vt tmpfs -o size=8G appramdisk /mnt/write-cache/appdata/appramdisk


echo ---------------------------------rsync to ramdisk in appdata----------------------------------
rsync -ah --stats --delete /mnt/user/appdata/binhex-qbittorrentvpn /mnt/user/appdata/appramdisk
rsync -ah --stats --delete /mnt/user/appdata/binhex-nzbhydra2 /mnt/user/appdata/appramdisk
rsync -ah --stats --delete /mnt/user/appdata/*arr /mnt/user/appdata/appramdisk

I then have a separate script set to run hourly that rsync's the ramdisk back to the SSD:

rsync -ahv --progress --delete /mnt/user/appdata/appramdisk/* /mnt/user/appdata/

Now for the shutdown, I created a "stop" file on the USB drive at /boot/config. It is called first thing when you click shutdown/reboot in the GUI and the rest of the shutdown will wait until it is finished.

touch /boot/config/stop

In the stop file I decided to simply redirect it to a script in user scripts called "Run at Shutdown" to make it easier to manage.

#!/bin/bash

#Runs the user script "Run at Shutdown" during shutdown or reboot.
#it is called before anything else during the shutdown process

# Invoke 'Run at Shutdown' script if present
if [ -f /boot/config/plugins/user.scripts/scripts/Run\ at\ Shutdown/script ]; then
  echo "Preparing Run at Shutdown script"
  cp /boot/config/plugins/user.scripts/scripts/Run\ at\ Shutdown/script /var/tmp/shutdown
  chmod +x /var/tmp/shutdown
  logger Starting Run at Shutdown script
  /var/tmp/shutdown
fi

The run at shutdown script itself first stops all running docker containers so they can close out open files. It then rsyncs the appramdisk back to the SSD before clearing the ramdisk and unmounting it.

#!/bin/bash
#description=This script runs first thing at shutdown or reboot and handles rsyncing appramdisk and unmounting it.

logger Stopping Dockers
docker stop $(docker ps -q)
logger Dockers stopped

logger Started appramdisk rsync
rsync -ah --stats --delete /mnt/user/appdata/appramdisk/* /mnt/user/appdata/ | logger
logger rsync finished

logger clearing appramdisk data
rm -r /mnt/user/appdata/appramdisk/* | logger

logger unmounting appramdisk
umount -v appramdisk | logger

And thats it, seems to be working good, no hang-ups when rebooting and everything is working automatically.

All combined this has reduced me writes by ~10x it seems and my writes were pretty mild compared to a lot of guys to start out with.

It is finally low enough that I would be ok putting appdata and docker back onto my main SSD's with redundancy instead of the single piece of junk drive I am using now.

Edited July 1, 2021 by TexasUnraid

TexasUnraid · July 3, 2021

Wow, I am really impressed with how well this is working.

So 2 days ago I figured it would just be a throwaway day for data as I did a fair amount of messing around with the dockers and testing of the ramdisk setup.

Yet I still only had 13gb of writes!

Yesterday I still had a few extra writes from messing with stuff but pretty close to what I expect regularly with it syncing to the SSD every hour.

13GB of writes!

Guessing total writes would be around 11-12gb/day once it all settles down.

Looks like the average sync was around ~300mb x 24 = 7.2gb

So yeah, big fan of this new setup

Edited July 3, 2021 by TexasUnraid

TexasUnraid · July 3, 2021

Ok, I posted a guide on all this here for anyone interested:

What would be the protocol for seeing if any plugin devs are interested in turning it into a plugin?

boomam · July 4, 2021

On 6/19/2021 at 8:51 AM, TexasUnraid said:

So it has been a few days now and the results are pretty consistent.

BTRFS formatted drive + BTRFS image = 75-85gb/day

BTRFS drive + Docker folder = 60-65gb/day

XFS drive with BTRFS image = 20-25gb/day

This is useful information, as I was about to go to the effort to change over the folder option over this weekend. For such a difference, I dont think its worth the effort for me, its just another workaround in effect for the performance.

One thing I am tempted to do is pickup 2x small SSDs specifically just for the docker.img, and just have it backup once a day.

TexasUnraid · July 4, 2021

10 hours ago, boomam said:

This is useful information, as I was about to go to the effort to change over the folder option over this weekend. For such a difference, I dont think its worth the effort for me, its just another workaround in effect for the performance.

One thing I am tempted to do is pickup 2x small SSDs specifically just for the docker.img, and just have it backup once a day.

Yeah, I am sticking with the image as well. Read through the above thread, with a few tweaks to the docker templates you can vastly reduce the writes by disabling logging and enabling the /tmp ramdisk inside containers that need it.

If you really want to reduce writes you can add a ramdisk for appdata as well but that is a bit more involved.

A plugin for all this would make it super easy though with very little downsides.

I also am not that worried about the docker image itself being backed up, appdata is much more important. The docker image can be re-downlaoded pretty easily, just takes time.

TexasUnraid · July 6, 2021

So it has been a few more days and writes are holding steady at ~11-12gb/day.

I am calling this a success!

boomam · August 1, 2021

Other than the common ones that we'd expect, such as Plex (due to bug with official container), are there any overarching notes on what containers are known to be big offenders for this bug?

I assume any container that does continuous writes (such as a logging service/poller) would be big culprits?

I forgot to pull the trigger and get a small/cheap SSD specifically for containers to help preserve the lifetime of my existing SSDs, and upon taking a quick look at my 2x MX500 1Tb's, life left is 33% and 34% respectively -

LBA's Written Tb Written   Percent lifetime remain
Cache        168821397049 691.49 34%
Cache 2      167088799024 684.40 35%

Running iotop -oa for 10-15mins shows just 69Mb written by loop2, with a large 105Mb written by 'btrfs-tractacti' (with the above types of containers off). Re-running now with them restarted just as a quick compare...

Edited August 1, 2021 by boomam

TexasUnraid · August 1, 2021

database dockers are bad as well.

Thats a lot of writes, how long were the drives in there? I am afraid there are a LOT more people with write issues but they are not even aware of it.

Easiest way to get an idea of your writes is to use the logging command that was posted earlier or in my writeup thread. That will show you the files that are causing writes and you can then trace them to the containers.

The writeup shows how to trace and deal with most of the writes.

I have been sitting at a steady ~11gb of writes a day since finishing the write modifications.

boomam · August 1, 2021

1 minute ago, TexasUnraid said:

database dockers are bad as well.

Thats a lot of writes, how long were the drives in there? I am afraid there are a LOT more people with write issues but they are not even aware of it.

Easiest way to get an idea of your writes is to use the logging command that was posted earlier or in my writeup thread. That will show you the files that are causing writes and you can then trace them to the containers.

The writeup shows how to trace and deal with most of the writes.

I have been sitting at a steady ~11gb of writes a day since finishing the write modifications.

Read further up the thread, we've spoken before, quite a few times

Just retesting now as I've a little time -

10mins with all the logging containers back on shows a loop2 usage of 67Mb (so within a margin of error for the 'bad' containers being off).

Extrapolated that would mean -

67 Mb per 10mins.

402 Mb per hour.

9.64 Gb per day.

282.65 Gb per month (30 day month)

3.35 Tb per year

That's not as bad as I thought it would be - I'd guess the large loss in lifetime I'm seeing probably happened before the 6.9 update and some other changes and I'm just forgetting about the last set of numbers the last time I measured.

Also, I wouldn't discount a container as its a database as being 'bad' - its characteristics depends on the app and its usage of the database - what is cached in RAM, how it writes to the database itself, etc.

For example, NextCloud (popular with unraid users) is a low overhead Db writing app if the user/client count isn't high and files aren't updated often. Its a great example of disk IO scaling with users/data being changed.

...Regardless, I think I'll look into some new, smaller SSDs specifically for the containers the next time I go by a Microcenter.

Reserve the larger drives remaining life span for 'normal' caching.

TexasUnraid · August 1, 2021

I would check the LBA's written to truly see how many writes you have.

A bunch of small writes will have write amplification but a few large writes will not.

I log my writes every day with a script and track them that way.

boomam · August 1, 2021

Good point.

I've made note and will check daily.

Still though, I'm wondering if its worth putting together a defaco list of particularly bad apps.

boomam · August 2, 2021

It just occurred to me -

The loop2 device is literally just the docker.img file - correct?

If so, then the loss of a SSD that just holds the image, has no effect on the appdata/what we actually want to recover.

So having a SSD literally just for the container images, holds little to no risk to data, just an uptime risk due to no redundancy...

Squid · August 2, 2021

3 minutes ago, boomam said:

just an uptime risk due to no redundancy.

1-2 minute recovery time when you notice it.

itimpi · August 2, 2021

2 hours ago, boomam said:

It just occurred to me -

The loop2 device is literally just the docker.img file - correct?

If so, then the loss of a SSD that just holds the image, has no effect on the appdata/what we actually want to recover.

So having a SSD literally just for the container images, holds little to no risk to data, just an uptime risk due to no redundancy...

recreating the Docker.img and it’s contents with previous settings is a trivial and quick operation. It is the appdata contents used by each docker container that is what tends to really matter.

boomam · August 2, 2021

1 minute ago, itimpi said:

recreating the Docker.img and it’s contents with previous settings is a trivial and quick operation. It is the appdata contents used by each docker container that is what tends to really matter.

Yup.

Can likely create a script to monitor, and remediate if needed too...

boomam · August 3, 2021

So this is odd - put in 2x 250Gb SSDs to act as a second cache pool, strictly for the docker.img file.

Moved the file, expanded it a little and restarted the service.

...whilst it does seem to consistently do writes, I'm seeing an equal number of writes, offset, to the main SSD cache pool.

Strange...

mgutt · August 3, 2021

7 hours ago, boomam said:

put in 2x 250Gb SSDs to act as a second cache pool, strictly for the docker.img file.

I would use one SSD, format it as XFS and set docker to path mode. By that the write amplification is even more reduced. Disabling healthcheck on all containers is an additional easy task.

TexasUnraid · August 3, 2021

Depends on the containers from my testing, in my case the docker logs caused a lot more writes then the health checks overall. But both are issues for sure.

boomam · August 3, 2021

7 hours ago, mgutt said:

I would use one SSD, format it as XFS and set docker to path mode. By that the write amplification is even more reduced. Disabling healthcheck on all containers is an additional easy task.

Ordinarily i would, but i wasn't bothered about 2x cheap second hand SSDs dying due to amplification.

Having 2x gives me that failover and enough time to react if needed.

Regardless, i'm a little confused why i'm still seeing an amount of IO on the main cache though considering loop2/docker.img is moved to the secondary cache.

mgutt · August 4, 2021

18 hours ago, boomam said:

i'm a little confused why i'm still seeing an amount of IO

Your containers are reading from / writing on files in /mnt/cache/appdata

boomam · August 4, 2021

3 hours ago, mgutt said:

Your containers are reading from / writing on files in /mnt/cache/appdata

Yes, this i know.

But there are no containers actively creating that level of disk IO, or at least shouldn't be.

TexasUnraid · August 4, 2021

Log it and find it out what is causing it.

boomam · August 9, 2021

Looping back - getting quite a lot of consistent writes from the 'Ghost' container.

Seems to be dumping a few Kb of text to its log files every few seconds due to a random bug - investigating the cause and will post a fix here should i find one, for others reference.

TexasUnraid · August 9, 2021

Not sure what ghost is but post the file activity log for the problem. Most of those writes can be dealt with using the methods in the writes thread I made.

If it is writing to a log file and the log file is in it's own folder the easiest option is to simply make that folder a ramdrive in the container.

boomam · August 9, 2021

Its a website/CMS, similar to WordPress.

The issue is 'in-app' as the errors it displays that get written out are to do an issue with its tagging. Its not a docker/loop2 issue, but an in-app issue thats causing more things to be logged than needed.

[6.8.3] docker image huge amount of unnecessary writes on cache

User Feedback

Recommended Comments

TexasUnraid 113

Link to comment

TexasUnraid 113

Link to comment

TexasUnraid 113

Link to comment

boomam 15

Link to comment

TexasUnraid 113

Link to comment

TexasUnraid 113

Link to comment

boomam 15

Link to comment

TexasUnraid 113

Link to comment

boomam 15

Link to comment

TexasUnraid 113

Link to comment

boomam 15

Link to comment

boomam 15

Link to comment

Squid 4987

Link to comment

itimpi 2246

Link to comment

boomam 15

Link to comment

boomam 15

Link to comment

mgutt 2528

Link to comment

TexasUnraid 113

Link to comment

boomam 15

Link to comment

mgutt 2528

Link to comment

boomam 15

Link to comment

TexasUnraid 113

Link to comment

boomam 15

Link to comment

TexasUnraid 113

Link to comment

boomam 15

Link to comment

Join the conversation