[6.8.3] docker image huge amount of unnecessary writes on cache

T0rqueWr3nch · June 18, 2020

10 minutes ago, TexasUnraid said:

Agreed, I can't make sense of it.

I think most of you that have the truly extreme write black holes are running things like plex, my best guess is that these fixes help the issue those dockers have but not the underlying issue.

I only run very mild dockers, lancache, krusader, mumble, qbittorrent etc that are not actively doing anything right now.

The difference from putting docker/appdata on an XFS array drive vs the btrfs cache is undeniable though at around 200-300mb/hour vs 1000-1500mb/hour and climbing in most cases.

I would have loved to have blamed it on your individual Docker containers, but I agree, those don't seem like extravagant containers. PMS is definitely a clunker. A lot of database containers also seem to be particularly bad about cache writes. MongoDB was horrendous for me:

Since you seem to still be experiencing this issue, could I get you to run

docker stats

I'm curious if Block I/O identifies a particular container.

-TorqueWrench

TexasUnraid · June 18, 2020

Yeah, these are all pretty well behaved containers in theory. Particularly since they are not actively doing anything right now, still just testing things. I have a few other containers like nextcloud and mainandb installed but they are not setup and not sure if I will use them so I don't have the dockers running for most of the testing. I did see increased writes with them running but I wanted to keep things consistent.

Here is docker stats, nothing stands out to me. This is after having everything running for over 3 hours and 1.5GB being written per hour according to the LBA logging.

Interestingly, if I add up all the block I/O numbers and divide by 3 hours and change, it works out to almost exactly what I was seeing with docker and appdata on an XFS drive.

Edited June 18, 2020 by TexasUnraid

Kevek79 · June 18, 2020

Just read this thread today, as I wanted to see if there are any major roadblocks for me when upgrading my system from 6.7.2 to the current stable release.

After reading thrue all ten pages of this thread upgrading to 6.8.x did not seam to be a good idea

Now that I am home and have access to my system I checked my cache pool drives on a otherwise well behaving unraid install and found out, that my two Samsung EVO 860 500 GB Sata drives which are only 3 months old have written about 50 TB if the LBA calculator did not lie to me.

Reading the thread it looked like a 6.8.x issue allone, but as I am on 6.7.2. and I am pretty sure that I have in no way intentionally written that vast amount of data to those poor little SSDs I might be in the same boat as the rest of the bunch, but can only confirm that it is the same issue after further investigation.

With this rate I would chew thrue the 300TBW warranty limit in about 18 months - puh

Edited June 18, 2020 by Kevek79
Typo

grigsby · June 18, 2020

1 hour ago, Kevek79 said:

Reading the thread it looked like a 6.8.x issue allone, but as I am on 6.7.2.

The bug was originally reported in 6.7.2. The thread title was changed to 6.8.3 when it was discovered that it still exists in the current release.

Edited June 18, 2020 by grigsby

Kevek79 · June 18, 2020

7 minutes ago, grigsby said:

The bug was originally reported in 6.7.2. The thread title was change to 6.8.3 when it was discovered that it still exists in the current release.

I did not realize that.
That makes it more likely that its the same issue.

Kevek79 · June 18, 2020

On 5/14/2020 at 1:59 PM, johnnie.black said:

While we wait for the fix, anyone reached a PB? I'm not that far:

Just because I'm curious: Did you hit the PB yet @johnnie.black

Edited June 18, 2020 by Kevek79

JorgeB · June 19, 2020

10 hours ago, Kevek79 said:

Just because I'm curious: Did you hit the PB yet @johnnie.black

image.png.4a3c98b2348ed525254816f7ffeb12e0.png

Still a few months away, at current pace I estimate hitting 1PB around Halloween, this assuming the NVMe device doesn't give up the ghost, since it's well past its 300TBW rating.

thecode · June 19, 2020

I see that the topic is referring to docker running on cache, I'm not running any dockers on the cache (at least not ones that I can't stop for few days).
After reading this thread I checked my setup (it is a new setup, first version used is 6.8.3) and noticed around 40mb/sec writes to the cache, a new drive already got 1.5TB written.
I'm using two NVMes in a raid1 cache pool, When I tested the system I had only one drive and did not notice high writes, but I might have missed it.
2 VMs are running on the cache, Windows 10 with Blue Iris that store data on the Array, and HassOS (which uses MariaDB inside).
My first assumption was that it is related to the DB inside the HassOS VM, I have installed MariaDB as a docker and let it store on the cache, writing dropped from 40mb/sec to around 5-6 which is still high. On the other hand, the MariaDB only writes about 100-200Kb/sec on the array. Moving forward I moved the whole HassOS VM + Maria DB data to an unassigned SSD (xfs), cache writes dropped down to 1-2mb/sec which is still high, the Windows VM has most of it services disabled and I doubt it write so much data.
Monitoring the HassOS VM + DB on SSD using LBA showed about 6GB for 12 hours (around 140kB/sec).
The cache which has nothing on it besides the Win 10 VM has already accumulated more than 20GB of writes in the same 12 hours period.
I am thinking of moving the Win 10 VM to the unassigned SSD also, but have no idea what should be the next step, my original plan was to use the brtfs mirror on the cache as a sort of fault tolerance, but l doubt it will live long with such high write rate.

JorgeB · June 19, 2020

18 minutes ago, thecode said:

I see that the topic is referring to docker running on cache

Yes, the topic is mostly about that, but for example I have the problem on one of my VMs, and only one, despite having 3 on the same device, and no issues with the docker image which also is on the same device, it's kind of a strange issue.

Kevek79 · June 19, 2020

2 hours ago, johnnie.black said:

Still a few months away, at current pace I estimate hitting 1PB around Halloween, this assuming the NVMe device doesn't give up the ghost, since it's well past its 300TBW rating.

Good luck for that, but as your numbers are way higher and the nvme is still working I can sleep a bit better with my 50TBW up till now

As everything else is working great and the only issue is still in the current stable release I might skip 6.8 totaly and wait for 6.9.

So lets hope that 6.9 rc1 is just around the corner and has this one fixed.

I am eager to test that new release out because of the multiple cache pool options, but as I have only one system available I want to wait at least for a RC version before upgrading.

TexasUnraid · June 19, 2020

Ok, I left the beta running overnight with docker and app data on the cache.

Sure enough, it also started steadily climbing, was up to almost 2GB/hour this morning.

So the beta did not help my write issues and if anything they are worse.

So looks like I need to find another drive to use for docker formatted as XFS in the array.

Edited June 19, 2020 by TexasUnraid

zoggy · June 19, 2020

Looks like I've not been impacted?

cache drive is btrfs, with the following dockers: mariadb, kodi-server, duplicati

# cat /etc/unraid-version; /usr/sbin/smartctl -A /dev/sdb | awk '$0~/Power_On_Hours/{ printf "Days: %.1f\n", $10 / 24} $0~/LBAs/{ printf "TBW: %.1f\n", $10 * 512 / 1024^4 }'
version="6.8.3"
Days: 646.9
TBW: 10.3

-Daedalus · June 19, 2020

Just spit-balling here, but I seem to remember an issue with Samsung drives (mostly 850s at the time). Something to do with a non-standard starting block.

I don't suppose anyone with this issue is using non-Samsung disks?

Niklas · June 19, 2020

4 minutes ago, -Daedalus said:

Just spit-balling here, but I seem to remember an issue with Samsung drives (mostly 850s at the time). Something to do with a non-standard starting block.

I don't suppose anyone with this issue is using non-Samsung disks?

No Samsungs here.

(Seagate IronWolf 110 SATA SSDs)

Edited June 19, 2020 by Niklas

TexasUnraid · June 19, 2020

Only samsung driver here is the one I added in to log LBA's to make logging over time easier. The issue presented itself when I was only using other brand drives.

TexasUnraid · June 19, 2020

I just reinstalled unraid after all this testing to get a fresh start before I put this server into use.

The writes are all of the sudden extreme. Been getting 5GB+/hour or more then last few hours with the same dockers and settings as before.

No idea why, going to move things to an XFS drive in the morning but no idea why it is so much worse now, docker stats still show they are all very well behaved like before.

limetech · June 19, 2020

12 hours ago, johnnie.black said:

Yes, the topic is mostly about that, but for example I have the problem on one of my VMs, and only one, despite having 3 on the same device, and no issues with the docker image which also is on the same device, it's kind of a strange issue.

This topic is tldr but wondering if anyone has tried turning off btrfs COW? Either on the docker.img file itself (if stored on a btrfs volume) or within the btrfs file system image.

TexasUnraid · June 19, 2020

1 minute ago, limetech said:

This topic is tldr but wondering if anyone has tried turning off btrfs COW? Either on the docker.img file itself (if stored on a btrfs volume) or within the btrfs file system image.

I am happy to test if you can tell me how.

grigsby · June 19, 2020

32 minutes ago, limetech said:

This topic is tldr

Well, I gotta say, LimeTech's response to this bug has been impressive -- in a not good way. This is a major, potentially catastrophic bug that could result in loss of data, time, and hardware/money that was first reported seven months ago, and the only two comments LimeTech makes about it are dismissing it as "tldr"?

I first installed Unraid in May on a new server build and promptly purchased a license for $89. Obviously I don't have much history with Unraid or the company, but their total non-response to this bug report is disheartening.

limetech · June 19, 2020

33 minutes ago, grigsby said:

dismissing it as "tldr"

This is not the only issue or the only thing we are working in. The 'tldr' was meant as a solicitation for someone to summarize the issue to save time, not being dismissive. I've seen this kind of thing before where I/O reporting is wildly off vs. what's happening on the media, especially with btrfs.

goodGame · June 19, 2020

1 hour ago, limetech said:

This topic is tldr but wondering if anyone has tried turning off btrfs COW? Either on the docker.img file itself (if stored on a btrfs volume) or within the btrfs file system image.

Would this do it?

Shut down docker/array

chattr -R +C /mnt/user/system/docker
rm -rf /mnt/user/system/docker/docker.img

start docker/array

TexasUnraid · June 19, 2020

6 minutes ago, limetech said:

This is not the only issue or the only thing we are working in. The 'tldr' was meant as a solicitation for someone to summarize the issue to save time, not being dismissive. I've seen this kind of thing before where I/O reporting is wildly off vs. what's happening on the media, especially with btrfs.

In this case we can assure you that it is not a reporting issue as iotop and the raw LBA's written to the drives both show heavily inflated writes.

For example on an XFS drive I get around 200-300mb/hour writes which lines up with what docker stats says.

On the cache the LBA's were 1GB/hour and climbing over time, upwards of 2GB/hour when left overnight.

I just reinstalled a few hours ago, now writes as measured by the LBA's of the smart output are upwards of 5GB an hour and climbing.

One thing I did on the old setup was move the cache to an XFS drive and back to cache, someone else reported this helped, maybe it helped me as well, just didn't fix the issue.

limetech · June 19, 2020

13 minutes ago, TexasUnraid said:

For example on an XFS drive

You mean an SSD device formatted with xfs?

TexasUnraid · June 19, 2020

12 minutes ago, limetech said:

You mean an SSD device formatted with xfs?

Either SSD or HDD, it didn't matter, XFS writes were what they should be.

Any BTRFS drive would have anywhere from 5x-15x+ the writes and it would climb over time. Although the amount of writes would vary some depending on factors we could not understand.

For example just the appdata being on the cache but docker on an XFS would still cause some very inflated writes 100x more then if reversed.

On my current setup, I should see 200-300mb/hour writes. I am actually seeing 5bg/hour writes and climbing.

At this rate my SSD's will not even last 2 years.

The only fix I have found is to move appdata and docker to an XFS formatted drive in the array. Multiple cache pools would be real handy since I could just make another cache pool for it but can't wait that long. Still got to waste a whole drive just for dockers to keep it from killing my drives.

Edited June 20, 2020 by TexasUnraid

limetech · June 20, 2020

9 minutes ago, TexasUnraid said:

On my current setup, I should see 200-300mb/hour writes. I am actually seeing 5bg/hour writes and climbing.

If you click on the device on Main and look at the SMART data, what's the value "data units written" attribute? Does it line up with what you are measuring as MB/hour being written?

[6.8.3] docker image huge amount of unnecessary writes on cache

User Feedback

Recommended Comments

T0rqueWr3nch 43

Link to comment

TexasUnraid 113

Link to comment

Kevek79 12

Link to comment

grigsby 6

Link to comment

Kevek79 12

Link to comment

Kevek79 12

Link to comment

JorgeB 7474

Link to comment

thecode 49

Link to comment

JorgeB 7474

Link to comment

Kevek79 12

Link to comment

TexasUnraid 113

Link to comment

zoggy 37

Link to comment

-Daedalus 73

Link to comment

Niklas 57

Link to comment

TexasUnraid 113

Link to comment

TexasUnraid 113

Link to comment

limetech 3326

Link to comment

TexasUnraid 113

Link to comment

grigsby 6

Link to comment

limetech 3326

Link to comment

goodGame 3

Link to comment

TexasUnraid 113

Link to comment

limetech 3326

Link to comment

TexasUnraid 113

Link to comment

limetech 3326

Link to comment

Join the conversation