July 8, 2025Jul 8 Since upgrading to 7.1.4 from 7.1.2 I've been getting random occurrences of the following error "Docker Service failed to start".Steps I've tried to resolve thisAfter error occurred, go to Settings, Docker, Enable Docker, off, apply, than turning back on, apply. Error remained the same, Docker service was not restarted.On reddit, https://www.reddit.com/r/unRAID/comments/uhdhuz/docker_service_failed_to_start/ it was suggested to increase the Docker vDisk size. I tried this with no effect on the error.from terminal, call "/etc/rc.d/rc.docker restart" . This did get docker running again but several of my containers which use static assigned IPs would not startReboot server, this always gets everything back working but for an unknown amount of time.Prior to 7.1.4 I've not seen this failure mode. diagnostics attachedThanks for any help to resolve this issue.Error shown on docker tab. totower-diagnostics-20250707-0751.zip Edited July 8, 2025Jul 8 by mwasserman
July 8, 2025Jul 8 Community Expert Appears you are having issues with your cache pool... Might want to check thereJun 24 08:14:18 Tower kernel: BTRFS info (device nvme0n1p1): bdev /dev/nvme1n1p1 errs: wr 26624243, rd 19056641, flush 145986, corrupt 216843, gen 0Looks like read/write errors and corruption. Edited July 8, 2025Jul 8 by MowMdown
July 8, 2025Jul 8 Community Expert One of the pool devices dropped offline in the past, scrub the pool and post the results from the GUI.
July 9, 2025Jul 9 Author Thanks for the tips, yes. I've been having random drop outs of my Cache nvme drives. I'm running 2x btrfs as a mirrord set. I have found that every time I have a nvme drive drop out, it comes right back after a system reboot. I did run scrub (sorry I don't have a screen shot). Scrub found many correctable errors. On a 2nd pass of scrub no errors were detected. Others seem to think there may be an issue with btrfs and unraid. https://www.reddit.com/r/unRAID/comments/18wkga3/frequent_crashing_resolved_by_cache_migration/ .My plan is to Pull all the data off CacheFormat as ZFSbring the data back to Cacheremove and rebuild the docker.imgI'll report back with any results.
July 9, 2025Jul 9 Community Expert Devices dropping offline is not a BTRFS issue, likely the same will happen with zfs, recomend better pool monitoring with either: https://forums.unraid.net/topic/46802-faq-for-unraid-v6/page/2/#findComment-700582
July 15, 2025Jul 15 Author Solution Wanted to report back with some updates.Hard to know if this was related but had been using Intel Graphics SR-IOV to split the GPU of my i7-14700K for use with Plex (Container) + VM (games). Unknown if this was the root cause of any of the drive dropouts but I disabled that plugin for now and moved to a dedicated pass though GPU.Moved catch from BTRFS to ZFS. Hoping that if I do get a drive dropout, I won't be dealing with some of the corruption problems I had with BTRFS during a drive dropout.4 days running this setup so far with zero issuesUpdate: 24 days, no issues after the above changes. I'm officially a ZFS fan Edited August 2, 2025Aug 2 by mwasserman Update to issue
Join the conversation
You can post now and register later. If you have an account, sign in now to post with your account.
Note: Your post will require moderator approval before it will be visible.