Hello everyone,
I've been encountering some issues with my Unraid server recently and could use some assistance. Here's a brief summary of the problem:
I experienced corrupted data following a power loss incident.
Numerous BTRFS errors have been occurring since then.
Initially, I managed to keep things running by restarting the system occasionally and recreating the docker.img file.
However, I decided to address the problem this weekend after receiving my UPS to protect against future power losses. Here's what I did:
Backed up my data using the "CA Backup Appdata" tool after deleting any corrupted temporary files to ensure a clean backup.
Completely wiped the BTRFS pool and formatted the two cache drives.
Started fresh by restoring the appdata and recreating the docker.img file.
Unfortunately, the BTRFS pool started exhibiting errors again just two days after the cleanup. Additionally, the Docker service intermittently fails to start, although the containers themselves remain responsive in their respective WebUIs. However, communication between containers becomes impossible. I have temporarily made the docker.img 40GB to remove storage space issues from the equation, but the problems still persist.
The only way to stop the containers now is to reboot the server, which results in an unclean shutdown because Unraid fails to stop everything properly.
I suspect that the issue might be related to my RAM, but memtest did not detect any errors.
I searched on other forum threads but couldn't find a solution. I would greatly appreciate any guidance or suggestions to help resolve these ongoing issues. Thank you in advance for your assistance!
This is what the Docker tab looks after a few minutes/hours of working properly
Log looks like this:
All of the drives still have plenty of space left:
unraid-diagnostics-20230607-1110.zip