I've noticed my Ollama docker container is not using the GPU even though it is available when you exec into it so looking for help.
I have the nVidia plugin and driver installed. I have tried restarting, removing the container and pulling it back down but it's still not working.
Config below. I have deliberately pinned it to 4 cores to make sure it does not consumer all my CPU's whilst it's running and also limited the memory (bad experience with llama3.3). Any help would be appreciated.
Docker Template:
SMI output from within the container:
Top showing it's using CPU executed inside the container: