having spent all weekend and this evening i can say yes, i'm using OpenWebUI, Ollama and local AI. Now if there is just some way for ollama to not use my CPU and use my new RTX 5090 instead I would be something approaching a happy man. I've watched every video I can find, added numerous suggested parameters and restarted more times than i can remember. It still the same, the driver app sees the card, ollama goes, "feck that i'm having your CPU cores." For the love of all that is holy, have you seen a post or a guide that forces it to use my GPU? I'm using the open driver, latest version of unraid and the last of my will to live.