May 5

So I got the Intel Arc Pro B70 because I didn't want to support NVIDIA, AMD, or OpenAI. I want to support affordable local AI options and privacy. However, so far I have only been able to use Intel-IPEX-LLM-Ollama (uberchuckie/ollama-intel-gpu:latest) with Open-WebUI, and it only works with a very limited number of models.

Qwen 3.6 - Gives me an error (unable to load model: /root/.ollama/models/blobs/sha256-f5ee307a29.....)
Gemma4 - Won't even download (pull model manifest: 412: The model you are attempting to pull requires a newer version of Ollama. Please download the latest version at: https://ollama.com/download) Note: Ollama is already fully up-to-date.
Ministral-3 - Won't download either (pull model manifest: 412: The model you are attempting to pull requires a newer version of Ollama. Please download the latest version at: https://ollama.com/download)
nemotron-3-nano:30b - Won't download...

So I'm wondering whether I'm stuck with llama3.1 and Deepseek-r1 while waiting for further development, or is there a way I can actually use the card now?
May 5 (Community Expert)

2 minutes ago, JustOverride said: "Note: Ollama is already fully up-to-date."

The image you're using with Intel GPU support says it's using Ollama 0.9.3:

"The latest Ollama portable supported by intel/ipex-llm is 0.9.3."

But Ollama is currently at version 0.23.1, so the Ollama inside that image is not actually up to date, and that is why the newer models refuse to download.

So yeah, you're kind of stuck unless you find something else that supports those cards. Maybe you can run the standard build using Vulkan?
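For anyone who wants to confirm which Ollama version their container really runs, here is a minimal Python sketch. It assumes the container exposes the Ollama API on the default http://localhost:11434 (adjust for your own port mapping) and uses Ollama's `/api/version` endpoint. The helper names `parse_version`, `server_version`, and `is_older` are made up for this example. The version numbers are the ones quoted in this thread.

```python
import json
import re
import urllib.request


def parse_version(text):
    """Turn a version string such as '0.9.3' into a tuple of ints: (0, 9, 3)."""
    numbers = []
    for piece in text.strip().lstrip("v").split("."):
        match = re.match(r"\d+", piece)  # keep only the leading digits of each part
        if match is None:
            break
        numbers.append(int(match.group()))
    return tuple(numbers)


def is_older(installed, required):
    """True if `installed` is a lower version than `required`.

    Comparing tuples of ints avoids the trap of comparing strings,
    where '0.9.3' would wrongly sort after '0.23.1'.
    """
    return parse_version(installed) < parse_version(required)


def server_version(base_url="http://localhost:11434"):
    """Ask a running Ollama server for its version via GET /api/version.

    The base URL is an assumption; change it to match your container.
    """
    with urllib.request.urlopen(base_url + "/api/version", timeout=5) as response:
        return json.load(response)["version"]
```

With the container running, `is_older(server_version(), "0.23.1")` should tell you whether the bundled Ollama is behind the release mentioned above. If it is, the 412 errors are most likely coming from the old build inside the image rather than from anything you can update yourself.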
May 7

I plan to buy a B70 too, but for the moment I'm trying to build an AI stack with the iGPU and the NPU of my Core Ultra 285HX.

I tried several solutions with ipex-llm and ollama-intel, but they are very limited. ipex-llm is no longer under development, and ollama-intel seems to be compiled against an old version of Ollama and is not able to load recent models.

This week I found OpenArc. It looks very promising, but for me it's a steep learning curve. It's based on OpenVINO and uses models from Hugging Face.

So far the container runs and I've been able to connect Open WebUI. The drivers are correctly loaded and the available devices are correctly reported (CPU, iGPU, NPU), but I have not been able to download a model. Apparently you need to install the huggingface-cli tool to manage models.

I'm still looking for better alternatives to run local AI on Intel hardware.
May 10 (Author)

On 5/7/2026 at 2:26 PM, caplam said: "I plan to buy a B70 too, but for the moment I'm trying to build an AI stack with the iGPU and the NPU of my Core Ultra 285HX. I tried several solutions with ipex-llm and ollama-intel, but they are very limited. ipex-llm is no longer under development, and ollama-intel seems to be compiled against an old version of Ollama and is not able to load recent models. This week I found OpenArc. It looks very promising, but for me it's a steep learning curve. It's based on OpenVINO and uses models from Hugging Face. So far the container runs and I've been able to connect Open WebUI. The drivers are correctly loaded and the available devices are correctly reported (CPU, iGPU, NPU), but I have not been able to download a model. Apparently you need to install the huggingface-cli tool to manage models. I'm still looking for better alternatives to run local AI on Intel hardware."

I agree, that's exactly what I found. I basically have a very expensive transcoding card and can only use old models. But I'll be patient. I'll happily put my money where it hurts these other greedy companies.
May 12

I am making progress. I'm using OpenArc and have loaded a Qwen3 model on the iGPU of my Core Ultra, with Open WebUI as the GUI.

As I'm just starting out with AI, I can see that I have a ton to learn to make better use of it (how to choose a model, system prompts, skills, agents, knowledge bases, ...), but it looks promising. There is valuable information in the Open WebUI communities.

You should try OpenArc and explore their GitHub.

Another option would be OpenVINO Model Server, but I find their documentation quite confusing. Even the names of the different components are confusing.
May 18

Phew! Not just me, then. I'm currently running my OG A770 LE but was eyeing the B70 for my local AI/security features. Guess I'll stop tinkering with settings and just hold out for the internal Ollama version to be updated to 0.23.0.
May 21 (Author)

Hopefully we will have better support soon. Also, Ed (spaceinvader) said in the comments of his latest video that he would include a setup for Intel cards in an upcoming video. He created one of the Docker containers that support Intel Arc, so I'm hopeful.
May 21 (Community Expert)

Ed has set this up on my B50, which he has access to. I believe he will be making a follow-up video at some point.
June 14 (Author)

I made a new post to get AI working well on these cards:
https://forums.unraid.net/topic/199450-intel-arc-battlemage-b570-b580-b50-b70-etc-ai-support-in-unraid/