[GH-ISSUE #13567] gpt-oss slow using vulkan #70993

Open
opened 2026-05-04 23:41:28 -05:00 by GiteaMirror · 5 comments
Owner

Originally created by @Swagatade on GitHub (Dec 26, 2025).
Original GitHub issue: https://github.com/ollama/ollama/issues/13567

What is the issue?

I used all model using Intel Vulkan support very fast I used 130v gpu 16gb. But vulkan support very slow. I used only cpu this time very fast but using only gpu this time very slow. I used llama.cpp this application using vulkan same gpu very fast. Ollama very slow.

Relevant log output


OS

No response

GPU

No response

CPU

No response

Ollama version

No response

Originally created by @Swagatade on GitHub (Dec 26, 2025). Original GitHub issue: https://github.com/ollama/ollama/issues/13567 ### What is the issue? I used all model using Intel Vulkan support very fast I used 130v gpu 16gb. But vulkan support very slow. I used only cpu this time very fast but using only gpu this time very slow. I used llama.cpp this application using vulkan same gpu very fast. Ollama very slow. ### Relevant log output ```shell ``` ### OS _No response_ ### GPU _No response_ ### CPU _No response_ ### Ollama version _No response_
GiteaMirror added the bug label 2026-05-04 23:41:28 -05:00
Author
Owner

@0x7CFE commented on GitHub (Dec 26, 2025):

What do you mean by being fast or slow? Tokens per second? Load time?

<!-- gh-comment-id:3693283284 --> @0x7CFE commented on GitHub (Dec 26, 2025): What do you mean by being fast or slow? Tokens per second? Load time?
Author
Owner

@anumukul commented on GitHub (Dec 26, 2025):

Hi, I’d like to take this issue and can deliver a fix within 24 hours.
I’ve worked on similar projects before and have relevant experience, so I should be able to handle this efficiently.

<!-- gh-comment-id:3693471550 --> @anumukul commented on GitHub (Dec 26, 2025): Hi, I’d like to take this issue and can deliver a fix within 24 hours. I’ve worked on similar projects before and have relevant experience, so I should be able to handle this efficiently.
Author
Owner

@rick-github commented on GitHub (Dec 27, 2025):

@anumukul Feel free to work on this issue or any other that piques your interest. There's no need to ask to be assigned an issue, just analyze the problem, develop a solution, submit a PR.

<!-- gh-comment-id:3693523984 --> @rick-github commented on GitHub (Dec 27, 2025): @anumukul Feel free to work on this issue or any other that piques your interest. There's no need to ask to be assigned an issue, just analyze the problem, develop a solution, submit a PR.
Author
Owner

@Swagatade commented on GitHub (Dec 27, 2025):

Tokens per second.

<!-- gh-comment-id:3693634165 --> @Swagatade commented on GitHub (Dec 27, 2025): Tokens per second.
Author
Owner

@rick-github commented on GitHub (Jan 4, 2026):

Server log may help in debugging.

<!-- gh-comment-id:3707676020 --> @rick-github commented on GitHub (Jan 4, 2026): [Server log](https://github.com/ollama/ollama/blob/main/docs/troubleshooting.mdx) may help in debugging.
Sign in to join this conversation.
1 Participants
Notifications
Due Date
No due date set.
Dependencies

No dependencies set.

Reference: github-starred/ollama#70993