[GH-ISSUE #8834] clip and mllama models run on CPU on main #67782

Closed
opened 2026-05-04 11:39:55 -05:00 by GiteaMirror · 1 comment
Owner

Originally created by @jmorganca on GitHub (Feb 5, 2025).
Original GitHub issue: https://github.com/ollama/ollama/issues/8834

What is the issue?

On the main branch, the clip and mllama models run on the CPU, as the load log shows:

mllama_model_load: model name:   Llama-3.2-11B-Vision-Instruct
mllama_model_load: description:  vision encoder for Mllama
mllama_model_load: GGUF version: 3
mllama_model_load: alignment:    32
mllama_model_load: n_tensors:    512
mllama_model_load: n_kv:         17
mllama_model_load: ftype:        f16
mllama_model_load:
mllama_model_load: vision using CPU backend
mllama_model_load: compute allocated memory: 2853.34 MB
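
To check for this symptom in a longer log, a small script can scan for the backend line. The following is a minimal sketch, not part of Ollama: the function names `find_backends` and `runs_on_cpu` are hypothetical, and the general line shape `<loader>: <component> using <NAME> backend` is an assumption generalized from the single line quoted above.

```python
import re

# Matches lines such as "mllama_model_load: vision using CPU backend".
# Assumption: other loaders and backends use the same line shape; only the
# one line quoted in this issue is confirmed.
BACKEND_RE = re.compile(
    r"^(?P<loader>\w+):\s+(?P<component>\w+) using (?P<backend>\w+) backend\s*$"
)


def find_backends(log_text):
    """Return (loader, component, backend) tuples for each backend line."""
    results = []
    for line in log_text.splitlines():
        match = BACKEND_RE.match(line.strip())
        if match:
            results.append(
                (match.group("loader"), match.group("component"), match.group("backend"))
            )
    return results


def runs_on_cpu(log_text):
    """Return True if any reported backend in the log is the CPU backend."""
    return any(backend.upper() == "CPU" for _, _, backend in find_backends(log_text))


# Excerpt of the log quoted in this issue.
SAMPLE_LOG = """\
mllama_model_load: ftype:        f16
mllama_model_load:
mllama_model_load: vision using CPU backend
mllama_model_load: compute allocated memory: 2853.34 MB
"""

if __name__ == "__main__":
    print(find_backends(SAMPLE_LOG))
    print(runs_on_cpu(SAMPLE_LOG))
```

Run against the excerpt, `runs_on_cpu` reports that the vision encoder was loaded on the CPU backend, which is the behavior this issue describes.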

OS

No response

GPU

No response

CPU

No response

Ollama version

No response

GiteaMirror added the bug label 2026-05-04 11:39:55 -05:00

@mxyng commented on GitHub (Feb 5, 2025):

Fixed by #8835.


Reference: github-starred/ollama#67782