[GH-ISSUE #5801] unknown architecture DeepseekV2ForCausalLM #65654

Closed
opened 2026-05-03 22:05:48 -05:00 by GiteaMirror · 1 comment
Owner

Originally created by @DevLLM on GitHub (Jul 19, 2024).
Original GitHub issue: https://github.com/ollama/ollama/issues/5801

i have this problem:

docker run --rm -v .:/model ollama/quantize -q q4_K_M /model
unknown architecture DeepseekV2ForCausalLM

ollama --version
ollama version is 0.2.7`

Originally created by @DevLLM on GitHub (Jul 19, 2024). Original GitHub issue: https://github.com/ollama/ollama/issues/5801 i have this problem: docker run --rm -v .:/model ollama/quantize -q q4_K_M /model unknown architecture DeepseekV2ForCausalLM ollama --version ollama version is 0.2.7`
GiteaMirror added the model label 2026-05-03 22:05:48 -05:00
Author
Owner

@rick-github commented on GitHub (Jul 19, 2024):

DeepseekV2ForCausalLM is not a supported architecture, see the project page. You might be able to do it with the llama.cpp container, which is more up to date:

docker run -rm --gpus all -v .:/models ghcr.io/ggerganov/llama.cpp:full-cuda -q /models/ggml-model-f16.gguf Q4_K_M

Although it's not clear what model you are trying to quantize, both deepseek-v2 and deepseek-coder-v2 models are already available on ollama.com at https://ollama.com/search?q=deepseek, so perhaps you don't need to.

<!-- gh-comment-id:2240506793 --> @rick-github commented on GitHub (Jul 19, 2024): DeepseekV2ForCausalLM is not a supported architecture, see the [project page](https://registry.hub.docker.com/r/ollama/quantize). You might be able to do it with the llama.cpp container, which is more up to date: ``` docker run -rm --gpus all -v .:/models ghcr.io/ggerganov/llama.cpp:full-cuda -q /models/ggml-model-f16.gguf Q4_K_M ``` Although it's not clear what model you are trying to quantize, both deepseek-v2 and deepseek-coder-v2 models are already available on ollama.com at https://ollama.com/search?q=deepseek, so perhaps you don't need to.
Sign in to join this conversation.
1 Participants
Notifications
Due Date
No due date set.
Dependencies

No dependencies set.

Reference: github-starred/ollama#65654