[GH-ISSUE #4248] error loading model architecture: unknown model architecture: 'qwen2moe' #2649

Closed
opened 2026-04-12 12:59:36 -05:00 by GiteaMirror · 1 comment
Owner

Originally created by @li904775857 on GitHub (May 8, 2024).
Original GitHub issue: https://github.com/ollama/ollama/issues/4248

What is the issue?

Qwen1.5-MoE-A2.7B-Chat is installed by convert-hf-to-gguf.py according to the process. After 4-bit quantization, ollamamodelfile is created, but it is not supported when loading. What is the cause of this?

OS

Windows

GPU

Nvidia

CPU

AMD

Ollama version

0.1.32

Originally created by @li904775857 on GitHub (May 8, 2024). Original GitHub issue: https://github.com/ollama/ollama/issues/4248 ### What is the issue? Qwen1.5-MoE-A2.7B-Chat is installed by convert-hf-to-gguf.py according to the process. After 4-bit quantization, ollamamodelfile is created, but it is not supported when loading. What is the cause of this? ### OS Windows ### GPU Nvidia ### CPU AMD ### Ollama version 0.1.32
GiteaMirror added the model label 2026-04-12 12:59:36 -05:00
Author
Owner

@dhiltgen commented on GitHub (Jul 25, 2024):

Qwen support has been added more recently. Can you try on the latest release?

<!-- gh-comment-id:2251063563 --> @dhiltgen commented on GitHub (Jul 25, 2024): Qwen support has been added more recently. Can you try on the latest release?
Sign in to join this conversation.
1 Participants
Notifications
Due Date
No due date set.
Dependencies

No dependencies set.

Reference: github-starred/ollama#2649