[GH-ISSUE #14631] 500 Internal Server Error when loading hf.co/HauhauCS/Qwen3.5-4B-Uncensored-HauhauCS-Aggressive:Q4_K_M via ollama run #71539

Closed
opened 2026-05-05 02:04:59 -05:00 by GiteaMirror · 1 comment
Owner

Originally created by @pinghe on GitHub (Mar 5, 2026).
Original GitHub issue: https://github.com/ollama/ollama/issues/14631

What is the issue?

I'm encountering a 500 Internal Server Error when attempting to run a quantized GGUF model from Hugging Face using Ollama.

arch ollama[1158318]: llama_model_loader: - type q4_K:  203 tensors
arch ollama[1158318]: llama_model_loader: - type q5_K:   24 tensors
arch ollama[1158318]: llama_model_loader: - type q6_K:   22 tensors
arch ollama[1158318]: print_info: file format = GGUF V3 (latest)
arch ollama[1158318]: print_info: file type   = Q4_K - Medium
arch ollama[1158318]: print_info: file size   = 2.51 GiB (5.13 BPW)
arch ollama[1158318]: llama_model_load: error loading model: error loading model architecture: unknown model architecture: 'qwen35'
arch ollama[1158318]: llama_model_load_from_file_impl: failed to load model

Ollama version

0.17.6

Originally created by @pinghe on GitHub (Mar 5, 2026). Original GitHub issue: https://github.com/ollama/ollama/issues/14631 ### What is the issue? I'm encountering a 500 Internal Server Error when attempting to run a quantized GGUF model from Hugging Face using Ollama. ``` arch ollama[1158318]: llama_model_loader: - type q4_K: 203 tensors arch ollama[1158318]: llama_model_loader: - type q5_K: 24 tensors arch ollama[1158318]: llama_model_loader: - type q6_K: 22 tensors arch ollama[1158318]: print_info: file format = GGUF V3 (latest) arch ollama[1158318]: print_info: file type = Q4_K - Medium arch ollama[1158318]: print_info: file size = 2.51 GiB (5.13 BPW) arch ollama[1158318]: llama_model_load: error loading model: error loading model architecture: unknown model architecture: 'qwen35' arch ollama[1158318]: llama_model_load_from_file_impl: failed to load model ``` ### Ollama version 0.17.6
GiteaMirror added the bug label 2026-05-05 02:04:59 -05:00
Author
Owner

@ch3rn0v commented on GitHub (Apr 18, 2026):

Same with ollama 0.19.0 and Qwen3.6-35B-A3B-Uncensored-HauhauCS-Aggressive

<!-- gh-comment-id:4274598066 --> @ch3rn0v commented on GitHub (Apr 18, 2026): Same with ollama 0.19.0 and Qwen3.6-35B-A3B-Uncensored-HauhauCS-Aggressive
Sign in to join this conversation.
1 Participants
Notifications
Due Date
No due date set.
Dependencies

No dependencies set.

Reference: github-starred/ollama#71539