[PR #12603] [MERGED] fix(qwen3): deepseek distill #12620

Closed
opened 2025-11-12 16:41:04 -06:00 by GiteaMirror · 0 comments
Owner

📋 Pull Request Information

Original PR: https://github.com/ollama/ollama/pull/12603
Author: @mxyng
Created: 10/13/2025
Status: Merged
Merged: 10/13/2025
Merged by: @mxyng

Base: mainHead: mxyng/qwen3


📝 Commits (2)

📊 Changes

2 files changed (+41 additions, -19 deletions)

View changed files

📝 fs/ggml/ggml.go (+1 -0)
📝 model/models/qwen3/model.go (+40 -19)

📄 Description

deepseek's qwen3 distill uses a different rope scheme so support both


🔄 This issue represents a GitHub Pull Request. It cannot be merged through Gitea due to API limitations.

## 📋 Pull Request Information **Original PR:** https://github.com/ollama/ollama/pull/12603 **Author:** [@mxyng](https://github.com/mxyng) **Created:** 10/13/2025 **Status:** ✅ Merged **Merged:** 10/13/2025 **Merged by:** [@mxyng](https://github.com/mxyng) **Base:** `main` ← **Head:** `mxyng/qwen3` --- ### 📝 Commits (2) - [`d891f8a`](https://github.com/ollama/ollama/commit/d891f8a1c49cc74e8adc56a293795d11f6f7d051) fix(qwen3): deepseek distill - [`dcdb965`](https://github.com/ollama/ollama/commit/dcdb965ad234f7bd2c41a9e788b7486821f3c856) Revert "use llama runner for qwen3 (#12556)" ### 📊 Changes **2 files changed** (+41 additions, -19 deletions) <details> <summary>View changed files</summary> 📝 `fs/ggml/ggml.go` (+1 -0) 📝 `model/models/qwen3/model.go` (+40 -19) </details> ### 📄 Description deepseek's qwen3 distill uses a different rope scheme so support both --- <sub>🔄 This issue represents a GitHub Pull Request. It cannot be merged through Gitea due to API limitations.</sub>
GiteaMirror added the pull-request label 2025-11-12 16:41:04 -06:00
Sign in to join this conversation.
1 Participants
Notifications
Due Date
No due date set.
Dependencies

No dependencies set.

Reference: github-starred/ollama-ollama#12620