[PR #14075] [MERGED] qwen3next: fix issue in delta net #25039

Closed
opened 2026-04-19 17:57:58 -05:00 by GiteaMirror · 0 comments
Owner

📋 Pull Request Information

Original PR: https://github.com/ollama/ollama/pull/14075
Author: @jmorganca
Created: 2/4/2026
Status: Merged
Merged: 2/4/2026
Merged by: @jmorganca

Base: mainHead: qwen3-next-fix


📝 Commits (1)

  • 794ea02 qwen3next: fix issue in delta net

📊 Changes

1 file changed (+6 additions, -7 deletions)

View changed files

📝 model/models/qwen3next/deltanet.go (+6 -7)

📄 Description

gDiffExp was being broadcast across the wrong axis when multiplying with k. This fix reshapes gDiffExp to [1, chunkSize, nChunks, ...]


🔄 This issue represents a GitHub Pull Request. It cannot be merged through Gitea due to API limitations.

## 📋 Pull Request Information **Original PR:** https://github.com/ollama/ollama/pull/14075 **Author:** [@jmorganca](https://github.com/jmorganca) **Created:** 2/4/2026 **Status:** ✅ Merged **Merged:** 2/4/2026 **Merged by:** [@jmorganca](https://github.com/jmorganca) **Base:** `main` ← **Head:** `qwen3-next-fix` --- ### 📝 Commits (1) - [`794ea02`](https://github.com/ollama/ollama/commit/794ea02bc8ad8f3e822443d430e39d92c9ba48dd) qwen3next: fix issue in delta net ### 📊 Changes **1 file changed** (+6 additions, -7 deletions) <details> <summary>View changed files</summary> 📝 `model/models/qwen3next/deltanet.go` (+6 -7) </details> ### 📄 Description `gDiffExp` was being broadcast across the wrong axis when multiplying with k. This fix reshapes `gDiffExp` to [1, chunkSize, nChunks, ...] --- <sub>🔄 This issue represents a GitHub Pull Request. It cannot be merged through Gitea due to API limitations.</sub>
GiteaMirror added the pull-request label 2026-04-19 17:57:58 -05:00
Sign in to join this conversation.
1 Participants
Notifications
Due Date
No due date set.
Dependencies

No dependencies set.

Reference: github-starred/ollama#25039