[PR #14343] [CLOSED] Add qwen3.5-next-moe support to MLX runner and models #25171

Closed
opened 2026-04-19 18:03:02 -05:00 by GiteaMirror · 0 comments
Owner

📋 Pull Request Information

Original PR: https://github.com/ollama/ollama/pull/14343
Author: @pdevine
Created: 2/21/2026
Status: Closed

Base: mainHead: pdevine/qwen3next


📝 Commits (1)

  • 00f67e8 Add qwen3.5-next-moe support to MLX runner and models

📊 Changes

12 files changed (+1949 additions, -17 deletions)

View changed files

📝 x/mlxrunner/cache/cache.go (+20 -0)
x/mlxrunner/cache/cache_test.go (+17 -0)
x/mlxrunner/cache/recurrent.go (+162 -0)
📝 x/mlxrunner/imports.go (+2 -0)
📝 x/mlxrunner/mlx/array.go (+36 -0)
📝 x/mlxrunner/mlx/ops_extra.go (+64 -0)
📝 x/mlxrunner/pipeline.go (+123 -17)
x/mlxrunner/pipeline_helpers_test.go (+83 -0)
📝 x/mlxrunner/runner.go (+52 -0)
x/models/qwen3_5/qwen3_5.go (+1254 -0)
x/models/qwen3_5/qwen3_5_test.go (+120 -0)
x/models/qwen3_5_moe/qwen3_5_moe.go (+16 -0)

📄 Description

This change:

  • adds support for qwen3.5-next-moe models (qwen3-next/qwen3.5-next/qwen3-coder) to the MLX runner
  • introduces recurrent cache support and related MLX ops
  • updates pipeline/runner integration and adds tests

🔄 This issue represents a GitHub Pull Request. It cannot be merged through Gitea due to API limitations.

## 📋 Pull Request Information **Original PR:** https://github.com/ollama/ollama/pull/14343 **Author:** [@pdevine](https://github.com/pdevine) **Created:** 2/21/2026 **Status:** ❌ Closed **Base:** `main` ← **Head:** `pdevine/qwen3next` --- ### 📝 Commits (1) - [`00f67e8`](https://github.com/ollama/ollama/commit/00f67e807a42b3ba809822ee92987be36177ad6e) Add qwen3.5-next-moe support to MLX runner and models ### 📊 Changes **12 files changed** (+1949 additions, -17 deletions) <details> <summary>View changed files</summary> 📝 `x/mlxrunner/cache/cache.go` (+20 -0) ➕ `x/mlxrunner/cache/cache_test.go` (+17 -0) ➕ `x/mlxrunner/cache/recurrent.go` (+162 -0) 📝 `x/mlxrunner/imports.go` (+2 -0) 📝 `x/mlxrunner/mlx/array.go` (+36 -0) 📝 `x/mlxrunner/mlx/ops_extra.go` (+64 -0) 📝 `x/mlxrunner/pipeline.go` (+123 -17) ➕ `x/mlxrunner/pipeline_helpers_test.go` (+83 -0) 📝 `x/mlxrunner/runner.go` (+52 -0) ➕ `x/models/qwen3_5/qwen3_5.go` (+1254 -0) ➕ `x/models/qwen3_5/qwen3_5_test.go` (+120 -0) ➕ `x/models/qwen3_5_moe/qwen3_5_moe.go` (+16 -0) </details> ### 📄 Description This change: * adds support for qwen3.5-next-moe models (qwen3-next/qwen3.5-next/qwen3-coder) to the MLX runner * introduces recurrent cache support and related MLX ops * updates pipeline/runner integration and adds tests --- <sub>🔄 This issue represents a GitHub Pull Request. It cannot be merged through Gitea due to API limitations.</sub>
GiteaMirror added the pull-request label 2026-04-19 18:03:02 -05:00
Sign in to join this conversation.
1 Participants
Notifications
Due Date
No due date set.
Dependencies

No dependencies set.

Reference: github-starred/ollama#25171