[PR #12293] [MERGED] use split activations when possible #24310

Closed
opened 2026-04-19 17:30:32 -05:00 by GiteaMirror · 0 comments

📋 Pull Request Information

Original PR: https://github.com/ollama/ollama/pull/12293
Author: @mxyng
Created: 9/15/2025
Status: Merged
Merged: 9/16/2025
Merged by: @mxyng

Base: main ← Head: mxyng/split-activations


📝 Commits (2)

  • 2959b55 use ggml_*_split activations when possible
  • e181c00 forward qkv

📊 Changes

16 files changed (+59 additions, -50 deletions)

📝 ml/backend.go (+6 -5)
📝 ml/backend/ggml/ggml.go (+21 -10)
📝 ml/nn/attention.go (+2 -0)
📝 model/models/gemma2/model.go (+1 -1)
📝 model/models/gemma3/model_text.go (+1 -1)
📝 model/models/gemma3n/model_text.go (+2 -3)
📝 model/models/gptoss/model.go (+1 -1)
📝 model/models/llama/model.go (+1 -1)
📝 model/models/llama4/model_text.go (+8 -8)
📝 model/models/mistral3/model_text.go (+1 -1)
📝 model/models/mistral3/model_vision.go (+1 -1)
📝 model/models/mllama/model_text.go (+1 -1)
📝 model/models/qwen2/model.go (+1 -1)
📝 model/models/qwen25vl/model_text.go (+1 -1)
📝 model/models/qwen25vl/model_vision.go (+1 -2)
📝 model/models/qwen3/model.go (+10 -13)

📄 Description

No description provided


🔄 This issue represents a GitHub Pull Request. It cannot be merged through Gitea due to API limitations.

GiteaMirror added the pull-request label 2026-04-19 17:30:32 -05:00