[PR #12482] [MERGED] llm: Enable flash attention by default for qwen3 and qwen3moe #13835

Closed
opened 2026-04-13 00:38:11 -05:00 by GiteaMirror · 0 comments
Owner

📋 Pull Request Information

Original PR: https://github.com/ollama/ollama/pull/12482
Author: @jessegross
Created: 10/2/2025
Status: Merged
Merged: 10/3/2025
Merged by: @jessegross

Base: mainHead: jessegross/qwen3_flash


📝 Commits (1)

  • ba1ff47 llm: Enable flash attention by default for qwen3 and qwen3moe

📊 Changes

1 file changed (+2 additions, -0 deletions)

View changed files

📝 fs/ggml/ggml.go (+2 -0)

📄 Description

No description provided


🔄 This issue represents a GitHub Pull Request. It cannot be merged through Gitea due to API limitations.

## 📋 Pull Request Information **Original PR:** https://github.com/ollama/ollama/pull/12482 **Author:** [@jessegross](https://github.com/jessegross) **Created:** 10/2/2025 **Status:** ✅ Merged **Merged:** 10/3/2025 **Merged by:** [@jessegross](https://github.com/jessegross) **Base:** `main` ← **Head:** `jessegross/qwen3_flash` --- ### 📝 Commits (1) - [`ba1ff47`](https://github.com/ollama/ollama/commit/ba1ff473f11254a5b455bed97feba4c4264032cf) llm: Enable flash attention by default for qwen3 and qwen3moe ### 📊 Changes **1 file changed** (+2 additions, -0 deletions) <details> <summary>View changed files</summary> 📝 `fs/ggml/ggml.go` (+2 -0) </details> ### 📄 Description _No description provided_ --- <sub>🔄 This issue represents a GitHub Pull Request. It cannot be merged through Gitea due to API limitations.</sub>
GiteaMirror added the pull-request label 2026-04-13 00:38:11 -05:00
Sign in to join this conversation.
1 Participants
Notifications
Due Date
No due date set.
Dependencies

No dependencies set.

Reference: github-starred/ollama#13835