[PR #11862] [MERGED] discover: CPU supports flash attention #12374

Closed
opened 2025-11-12 16:34:28 -06:00 by GiteaMirror · 0 comments

📋 Pull Request Information

Original PR: https://github.com/ollama/ollama/pull/11862
Author: @jessegross
Created: 8/11/2025
Status: Merged
Merged: 8/11/2025
Merged by: @jessegross

Base: main ← Head: jessegross/flash


📝 Commits (1)

  • 9dffb01 discover: CPU supports flash attention (https://github.com/ollama/ollama/commit/9dffb017e93678a24efa3e79b6e93477da5b3f2a)

📊 Changes

1 file changed (+2 additions, -1 deletion)

📝 discover/types.go (+2 -1)

📄 Description

We already run flash attention on CPUs when a model is partially offloaded, but we were disabling it when running purely on the CPU, which is unnecessary.
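For illustration only, the behavior described above can be sketched as a support check in which a CPU entry no longer vetoes flash attention. This is not the actual diff to `discover/types.go`: the `device` type, its `Library` and `AcceleratorFA` fields, and the `flashAttentionSupported` function are hypothetical names chosen for this sketch.

```go
package main

import "fmt"

// device is a hypothetical stand-in for one discovered compute device.
// It is NOT the real type from discover/types.go.
type device struct {
	Library       string // assumed labels such as "cpu" or "cuda"
	AcceleratorFA bool   // assumed flag: the accelerator backend supports flash attention
}

// flashAttentionSupported returns true only if every device in the list can
// run flash attention. In this sketch a CPU device always counts as capable,
// so a pure-CPU list no longer disables the feature.
func flashAttentionSupported(devices []device) bool {
	for _, d := range devices {
		if d.Library == "cpu" {
			continue // CPU is treated as supporting flash attention
		}
		if !d.AcceleratorFA {
			return false // one unsupported accelerator disables it for the whole list
		}
	}
	return true
}

func main() {
	pureCPU := []device{{Library: "cpu"}}
	mixed := []device{{Library: "cpu"}, {Library: "cuda", AcceleratorFA: true}}
	unsupported := []device{{Library: "cuda", AcceleratorFA: false}}

	fmt.Println("pure CPU:", flashAttentionSupported(pureCPU))
	fmt.Println("partial offload:", flashAttentionSupported(mixed))
	fmt.Println("unsupported accelerator:", flashAttentionSupported(unsupported))
}
```

Under these assumptions, the pure-CPU and partial-offload cases are handled the same way, which is the point of the change.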


🔄 This issue represents a GitHub Pull Request. It cannot be merged through Gitea due to API limitations.

GiteaMirror added the pull-request label 2025-11-12 16:34:28 -06:00
Reference: github-starred/ollama-ollama#12374