[PR #13052] flash attn: add auto mode for llama engine #12785

Open
opened 2025-11-12 17:06:43 -06:00 by GiteaMirror · 0 comments

📋 Pull Request Information

Original PR: https://github.com/ollama/ollama/pull/13052
Author: @dhiltgen
Created: 11/11/2025
Status: 🔄 Open

Base: main ← Head: fa_auto


📝 Commits (2)

  • 9cf5945 flash attn: add auto mode for llama engine
  • 6351898 review comments

📊 Changes

6 files changed (+61 additions, -19 deletions)

View changed files

📝 llama/llama.go (+8 -5)
📝 llm/server.go (+25 -9)
📝 ml/backend.go (+1 -1)
📝 ml/backend/ggml/ggml.go (+3 -3)
📝 ml/device.go (+22 -0)
📝 runner/llamarunner/runner.go (+2 -1)

📄 Description

If the user does not specify a flash attention (fa) setting in the environment, use auto mode.

Draft until I can verify more models.
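
The behavior described above amounts to a three-way setting: explicitly on, explicitly off, or left to the engine. The following is only a minimal sketch of that idea and is not taken from the PR's diff. The type `FlashAttnMode`, its constants, the helper `resolveFlashAttn`, and the variable name `EXAMPLE_FLASH_ATTENTION` are all hypothetical, and falling back to auto on an unparseable value is an assumption.

```go
package main

import (
	"fmt"
	"os"
	"strconv"
)

// FlashAttnMode is a hypothetical tri-state setting. The names are
// illustrative and are not taken from the PR.
type FlashAttnMode int

const (
	FlashAttnAuto     FlashAttnMode = iota // user expressed no preference
	FlashAttnDisabled                      // user explicitly turned it off
	FlashAttnEnabled                       // user explicitly turned it on
)

// String returns a readable name for logging.
func (m FlashAttnMode) String() string {
	switch m {
	case FlashAttnDisabled:
		return "disabled"
	case FlashAttnEnabled:
		return "enabled"
	default:
		return "auto"
	}
}

// resolveFlashAttn maps the result of an environment lookup to a mode.
// set reports whether the variable was present at all.
func resolveFlashAttn(value string, set bool) FlashAttnMode {
	if !set || value == "" {
		return FlashAttnAuto
	}
	on, err := strconv.ParseBool(value)
	if err != nil {
		// Assumption: values that do not parse as a boolean fall back to auto.
		return FlashAttnAuto
	}
	if on {
		return FlashAttnEnabled
	}
	return FlashAttnDisabled
}

func main() {
	// EXAMPLE_FLASH_ATTENTION is a placeholder name for this sketch.
	v, ok := os.LookupEnv("EXAMPLE_FLASH_ATTENTION")
	fmt.Println("flash attention mode:", resolveFlashAttn(v, ok))
}
```

Keeping "unset" distinct from "false" is what lets an explicit user choice always win, while the unset case is left for the engine to decide per model and device.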


🔄 This issue represents a GitHub Pull Request. It cannot be merged through Gitea due to API limitations.

GiteaMirror added the pull-request label 2025-11-12 17:06:43 -06:00
Reference: github-starred/ollama-ollama#12785