[PR #4734] [MERGED] partial offloading: allow flash attention and disable mmap #11575

Closed
opened 2026-04-12 23:32:38 -05:00 by GiteaMirror · 0 comments
Owner

📋 Pull Request Information

Original PR: https://github.com/ollama/ollama/pull/4734
Author: @jmorganca
Created: 5/30/2024
Status: Merged
Merged: 5/30/2024
Merged by: @jmorganca

Base: mainHead: jmorganca/loadparams


📝 Commits (2)

  • 562b0aa partial offloading: allow flash attention and disable mmap
  • 7078a01 allow mmap with num_gpu=0

📊 Changes

1 file changed (+21 additions, -18 deletions)

View changed files

📝 llm/server.go (+21 -18)

📄 Description

No description provided


🔄 This issue represents a GitHub Pull Request. It cannot be merged through Gitea due to API limitations.

## 📋 Pull Request Information **Original PR:** https://github.com/ollama/ollama/pull/4734 **Author:** [@jmorganca](https://github.com/jmorganca) **Created:** 5/30/2024 **Status:** ✅ Merged **Merged:** 5/30/2024 **Merged by:** [@jmorganca](https://github.com/jmorganca) **Base:** `main` ← **Head:** `jmorganca/loadparams` --- ### 📝 Commits (2) - [`562b0aa`](https://github.com/ollama/ollama/commit/562b0aa53f7273bac5bea8cc06ac7e5c19721e97) partial offloading: allow flash attention and disable mmap - [`7078a01`](https://github.com/ollama/ollama/commit/7078a01a2e91309c2e18102eee2f0e46995623fa) allow mmap with num_gpu=0 ### 📊 Changes **1 file changed** (+21 additions, -18 deletions) <details> <summary>View changed files</summary> 📝 `llm/server.go` (+21 -18) </details> ### 📄 Description _No description provided_ --- <sub>🔄 This issue represents a GitHub Pull Request. It cannot be merged through Gitea due to API limitations.</sub>
GiteaMirror added the pull-request label 2026-04-12 23:32:39 -05:00
Sign in to join this conversation.
1 Participants
Notifications
Due Date
No due date set.
Dependencies

No dependencies set.

Reference: github-starred/ollama#11575