[PR #5588] [MERGED] Remove GGML_CUDA_FORCE_MMQ=on from build #11840

Closed
opened 2026-04-12 23:40:23 -05:00 by GiteaMirror · 0 comments
Owner

📋 Pull Request Information

Original PR: https://github.com/ollama/ollama/pull/5588
Author: @jmorganca
Created: 7/10/2024
Status: Merged
Merged: 7/10/2024
Merged by: @jmorganca

Base: mainHead: jmorganca/no-mmq


📝 Commits (1)

  • 2d23119 remove GGML_CUDA_FORCE_MMQ=on from build

📊 Changes

1 file changed (+1 additions, -1 deletions)

View changed files

📝 llm/generate/gen_linux.sh (+1 -1)

📄 Description

Removing this build flag seems to cause mmq to be decided at runtime vs always forcing it on. This stops some cuda calls from failing on V100 and other CC 7.0 cards. This may also help with RDNA3 cards

Fixes https://github.com/ollama/ollama/issues/5571

Changes to mmq functionality related to this bug: 7c26775...a8db2a9c (diff-b2fe862fcd5119199ae59ea13d1b6a46e0d23e41e727e39d90913f828a5ff66bR78)


🔄 This issue represents a GitHub Pull Request. It cannot be merged through Gitea due to API limitations.

## 📋 Pull Request Information **Original PR:** https://github.com/ollama/ollama/pull/5588 **Author:** [@jmorganca](https://github.com/jmorganca) **Created:** 7/10/2024 **Status:** ✅ Merged **Merged:** 7/10/2024 **Merged by:** [@jmorganca](https://github.com/jmorganca) **Base:** `main` ← **Head:** `jmorganca/no-mmq` --- ### 📝 Commits (1) - [`2d23119`](https://github.com/ollama/ollama/commit/2d23119bc847ee4001b02fa8e1648db32d953185) remove `GGML_CUDA_FORCE_MMQ=on` from build ### 📊 Changes **1 file changed** (+1 additions, -1 deletions) <details> <summary>View changed files</summary> 📝 `llm/generate/gen_linux.sh` (+1 -1) </details> ### 📄 Description Removing this build flag seems to cause mmq to be decided at runtime vs always forcing it on. This stops some cuda calls from failing on V100 and other CC 7.0 cards. This may also help with [RDNA3 cards](https://github.com/ggerganov/llama.cpp/blob/a59f8fdc85e1119d470d8766e29617962549d993/docs/build.md?plain=1#L175) Fixes https://github.com/ollama/ollama/issues/5571 Changes to mmq functionality related to this bug: https://github.com/ggerganov/llama.cpp/compare/7c26775...a8db2a9c#diff-b2fe862fcd5119199ae59ea13d1b6a46e0d23e41e727e39d90913f828a5ff66bR78 --- <sub>🔄 This issue represents a GitHub Pull Request. It cannot be merged through Gitea due to API limitations.</sub>
GiteaMirror added the pull-request label 2026-04-12 23:40:23 -05:00
Sign in to join this conversation.
1 Participants
Notifications
Due Date
No due date set.
Dependencies

No dependencies set.

Reference: github-starred/ollama#11840