[PR #1896] [MERGED] Increase minimum CUDA memory allocation overhead and fix minimum overhead for multi-gpu #10715

Closed
opened 2026-04-12 23:08:38 -05:00 by GiteaMirror · 0 comments
Owner

📋 Pull Request Information

Original PR: https://github.com/ollama/ollama/pull/1896
Author: @jmorganca
Created: 1/10/2024
Status: Merged
Merged: 1/11/2024
Merged by: @jmorganca

Base: mainHead: overhead-fixes


📝 Commits (6)

  • db53635 increase minimum cuda overhead and fix minimum overhead for multi-gpu
  • d81944c fix multi gpu overhead
  • 6675989 limit overhead to 10% of all gpus
  • f208251 better wording
  • 65c0dec allocate fixed amount before layers
  • 32c40a6 fixed only includes graph alloc

📊 Changes

2 files changed (+19 additions, -12 deletions)

View changed files

📝 gpu/gpu.go (+5 -4)
📝 llm/llm.go (+14 -8)

📄 Description

Fixes https://github.com/jmorganca/ollama/issues/1887


🔄 This issue represents a GitHub Pull Request. It cannot be merged through Gitea due to API limitations.

## 📋 Pull Request Information **Original PR:** https://github.com/ollama/ollama/pull/1896 **Author:** [@jmorganca](https://github.com/jmorganca) **Created:** 1/10/2024 **Status:** ✅ Merged **Merged:** 1/11/2024 **Merged by:** [@jmorganca](https://github.com/jmorganca) **Base:** `main` ← **Head:** `overhead-fixes` --- ### 📝 Commits (6) - [`db53635`](https://github.com/ollama/ollama/commit/db53635c97a57d035f2e28e5bddc7a6f1ad7d884) increase minimum cuda overhead and fix minimum overhead for multi-gpu - [`d81944c`](https://github.com/ollama/ollama/commit/d81944c5cd3945dd04ee60afe242ebd21d8fa5eb) fix multi gpu overhead - [`6675989`](https://github.com/ollama/ollama/commit/6675989feac2a3928ad4ddc54fd592b18cc409ae) limit overhead to 10% of all gpus - [`f208251`](https://github.com/ollama/ollama/commit/f2082518988eae8e232259d6b308cc0a31c76015) better wording - [`65c0dec`](https://github.com/ollama/ollama/commit/65c0dec811fda29d03f5528c8748093d9a3babb9) allocate fixed amount before layers - [`32c40a6`](https://github.com/ollama/ollama/commit/32c40a644bd4137acea2735c5927e14de82b9bd6) fixed only includes graph alloc ### 📊 Changes **2 files changed** (+19 additions, -12 deletions) <details> <summary>View changed files</summary> 📝 `gpu/gpu.go` (+5 -4) 📝 `llm/llm.go` (+14 -8) </details> ### 📄 Description Fixes https://github.com/jmorganca/ollama/issues/1887 --- <sub>🔄 This issue represents a GitHub Pull Request. It cannot be merged through Gitea due to API limitations.</sub>
GiteaMirror added the pull-request label 2026-04-12 23:08:39 -05:00
Sign in to join this conversation.
1 Participants
Notifications
Due Date
No due date set.
Dependencies

No dependencies set.

Reference: github-starred/ollama#10715