[PR #5859] [MERGED] Prevent partial loading on mixed GPU brands #37820

Closed
opened 2026-04-22 22:28:56 -05:00 by GiteaMirror · 0 comments

📋 Pull Request Information

Original PR: https://github.com/ollama/ollama/pull/5859
Author: @dhiltgen
Created: 7/22/2024
Status: Merged
Merged: 7/30/2024
Merged by: @dhiltgen

Base: main ← Head: homogeneous_gpus


📝 Commits (1)

  • 3454209 Prevent partial loading on mixed GPU brands

📊 Changes

2 files changed (+66 additions, -4 deletions)


📝 server/sched.go (+27 -4)
📝 server/sched_test.go (+39 -0)

📄 Description

In multi-brand GPU setups, if we couldn't fully load the model, we would fall through the scheduler and mistakenly try to load across a mix of brands. This change makes sure we find the set of GPU(s) that best fits the partial load.

Fixes #5476
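
The selection described above can be illustrated with a minimal Go sketch. This is not the code from `server/sched.go`: the `GPU` type, its fields, the helpers `groupByLibrary`, `totalFree`, and `pickGPUs`, and the use of total free memory as the "best fit" measure are all assumptions made for illustration. The idea shown is only that the candidate sets are grouped by brand, so a partial load never spans a mix of brands.

```go
package main

import "fmt"

// GPU is a hypothetical stand-in for a device record; the field names
// are illustrative and are not taken from the ollama source.
type GPU struct {
	ID         string
	Library    string // brand/runtime label, e.g. "cuda" or "rocm"
	FreeMemory uint64 // bytes
}

// groupByLibrary buckets GPUs by brand, preserving first-seen order.
func groupByLibrary(gpus []GPU) [][]GPU {
	index := map[string]int{}
	var groups [][]GPU
	for _, g := range gpus {
		i, ok := index[g.Library]
		if !ok {
			i = len(groups)
			index[g.Library] = i
			groups = append(groups, nil)
		}
		groups[i] = append(groups[i], g)
	}
	return groups
}

// totalFree sums the free memory across a set of GPUs.
func totalFree(gpus []GPU) uint64 {
	var sum uint64
	for _, g := range gpus {
		sum += g.FreeMemory
	}
	return sum
}

// pickGPUs returns a single-brand set of GPUs. If some brand can hold the
// whole model (required bytes), that set is returned with full == true.
// Otherwise the brand with the most free memory is returned for a partial
// load (assumed heuristic). The result never mixes brands.
func pickGPUs(gpus []GPU, required uint64) (selected []GPU, full bool) {
	groups := groupByLibrary(gpus)
	for _, grp := range groups {
		if totalFree(grp) >= required {
			return grp, true
		}
	}
	var best []GPU
	var bestFree uint64
	for _, grp := range groups {
		if free := totalFree(grp); best == nil || free > bestFree {
			best, bestFree = grp, free
		}
	}
	return best, false
}

func main() {
	const GiB = 1 << 30
	gpus := []GPU{
		{ID: "0", Library: "cuda", FreeMemory: 8 * GiB},
		{ID: "1", Library: "rocm", FreeMemory: 16 * GiB},
		{ID: "2", Library: "cuda", FreeMemory: 4 * GiB},
	}
	selected, full := pickGPUs(gpus, 20*GiB)
	fmt.Printf("full=%v brand=%s count=%d\n", full, selected[0].Library, len(selected))
}
```

In this sketch no brand can hold the 20 GiB model, so the fallback picks one brand's GPUs for the partial load rather than combining devices from both.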


🔄 This issue represents a GitHub Pull Request. It cannot be merged through Gitea due to API limitations.

GiteaMirror added the pull-request label 2026-04-22 22:28:56 -05:00
Reference: github-starred/ollama#37820