[PR #4316] [MERGED] Bump VRAM buffer back up #11452

Closed
opened 2026-04-12 23:30:26 -05:00 by GiteaMirror · 0 comments

📋 Pull Request Information

Original PR: https://github.com/ollama/ollama/pull/4316
Author: @dhiltgen
Created: 5/10/2024
Status: Merged
Merged: 5/10/2024
Merged by: @dhiltgen

Base: main ← Head: more_buffer


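📝 Commits (1)

30a7d70 (https://github.com/ollama/ollama/commit/30a7d7096c1ca4e90e859efce23042a27487fa8c) Bump VRAM buffer back up
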
📊 Changes

2 files changed (+3 additions, -3 deletions)

📝 gpu/gpu.go (+2 -2)
📝 gpu/gpu_darwin.go (+1 -1)

📄 Description

Under stress scenarios we're seeing OOMs, so this should help stabilize allocations under heavy concurrency.
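
The reasoning can be illustrated with a minimal Go sketch of how a per-GPU memory reserve affects whether a model load is admitted. This is not the code changed by the PR: `usableVRAM`, `fits`, `minimumMemory`, and all sizes below are hypothetical names and values chosen for illustration, and the actual constants in `gpu/gpu.go` and `gpu/gpu_darwin.go` are not shown on this page.

```go
package main

import "fmt"

const mebiByte uint64 = 1024 * 1024

// minimumMemory is a hypothetical per-GPU reserve held back from allocation.
// The real values touched by the PR are not reproduced here.
const minimumMemory = 512 * mebiByte

// usableVRAM returns how much of the reported free VRAM may be handed out
// once the reserve is held back. It returns 0 if the reserve is not covered.
func usableVRAM(free, reserve uint64) uint64 {
	if free <= reserve {
		return 0
	}
	return free - reserve
}

// fits reports whether a load needing `required` bytes is admitted.
func fits(free, reserve, required uint64) bool {
	return required <= usableVRAM(free, reserve)
}

func main() {
	free := 8192 * mebiByte  // assumed free VRAM reported by the driver
	model := 7800 * mebiByte // assumed memory estimate for one model load

	// With a small reserve the load is admitted and leaves little headroom.
	fmt.Println(fits(free, 256*mebiByte, model))
	// With the larger reserve the same load is rejected up front.
	fmt.Println(fits(free, minimumMemory, model))
}
```

A larger reserve trades some usable capacity for headroom: loads that would only barely fit are refused at scheduling time rather than failing with an OOM later, when concurrent requests grow the actual allocation past the estimate.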


🔄 This issue represents a GitHub Pull Request. It cannot be merged through Gitea due to API limitations.

GiteaMirror added the pull-request label 2026-04-12 23:30:26 -05:00
Reference: github-starred/ollama#11452