[PR #14382] [CLOSED] mlxrunner: Report actual memory usage from runner #45896

Closed
opened 2026-04-25 01:30:27 -05:00 by GiteaMirror · 0 comments
Owner

📋 Pull Request Information

Original PR: https://github.com/ollama/ollama/pull/14382
Author: @jessegross
Created: 2/24/2026
Status: Closed

Base: mainHead: jessegross/mlx-usage


📝 Commits (1)

  • 4d5ff25 mlxrunner: Report actual memory usage from runner

📊 Changes

7 files changed (+56 additions, -63 deletions)

View changed files

📝 llm/server.go (+11 -25)
📝 server/routes.go (+3 -0)
📝 server/sched.go (+6 -4)
📝 server/sched_test.go (+1 -2)
📝 x/imagegen/server.go (+3 -8)
📝 x/mlxrunner/client.go (+28 -21)
📝 x/mlxrunner/server.go (+4 -3)

📄 Description

The MLX runner previously reported a static VRAM estimate that was computed at load time and consisted only of the weights. This is strictly less than the actual memory usage, as it does not include the KV cache or compute graph.


🔄 This issue represents a GitHub Pull Request. It cannot be merged through Gitea due to API limitations.

## 📋 Pull Request Information **Original PR:** https://github.com/ollama/ollama/pull/14382 **Author:** [@jessegross](https://github.com/jessegross) **Created:** 2/24/2026 **Status:** ❌ Closed **Base:** `main` ← **Head:** `jessegross/mlx-usage` --- ### 📝 Commits (1) - [`4d5ff25`](https://github.com/ollama/ollama/commit/4d5ff25724c6749f2e855471d9ca2ff26ef04059) mlxrunner: Report actual memory usage from runner ### 📊 Changes **7 files changed** (+56 additions, -63 deletions) <details> <summary>View changed files</summary> 📝 `llm/server.go` (+11 -25) 📝 `server/routes.go` (+3 -0) 📝 `server/sched.go` (+6 -4) 📝 `server/sched_test.go` (+1 -2) 📝 `x/imagegen/server.go` (+3 -8) 📝 `x/mlxrunner/client.go` (+28 -21) 📝 `x/mlxrunner/server.go` (+4 -3) </details> ### 📄 Description The MLX runner previously reported a static VRAM estimate that was computed at load time and consisted only of the weights. This is strictly less than the actual memory usage, as it does not include the KV cache or compute graph. --- <sub>🔄 This issue represents a GitHub Pull Request. It cannot be merged through Gitea due to API limitations.</sub>
GiteaMirror added the pull-request label 2026-04-25 01:30:27 -05:00
Sign in to join this conversation.
1 Participants
Notifications
Due Date
No due date set.
Dependencies

No dependencies set.

Reference: github-starred/ollama#45896