[PR #4547] [MERGED] Wire up load progress #22062

Closed
opened 2026-04-19 16:04:00 -05:00 by GiteaMirror · 0 comments
Owner

📋 Pull Request Information

Original PR: https://github.com/ollama/ollama/pull/4547
Author: @dhiltgen
Created: 5/20/2024
Status: Merged
Merged: 5/23/2024
Merged by: @dhiltgen

Base: mainHead: load_progress


📝 Commits (1)

📊 Changes

3 files changed (+61 additions, -8 deletions)

View changed files

📝 llm/ext_server/server.cpp (+13 -1)
llm/patches/01-load-progress.diff (+31 -0)
📝 llm/server.go (+17 -7)

📄 Description

This doesn't expose a UX yet, but wires the initial server portion of progress reporting during load

TODO

  • Adjust waitUntilRunning to be smarter and look for stalled loads instead of a dumb 10m timer
  • expose progress in ollama run UX can come in a follow up PR
  • expose percent loaded in ollama ps UX can come in a follow up PR

Fixes #4350
Replaces #4123 #4419

This should provide a good balance between slow model loads vs. detecting stalls without taking too long before giving up.


🔄 This issue represents a GitHub Pull Request. It cannot be merged through Gitea due to API limitations.

## 📋 Pull Request Information **Original PR:** https://github.com/ollama/ollama/pull/4547 **Author:** [@dhiltgen](https://github.com/dhiltgen) **Created:** 5/20/2024 **Status:** ✅ Merged **Merged:** 5/23/2024 **Merged by:** [@dhiltgen](https://github.com/dhiltgen) **Base:** `main` ← **Head:** `load_progress` --- ### 📝 Commits (1) - [`b37b496`](https://github.com/ollama/ollama/commit/b37b496a12ebad0105ed17826d838346bff6e5ef) Wire up load progress ### 📊 Changes **3 files changed** (+61 additions, -8 deletions) <details> <summary>View changed files</summary> 📝 `llm/ext_server/server.cpp` (+13 -1) ➕ `llm/patches/01-load-progress.diff` (+31 -0) 📝 `llm/server.go` (+17 -7) </details> ### 📄 Description This doesn't expose a UX yet, but wires the initial server portion of progress reporting during load TODO - [X] Adjust waitUntilRunning to be smarter and look for stalled loads instead of a dumb 10m timer - [ ] ~~expose progress in `ollama run`~~ UX can come in a follow up PR - [ ] ~~expose percent loaded in `ollama ps`~~ UX can come in a follow up PR Fixes #4350 Replaces #4123 #4419 This should provide a good balance between slow model loads vs. detecting stalls without taking too long before giving up. --- <sub>🔄 This issue represents a GitHub Pull Request. It cannot be merged through Gitea due to API limitations.</sub>
GiteaMirror added the pull-request label 2026-04-19 16:04:00 -05:00
Sign in to join this conversation.
1 Participants
Notifications
Due Date
No due date set.
Dependencies

No dependencies set.

Reference: github-starred/ollama#22062