[PR #5606] [CLOSED] server/llama.cpp: stream quantize progress to client #11844

Closed
opened 2026-04-12 23:40:29 -05:00 by GiteaMirror · 0 comments
Owner

📋 Pull Request Information

Original PR: https://github.com/ollama/ollama/pull/5606
Author: @joshyan1
Created: 7/10/2024
Status: Closed

Base: jyan/progressHead: jyan/quant3


📝 Commits (10+)

📊 Changes

5 files changed (+110 additions, -5 deletions)

View changed files

📝 api/types.go (+1 -0)
📝 cmd/cmd.go (+10 -0)
📝 llm/llm.go (+42 -1)
llm/patches/10-quantize-progress.diff (+53 -0)
📝 server/images.go (+4 -4)

📄 Description

added in streaming to Quantize to stream back tensor progress on quantization.
patched llama.cpp to involve a callback containing streaming progress

quantizing model tensors 69/195 ⠹


🔄 This issue represents a GitHub Pull Request. It cannot be merged through Gitea due to API limitations.

## 📋 Pull Request Information **Original PR:** https://github.com/ollama/ollama/pull/5606 **Author:** [@joshyan1](https://github.com/joshyan1) **Created:** 7/10/2024 **Status:** ❌ Closed **Base:** `jyan/progress` ← **Head:** `jyan/quant3` --- ### 📝 Commits (10+) - [`e87eafe`](https://github.com/ollama/ollama/commit/e87eafe5cd0757f22a6fcca2dc6a34665aa627dd) quantize percentage - [`1344843`](https://github.com/ollama/ollama/commit/134484351584dcaad6c98ccfd1a3859cd2ca3c54) image - [`bec9100`](https://github.com/ollama/ollama/commit/bec9100f324b94854e04197eb431d1c10d1b418d) tensor count - [`ee2b9b0`](https://github.com/ollama/ollama/commit/ee2b9b076c69bd44064d838e5ba11c0363514c47) stop spinner - [`c63b4ec`](https://github.com/ollama/ollama/commit/c63b4ecbf77974648ac58e1c1614a6089c26ca71) quantize - [`24e8292`](https://github.com/ollama/ollama/commit/24e8292e94fbd3b94e04a147e161b5db21b6c056) new changes - [`dfc6264`](https://github.com/ollama/ollama/commit/dfc62648f3da0994cb4006a709ede84ac4348a43) cast - [`c800a67`](https://github.com/ollama/ollama/commit/c800a67f1b6c86a8ebe39f54d363d1037ef51426) cast - [`c498609`](https://github.com/ollama/ollama/commit/c498609ba3e05433f990bd288fd889d980d97dad) cast - [`9e0b8f1`](https://github.com/ollama/ollama/commit/9e0b8f1fe257918d9849b1e19040ff28644f6517) another change ### 📊 Changes **5 files changed** (+110 additions, -5 deletions) <details> <summary>View changed files</summary> 📝 `api/types.go` (+1 -0) 📝 `cmd/cmd.go` (+10 -0) 📝 `llm/llm.go` (+42 -1) ➕ `llm/patches/10-quantize-progress.diff` (+53 -0) 📝 `server/images.go` (+4 -4) </details> ### 📄 Description added in streaming to `Quantize` to stream back tensor progress on quantization. patched llama.cpp to involve a callback containing streaming progress `quantizing model tensors 69/195 ⠹ ` --- <sub>🔄 This issue represents a GitHub Pull Request. It cannot be merged through Gitea due to API limitations.</sub>
GiteaMirror added the pull-request label 2026-04-12 23:40:29 -05:00
Sign in to join this conversation.
1 Participants
Notifications
Due Date
No due date set.
Dependencies

No dependencies set.

Reference: github-starred/ollama#11844