[PR #3506] [MERGED] cgo quantize #11181

Closed
opened 2026-04-12 23:23:28 -05:00 by GiteaMirror · 0 comments
Owner

📋 Pull Request Information

Original PR: https://github.com/ollama/ollama/pull/3506
Author: @mxyng
Created: 4/5/2024
Status: Merged
Merged: 4/9/2024
Merged by: @mxyng

Base: mainHead: mxyng/quantize-redux


📝 Commits (2)

📊 Changes

7 files changed (+145 additions, -45 deletions)

View changed files

📝 api/client.go (+1 -13)
📝 api/types.go (+28 -27)
📝 cmd/cmd.go (+4 -1)
📝 llm/llm.go (+71 -0)
📝 server/images.go (+21 -2)
📝 server/routes.go (+19 -1)
📝 server/routes_test.go (+1 -1)

📄 Description

revive #307

this will only quantize a converted model. quantizing an arbitrary fp16/fp32 will be a follow up


🔄 This issue represents a GitHub Pull Request. It cannot be merged through Gitea due to API limitations.

## 📋 Pull Request Information **Original PR:** https://github.com/ollama/ollama/pull/3506 **Author:** [@mxyng](https://github.com/mxyng) **Created:** 4/5/2024 **Status:** ✅ Merged **Merged:** 4/9/2024 **Merged by:** [@mxyng](https://github.com/mxyng) **Base:** `main` ← **Head:** `mxyng/quantize-redux` --- ### 📝 Commits (2) - [`e1c9a2a`](https://github.com/ollama/ollama/commit/e1c9a2a00fd555f33dae7f97b7900a9d636166b3) no blob create if already exists - [`9502e56`](https://github.com/ollama/ollama/commit/9502e5661f97287064e264091b3bbc88646ac1fb) cgo quantize ### 📊 Changes **7 files changed** (+145 additions, -45 deletions) <details> <summary>View changed files</summary> 📝 `api/client.go` (+1 -13) 📝 `api/types.go` (+28 -27) 📝 `cmd/cmd.go` (+4 -1) 📝 `llm/llm.go` (+71 -0) 📝 `server/images.go` (+21 -2) 📝 `server/routes.go` (+19 -1) 📝 `server/routes_test.go` (+1 -1) </details> ### 📄 Description revive #307 this will _only_ quantize a converted model. quantizing an arbitrary fp16/fp32 will be a follow up --- <sub>🔄 This issue represents a GitHub Pull Request. It cannot be merged through Gitea due to API limitations.</sub>
GiteaMirror added the pull-request label 2026-04-12 23:23:28 -05:00
Sign in to join this conversation.
1 Participants
Notifications
Due Date
No due date set.
Dependencies

No dependencies set.

Reference: github-starred/ollama#11181