[PR #307] [CLOSED] quantize f32, f16 #8827

Closed
opened 2025-11-12 14:53:20 -06:00 by GiteaMirror · 0 comments

📋 Pull Request Information

Original PR: https://github.com/ollama/ollama/pull/307
Author: @mxyng
Created: 8/7/2023
Status: Closed

Base: main ← Head: auto-quantize


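📝 Commits (2)

535eaf5 quantize f32, f16 (https://github.com/ollama/ollama/commit/535eaf553720e591f1266290e8f5973009199589)
7db4e9e docs for from ... as ... (https://github.com/ollama/ollama/commit/7db4e9efd7ddd31be7afdf60ef5b5db01dd72467)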

📊 Changes

5 files changed (+187 additions, -2 deletions)

📝 docs/modelfile.md (+30 -1)
📝 llm/ggml.go (+40 -0)
📝 llm/llama.go (+79 -0)
📝 parser/parser.go (+11 -0)
📝 server/images.go (+27 -1)

📄 Description

If the input model in a Modelfile is a GGML f32 or f16 file and the FROM line contains the AS keyword, quantize the model to the specified level.

Example Modelfile:

FROM /path/to/my/f32.bin AS Q4_0
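
As a rough illustration of the rule described above, the sketch below parses a `FROM ... AS ...` line and decides whether quantization would apply. It is not the code from this PR: `fromSpec`, `parseFrom`, and `shouldQuantize` are hypothetical names, the file type is passed as a plain string for simplicity, and case-insensitive keyword matching is an assumption. Paths containing spaces are not handled.

```go
package main

import (
	"fmt"
	"strings"
)

// fromSpec is a hypothetical type for this sketch; it is not taken from the PR.
type fromSpec struct {
	Model string // path or model name following FROM
	Level string // quantization level following AS; empty when AS is absent
}

// parseFrom parses a line of the form "FROM <model> [AS <level>]".
// Keywords are matched case-insensitively (an assumption of this sketch).
func parseFrom(line string) (fromSpec, error) {
	fields := strings.Fields(line)
	if len(fields) < 2 || !strings.EqualFold(fields[0], "FROM") {
		return fromSpec{}, fmt.Errorf("not a FROM line: %q", line)
	}
	spec := fromSpec{Model: fields[1]}
	switch {
	case len(fields) == 2:
		// Plain "FROM <model>": no quantization requested.
		return spec, nil
	case len(fields) == 4 && strings.EqualFold(fields[2], "AS"):
		spec.Level = strings.ToUpper(fields[3])
		return spec, nil
	default:
		return fromSpec{}, fmt.Errorf("malformed FROM line: %q", line)
	}
}

// shouldQuantize reports whether the model would be quantized under the rule
// in the description: the input is f32 or f16 and an AS level was given.
// fileType is a plain string here; how the real code detects it is not shown.
func shouldQuantize(fileType string, spec fromSpec) bool {
	if spec.Level == "" {
		return false
	}
	switch strings.ToUpper(fileType) {
	case "F32", "F16":
		return true
	}
	return false
}
```

With the example Modelfile line, `parseFrom` would yield the model path `/path/to/my/f32.bin` and the level `Q4_0`, and `shouldQuantize("f32", spec)` would report true. An input that is already quantized, or a `FROM` line without `AS`, would be left alone.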

🔄 This issue represents a GitHub Pull Request. It cannot be merged through Gitea due to API limitations.

GiteaMirror added the pull-request label 2025-11-12 14:53:20 -06:00
Reference: github-starred/ollama-ollama#8827