[PR #11119] [MERGED] Reapply "feat: incremental gguf parser (#10822)" (#11114) #23985

Closed
opened 2026-04-19 17:19:13 -05:00 by GiteaMirror · 0 comments

📋 Pull Request Information

Original PR: https://github.com/ollama/ollama/pull/11119
Author: @mxyng
Created: 6/18/2025
Status: Merged
Merged: 6/20/2025
Merged by: @mxyng

Base: main ← Head: mxyng/gguf-redo


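📝 Commits (2)

6a3c14f Reapply "feat: incremental gguf parser (#10822)" (#11114): https://github.com/ollama/ollama/commit/6a3c14f2f28dce0f83a3ed510da570ba154f2d5d
91d6aa7 fix older ggufs: https://github.com/ollama/ollama/commit/91d6aa73bebe0063eb5d8b2982af64bbf0159dd7
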
📊 Changes

13 files changed (+1362 additions, -169 deletions)

➕ fs/gguf/gguf.go (+347 -0)
➕ fs/gguf/gguf_test.go (+249 -0)
➕ fs/gguf/keyvalue.go (+90 -0)
➕ fs/gguf/keyvalue_test.go (+208 -0)
➕ fs/gguf/lazy.go (+89 -0)
➕ fs/gguf/reader.go (+23 -0)
➕ fs/gguf/tensor.go (+288 -0)
📝 go.mod (+1 -1)
📝 go.sum (+2 -2)
📝 server/images.go (+11 -15)
📝 server/images_test.go (+44 -129)
📝 server/quantization_test.go (+2 -10)
📝 server/sched_test.go (+8 -12)

📄 Description

The previous parser only supported GGUF version 3, while some older models are in GGUF version 2. V3 added big-endian support for GGUF files, which has no practical impact for Ollama.

Tested with alfred, codellama, codeup, orca2, and llava:7b, which are some of the oldest models published by Ollama.


🔄 This issue represents a GitHub Pull Request. It cannot be merged through Gitea due to API limitations.

GiteaMirror added the pull-request label 2026-04-19 17:19:13 -05:00
Reference: github-starred/ollama#23985