[PR #9826] [MERGED] runner: remove cache prompt flag from ollama runner #59762

Closed
opened 2026-04-29 14:42:13 -05:00 by GiteaMirror · 0 comments
Owner

📋 Pull Request Information

Original PR: https://github.com/ollama/ollama/pull/9826
Author: @BruceMacD
Created: 3/17/2025
Status: Merged
Merged: 3/17/2025
Merged by: @BruceMacD

Base: mainHead: brucemacd/no-ollama-cache-bool


📝 Commits (1)

  • c6c8984 runner: remove cache prompt flag from ollama runner

📊 Changes

3 files changed (+130 additions, -7 deletions)

View changed files

📝 runner/ollamarunner/cache.go (+1 -6)
📝 runner/ollamarunner/cache_test.go (+128 -0)
📝 runner/ollamarunner/runner.go (+1 -1)

📄 Description

We do not need to bypass the prompt caching in the ollama runner yet, as only embedding models needed to bypass the prompt caching. When embedding models are implemented they can skip initializing this cache completely.

This refactor came from discussion here: https://github.com/ollama/ollama/pull/9324#discussion_r1996418867


🔄 This issue represents a GitHub Pull Request. It cannot be merged through Gitea due to API limitations.

## 📋 Pull Request Information **Original PR:** https://github.com/ollama/ollama/pull/9826 **Author:** [@BruceMacD](https://github.com/BruceMacD) **Created:** 3/17/2025 **Status:** ✅ Merged **Merged:** 3/17/2025 **Merged by:** [@BruceMacD](https://github.com/BruceMacD) **Base:** `main` ← **Head:** `brucemacd/no-ollama-cache-bool` --- ### 📝 Commits (1) - [`c6c8984`](https://github.com/ollama/ollama/commit/c6c89849211969ee96ae8714b37e828909e4751b) runner: remove cache prompt flag from ollama runner ### 📊 Changes **3 files changed** (+130 additions, -7 deletions) <details> <summary>View changed files</summary> 📝 `runner/ollamarunner/cache.go` (+1 -6) 📝 `runner/ollamarunner/cache_test.go` (+128 -0) 📝 `runner/ollamarunner/runner.go` (+1 -1) </details> ### 📄 Description We do not need to bypass the prompt caching in the ollama runner yet, as only embedding models needed to bypass the prompt caching. When embedding models are implemented they can skip initializing this cache completely. This refactor came from discussion here: https://github.com/ollama/ollama/pull/9324#discussion_r1996418867 --- <sub>🔄 This issue represents a GitHub Pull Request. It cannot be merged through Gitea due to API limitations.</sub>
GiteaMirror added the pull-request label 2026-04-29 14:42:13 -05:00
Sign in to join this conversation.
1 Participants
Notifications
Due Date
No due date set.
Dependencies

No dependencies set.

Reference: github-starred/ollama#59762