[PR #7424] [CLOSED] boost embed endpoint #17687

Closed
opened 2026-04-16 06:11:04 -05:00 by GiteaMirror · 0 comments
Owner

📋 Pull Request Information

Original PR: https://github.com/ollama/ollama/pull/7424
Author: @liuy
Created: 10/30/2024
Status: Closed

Base: mainHead: boost-embed


📝 Commits (1)

📊 Changes

3 files changed (+47 additions, -62 deletions)

View changed files

📝 llama/runner/runner.go (+9 -1)
📝 llm/server.go (+18 -14)
📝 server/routes.go (+20 -47)

📄 Description

just get token numbers in the runner instead of route.

Even on following simplest request, I got nearly 20x boost.

curl http://localhost:11434/api/embed -d '{
"model": "all-minilm",
"input": ["Why is the sky blue?", "Why is the grass green?"]
}'

new approach: "total_duration":14239148
old approach: "total_duration":240871657

fix #7400


🔄 This issue represents a GitHub Pull Request. It cannot be merged through Gitea due to API limitations.

## 📋 Pull Request Information **Original PR:** https://github.com/ollama/ollama/pull/7424 **Author:** [@liuy](https://github.com/liuy) **Created:** 10/30/2024 **Status:** ❌ Closed **Base:** `main` ← **Head:** `boost-embed` --- ### 📝 Commits (1) - [`d24f0b1`](https://github.com/ollama/ollama/commit/d24f0b12b2d64dfb556512371762c7cdbdca5113) boost embed endpoint ### 📊 Changes **3 files changed** (+47 additions, -62 deletions) <details> <summary>View changed files</summary> 📝 `llama/runner/runner.go` (+9 -1) 📝 `llm/server.go` (+18 -14) 📝 `server/routes.go` (+20 -47) </details> ### 📄 Description just get token numbers in the runner instead of route. Even on following simplest request, I got nearly 20x boost. curl http://localhost:11434/api/embed -d '{ "model": "all-minilm", "input": ["Why is the sky blue?", "Why is the grass green?"] }' new approach: "total_duration":14239148 old approach: "total_duration":240871657 fix #7400 --- <sub>🔄 This issue represents a GitHub Pull Request. It cannot be merged through Gitea due to API limitations.</sub>
GiteaMirror added the pull-request label 2026-04-16 06:11:04 -05:00
Sign in to join this conversation.
1 Participants
Notifications
Due Date
No due date set.
Dependencies

No dependencies set.

Reference: github-starred/ollama#17687