[PR #4160] [MERGED] Fix no slots available error with concurrent requests #58100

Closed
opened 2026-04-29 12:55:28 -05:00 by GiteaMirror · 0 comments
Owner

📋 Pull Request Information

Original PR: https://github.com/ollama/ollama/pull/4160
Author: @jmorganca
Created: 5/5/2024
Status: Merged
Merged: 5/6/2024
Merged by: @jmorganca

Base: mainHead: jmorganca/sched-improvements


📝 Commits (7)

📊 Changes

1 file changed (+115 additions, -112 deletions)

View changed files

📝 llm/server.go (+115 -112)

📄 Description

This fixes a few issues with queuing requests:

  • Always wait for the server.cpp ready state to avoid "server busy" errors this should fix no slots available

Fixes #4159


🔄 This issue represents a GitHub Pull Request. It cannot be merged through Gitea due to API limitations.

## 📋 Pull Request Information **Original PR:** https://github.com/ollama/ollama/pull/4160 **Author:** [@jmorganca](https://github.com/jmorganca) **Created:** 5/5/2024 **Status:** ✅ Merged **Merged:** 5/6/2024 **Merged by:** [@jmorganca](https://github.com/jmorganca) **Base:** `main` ← **Head:** `jmorganca/sched-improvements` --- ### 📝 Commits (7) - [`e10299b`](https://github.com/ollama/ollama/commit/e10299bc6e4afb2c4c981998115d151b2ff6412c) Don't check server status as it will queue the request anyways - [`e0c64e5`](https://github.com/ollama/ollama/commit/e0c64e573e84a1c8a54b1df430a3a9aa12e6838e) remove retry on completion - [`aa23672`](https://github.com/ollama/ollama/commit/aa236720df73efdab071ca8d3c67f674ba24218c) fix build - [`ba9ff64`](https://github.com/ollama/ollama/commit/ba9ff6455ca505b7811ac9133076b52882bfd0f0) fix linter error - [`e99a433`](https://github.com/ollama/ollama/commit/e99a4339a457f3b250f6c472094fc60900c0669f) still check server status in case of hangs - [`401859b`](https://github.com/ollama/ollama/commit/401859b94ab4db6270a579d4c261474812ac90fe) cleanup - [`62be4e3`](https://github.com/ollama/ollama/commit/62be4e3ff0b0f82baaf488c543cfb0a297a77fa3) cleanup ### 📊 Changes **1 file changed** (+115 additions, -112 deletions) <details> <summary>View changed files</summary> 📝 `llm/server.go` (+115 -112) </details> ### 📄 Description This fixes a few issues with queuing requests: - Always wait for the server.cpp ready state to avoid "server busy" errors this should fix `no slots available` Fixes #4159 --- <sub>🔄 This issue represents a GitHub Pull Request. It cannot be merged through Gitea due to API limitations.</sub>
GiteaMirror added the pull-request label 2026-04-29 12:55:28 -05:00
Sign in to join this conversation.
1 Participants
Notifications
Due Date
No due date set.
Dependencies

No dependencies set.

Reference: github-starred/ollama#58100