[GH-ISSUE #2741] Using embedding and llm at the same time #1648

Closed
opened 2026-04-12 11:36:24 -05:00 by GiteaMirror · 1 comment
Owner

Originally created by @Bearsaerker on GitHub (Feb 25, 2024).
Original GitHub issue: https://github.com/ollama/ollama/issues/2741

Originally assigned to: @dhiltgen on GitHub.

I just have the question: can I use embedding and llm at the same time without it loading the embedding and unloading the llm? I wanted to use it in llama index and want to know if this works before trying to make it work

Originally created by @Bearsaerker on GitHub (Feb 25, 2024). Original GitHub issue: https://github.com/ollama/ollama/issues/2741 Originally assigned to: @dhiltgen on GitHub. I just have the question: can I use embedding and llm at the same time without it loading the embedding and unloading the llm? I wanted to use it in llama index and want to know if this works before trying to make it work
GiteaMirror added the feature request label 2026-04-12 11:36:24 -05:00
Author
Owner

@dhiltgen commented on GitHub (May 2, 2024):

concurrency support is now available in 0.1.33 as an experimental opt-in feature. See the release notes for instructions on enabling.

https://github.com/ollama/ollama/releases

<!-- gh-comment-id:2091846936 --> @dhiltgen commented on GitHub (May 2, 2024): concurrency support is now available in 0.1.33 as an experimental opt-in feature. See the release notes for instructions on enabling. https://github.com/ollama/ollama/releases
Sign in to join this conversation.
1 Participants
Notifications
Due Date
No due date set.
Dependencies

No dependencies set.

Reference: github-starred/ollama#1648