mirror of
https://github.com/open-webui/open-webui.git
synced 2026-06-02 23:08:26 -05:00
When two pages use two different models, the two models cannot be started at the same time. #888
Originally created by @ALixuhui on GitHub (May 11, 2024).
When two pages use two different models, the two models cannot be started at the same time.
I need to support multiple users calling different models concurrently. However, I've noticed that when several users chat with different models at the same time, Ollama keeps only one model loaded at a time. Whenever a user starts chatting with a different model, the current model is unloaded and the new one is loaded in its place. This slows down responses. How can I configure Ollama to load and keep multiple models at once, without unloading the others, to improve response speed?
@ALixuhui commented on GitHub (May 11, 2024):
Environment="OLLAMA_MAX_LOADED_MODELS=2"
Solved by setting the Ollama environment variable above, which lets Ollama keep up to two models loaded at the same time.
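To check that the setting took effect, one option is to send requests to two models in parallel and then ask Ollama which models it has loaded. The sketch below is a minimal, unofficial check, not part of the original fix. It assumes Ollama is reachable at its default address `http://localhost:11434` and that your Ollama version exposes the `/api/generate` and `/api/ps` endpoints. The helper names and the model names in the example comment are hypothetical.

```python
import json
import urllib.request
from concurrent.futures import ThreadPoolExecutor

# Assumption: Ollama listens on its default local address.
OLLAMA_URL = "http://localhost:11434"


def build_generate_request(model, prompt):
    """Build a non-streaming request body for POST /api/generate."""
    return {"model": model, "prompt": prompt, "stream": False}


def loaded_model_names(ps_response):
    """Extract the sorted model names from a GET /api/ps response body."""
    return sorted(m["name"] for m in ps_response.get("models", []))


def post_json(path, payload):
    """POST a JSON payload to Ollama and return the decoded JSON reply."""
    req = urllib.request.Request(
        OLLAMA_URL + path,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(req, timeout=300) as resp:
        return json.load(resp)


def get_json(path):
    """GET a path from Ollama and return the decoded JSON reply."""
    with urllib.request.urlopen(OLLAMA_URL + path, timeout=30) as resp:
        return json.load(resp)


def check_concurrent_models(models, prompt="Say hello."):
    """Query every model in parallel, then return the models Ollama reports as loaded."""
    with ThreadPoolExecutor(max_workers=len(models)) as pool:
        # list() forces all requests to finish before /api/ps is queried.
        list(pool.map(
            lambda m: post_json("/api/generate", build_generate_request(m, prompt)),
            models,
        ))
    return loaded_model_names(get_json("/api/ps"))


# Example (hypothetical model names; substitute two models you have pulled):
# print(check_concurrent_models(["llama3:latest", "mistral:latest"]))
```

If both model names appear in the returned list, both models stayed loaded after the parallel requests. If only one appears, the limit is probably still in effect, for example because the Ollama service was not restarted after the variable was set, or because there is not enough memory to hold both models.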