[GH-ISSUE #7370] [Solved] Load and Unload model #51194

Closed
opened 2026-04-28 18:54:10 -05:00 by GiteaMirror · 3 comments

Originally created by @Khampol on GitHub (Oct 26, 2024).
Original GitHub issue: https://github.com/ollama/ollama/issues/7370

If I load a model and then load another one, it looks like the first model is not unloaded. Why? 🤨 Is there a way to avoid keeping both loaded? It uses a LOT of my VRAM 😥

GiteaMirror added the question label 2026-04-28 18:54:10 -05:00

@rick-github commented on GitHub (Oct 26, 2024):

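ollama will not evict a model if it doesn't have to: if both models fit in VRAM, ollama sees nothing wrong with keeping both loaded. If you want to evict a model before its 5-minute timeout, you can use the [`stop`](https://github.com/ollama/ollama/blob/main/docs/faq.md#how-do-i-keep-a-model-loaded-in-memory-or-make-it-unload-immediately) command.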
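
For anyone who wants to do the same thing from a script rather than with `ollama stop <model>`: the FAQ section on keeping models in memory also describes sending a request with `keep_alive` set to `0` to `/api/generate`, which asks the server to unload the model immediately. Below is a minimal Python sketch of that request. The host `http://localhost:11434` (the default address) and the helper names `build_unload_request` and `unload_model` are assumptions made for illustration; they are not part of ollama itself.

```python
import json
import urllib.request

# Assumption: ollama is listening on its default local address.
DEFAULT_HOST = "http://localhost:11434"


def build_unload_request(model: str, host: str = DEFAULT_HOST) -> urllib.request.Request:
    """Build a POST to /api/generate asking the server to unload `model` now."""
    # keep_alive = 0 tells the server not to keep the model in memory.
    payload = {"model": model, "keep_alive": 0}
    return urllib.request.Request(
        url=f"{host}/api/generate",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def unload_model(model: str, host: str = DEFAULT_HOST) -> int:
    """Send the unload request and return the HTTP status code (hypothetical helper)."""
    with urllib.request.urlopen(build_unload_request(model, host)) as response:
        return response.status
```

Calling `unload_model("llama3.2")` (the model name is only an example) before loading the next model should free the VRAM held by the first one, provided the server is running and reachable.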


@Khampol commented on GitHub (Oct 26, 2024):

Hi,
This is the solution, thanks a lot! ❤️👍
It would be very useful to add a ComfyUI node for this!
(PS: I use ollama and ComfyUI at the same time, so VRAM is often very tight.)


@dhiltgen commented on GitHub (Oct 28, 2024):

The behavior of the scheduler is configurable and documented here - https://github.com/ollama/ollama/blob/main/docs/faq.md#how-does-ollama-handle-concurrent-requests
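
As a rough illustration of what "configurable" can look like here: the linked FAQ section lists server settings such as `OLLAMA_MAX_LOADED_MODELS`, which are read from the environment when the server starts. The Python sketch below builds an environment with that variable set to `1` and starts `ollama serve` with it. The assumption is that a limit of one loaded model makes the scheduler unload the current model before loading another. The helper names `scheduler_env` and `start_server` are hypothetical.

```python
import os
import subprocess


def scheduler_env(max_loaded_models: int = 1, base_env=None) -> dict:
    """Return a copy of the environment with OLLAMA_MAX_LOADED_MODELS set."""
    env = dict(os.environ if base_env is None else base_env)
    # Environment variables are strings, so convert the limit explicitly.
    env["OLLAMA_MAX_LOADED_MODELS"] = str(max_loaded_models)
    return env


def start_server(max_loaded_models: int = 1) -> subprocess.Popen:
    """Start `ollama serve` with the limit applied (assumes `ollama` is on PATH)."""
    return subprocess.Popen(["ollama", "serve"], env=scheduler_env(max_loaded_models))
```

If ollama runs as a system service instead, the same variable would need to be set in the service's own environment rather than through a wrapper script like this.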


Reference: github-starred/ollama#51194