[GH-ISSUE #1951] Ollama GPU Process does not automatically terminate after inactivity #47637

Closed
opened 2026-04-28 04:39:24 -05:00 by GiteaMirror · 3 comments
Owner

Originally created by @chereszabor on GitHub (Jan 12, 2024).
Original GitHub issue: https://github.com/ollama/ollama/issues/1951

Originally assigned to: @dhiltgen on GitHub.

Noticed with recent releases the ollama process does not get automatically terminated after a period of inactivity, idling the GPU process and keeping the last used model in VRAM. This also increases the time required to load a new model into VRAM and increases 'standby' power usage of the GPU.

I am deploying ollama via Docker and tested with the latest version v0.1.20.

Originally created by @chereszabor on GitHub (Jan 12, 2024). Original GitHub issue: https://github.com/ollama/ollama/issues/1951 Originally assigned to: @dhiltgen on GitHub. Noticed with recent releases the ollama process does not get automatically terminated after a period of inactivity, idling the GPU process and keeping the last used model in VRAM. This also increases the time required to load a new model into VRAM and increases 'standby' power usage of the GPU. I am deploying ollama via Docker and tested with the latest version v0.1.20.
Author
Owner

@newjcj commented on GitHub (Jan 13, 2024):

me to!!!

<!-- gh-comment-id:1890327212 --> @newjcj commented on GitHub (Jan 13, 2024): me to!!!
Author
Owner

@cassidea commented on GitHub (Jan 17, 2024):

Same here. Model gets unloaded after some time but still ~120MB on the GPU preventing to switch into lower power states.

<!-- gh-comment-id:1895892869 --> @cassidea commented on GitHub (Jan 17, 2024): Same here. Model gets unloaded after some time but still ~120MB on the GPU preventing to switch into lower power states.
Author
Owner

@dhiltgen commented on GitHub (Jan 18, 2024):

Closing as dup of #1848

<!-- gh-comment-id:1898867155 --> @dhiltgen commented on GitHub (Jan 18, 2024): Closing as dup of #1848
Sign in to join this conversation.
1 Participants
Notifications
Due Date
No due date set.
Dependencies

No dependencies set.

Reference: github-starred/ollama#47637