[GH-ISSUE #11003] Freeing VRAM when Whisper is not being used #16112

Closed
opened 2026-04-19 22:08:08 -05:00 by GiteaMirror · 0 comments
Owner

Originally created by @mbeltagy on GitHub (Feb 28, 2025).
Original GitHub issue: https://github.com/open-webui/open-webui/issues/11003

Currently, Whisper works very nicely on the GPU. However, once it has taken up VRAM, it never releases it, even when it is no longer in use. It would be a nice feature if we could release VRAM after a few minutes of inactivity, similar to how Ollama handles this.

Originally created by @mbeltagy on GitHub (Feb 28, 2025). Original GitHub issue: https://github.com/open-webui/open-webui/issues/11003 Currently, Whisper works very nicely on the GPU. However, once it has taken up VRAM, it never releases it, even when it is no longer in use. It would be a nice feature if we could release VRAM after a few minutes of inactivity, similar to how Ollama handles this.
Sign in to join this conversation.
1 Participants
Notifications
Due Date
No due date set.
Dependencies

No dependencies set.

Reference: github-starred/open-webui#16112