mirror of
https://github.com/open-webui/open-webui.git
synced 2026-05-08 04:16:03 -05:00
[GH-ISSUE #14809] feat: unload STT model from memory #32899
Originally created by @lrnd1 on GitHub (Jun 9, 2025).
Original GitHub issue: https://github.com/open-webui/open-webui/issues/14809
Check Existing Issues
Problem Description
I am running Open WebUI in Docker on a Mac mini.
I've been trying out different local STT models and noticed that they don't get unloaded from RAM unless I restart the container.
The models are only loaded when the voice function is used, which is good. However, once loaded, a model seemingly never gets unloaded and keeps its RAM reserved.
I also noticed what seems to be a minor bug: when switching between STT models, the previous model still doesn't get unloaded from RAM.
Also, when I switch from a working model to one that should not work (for example, because its vocabulary file is missing), the previous model is used instead of an error being thrown.
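To make the expected switching behavior concrete, here is a minimal sketch. It is not Open WebUI's actual code: `ModelSlot` and the `loader` callback are hypothetical names, and the loader stands in for whatever constructs the STT model. The point is that the old model is released before the new one is loaded, and a failed load surfaces as an error rather than falling back to the previous model.

```python
import gc
from typing import Any, Callable, Optional


class ModelSlot:
    """Holds at most one loaded model (hypothetical helper for illustration)."""

    def __init__(self, loader: Callable[[str], Any]) -> None:
        # `loader` is assumed to build the STT model for a given name
        # and to raise if the model cannot be loaded.
        self._loader = loader
        self.name: Optional[str] = None
        self.model: Any = None

    def switch(self, name: str) -> Any:
        # Reuse the current model if the same one is requested again.
        if name == self.name and self.model is not None:
            return self.model
        # Drop the old model first so it cannot act as a silent fallback.
        self.name, self.model = None, None
        gc.collect()
        # Any exception raised by the loader propagates to the caller.
        self.model = self._loader(name)
        self.name = name
        return self.model
```

With this shape, switching to a model whose files are missing leaves the slot empty and raises, so the problem is visible in the UI or logs.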
Desired Solution you'd like
It would be convenient if the model were simply unloaded from memory once the voice or dictation functions are no longer in use.
An even better solution would be an option to set how long the model remains loaded after its last use. This would be particularly useful when several dictations are processed in a row, since reloading the model for each one would be impractical.
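The keep-alive idea could look roughly like the sketch below. It is an illustration under assumptions, not the project's implementation: `IdleUnloadingModel`, `loader`, and `keep_alive_seconds` are made-up names. Each use restarts an idle timer, and the model reference is dropped only when the timer expires without a newer use.

```python
import gc
import threading
from typing import Any, Callable, Optional


class IdleUnloadingModel:
    """Lazily loads a model and drops it after an idle period (hypothetical)."""

    def __init__(self, loader: Callable[[], Any], keep_alive_seconds: float) -> None:
        self._loader = loader              # assumed to build the STT model
        self._keep_alive = keep_alive_seconds
        self._lock = threading.Lock()
        self._model: Any = None
        self._timer: Optional[threading.Timer] = None
        self._generation = 0               # identifies the most recent use

    def acquire(self) -> Any:
        """Return the model, loading it if needed, and restart the idle timer."""
        with self._lock:
            if self._timer is not None:
                self._timer.cancel()
            if self._model is None:
                self._model = self._loader()
            self._generation += 1
            timer = threading.Timer(
                self._keep_alive, self._unload, args=(self._generation,)
            )
            timer.daemon = True
            timer.start()
            self._timer = timer
            return self._model

    def _unload(self, generation: int) -> None:
        with self._lock:
            # Ignore a stale timer that fired after a newer use rescheduled it.
            if generation != self._generation:
                return
            self._model = None
            self._timer = None
        gc.collect()

    @property
    def loaded(self) -> bool:
        with self._lock:
            return self._model is not None
```

A caller would use `holder.acquire()` for each transcription. Note that dropping the Python reference only makes the memory eligible for release; whether the process actually returns it to the operating system depends on the STT library and the allocator.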
Alternatives Considered
No response
Additional Context
No response
@tjbck commented on GitHub (Jun 16, 2025):
Should be addressed with 72df23ed79
@lrnd1 commented on GitHub (Jun 18, 2025):
v0.6.15 did not solve the issue.
After using STT, CPU usage goes back to normal, but memory consumption stays high.