mirror of
https://github.com/open-webui/open-webui.git
synced 2026-03-11 00:04:08 -05:00
issue: Web UI performance issues #5155
Reference in New Issue
Block a user
Delete Branch "%!s()"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
Originally created by @ErmakovDmitriy on GitHub (May 13, 2025).
Check Existing Issues
Installation Method
Docker
Open WebUI Version
v0.6.9
Ollama Version (if applicable)
0.6.8 (but likely not applicable)
Operating System
Fedora 43
Browser (if applicable)
Chromium 136.0.7103.92
Confirmation
README.md.Expected Behavior
The Web UI part should not use that much CPU and RAM.
Actual Behavior
While generating an answer (on a remote OLLAMA) the browser consumes 100% of one CPU core (I suspect that it could have consumed more but it is limited by the single thread of Javascript).
After a while (1-10 minutes) the RAM consumption becomes quite high which is not a thing I expect.
It impossible to get any response as the browser is essentially freezes after 1 minute of executing (rendering) response from an ML model.
In addition to that, it is unclear to me why
/api/modelsrequest returns UUIDs of all the files in knowledge base. With the number of files in my knowledge base about 60000+, it makes the response about ~43-50MB which is a lot, if one wants to use the Open WebUI via mobile connection and a lot to Parse at the client browser side.Steps to Reproduce
Logs & Screenshots
chromium-console.log
No errors, warnings in server log.
Additional Information
Adding
/nothinkto the prompt or system prompt (to force the QWEN model to disablethinkingmode and immediately generate response) somewhat reduces load but the Web UI is still slow-ish.