mirror of
https://github.com/open-webui/open-webui.git
synced 2026-05-06 02:48:13 -05:00
[GH-ISSUE #19007] issue: Performance regression in Open WebUI v0.6.36 – noticeable slowdown with local models #18744
Reference in New Issue
Block a user
Delete Branch "%!s()"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
Originally created by @manhtv46k55 on GitHub (Nov 7, 2025).
Original GitHub issue: https://github.com/open-webui/open-webui/issues/19007
Check Existing Issues
Installation Method
Pip Install
Open WebUI Version
0.6.36
Ollama Version (if applicable)
0.12.10
Operating System
Windows 10
Browser (if applicable)
Chrome 142.0.7444.135
Confirmation
README.md.Expected Behavior
In Open WebUI v0.6.36, running a local model should maintain similar responsiveness as in v0.6.31.
Queries should be processed and streamed without noticeable delay compared to previous versions.
Actual Behavior
In v0.6.36, the same local model is significantly slower than in v0.6.31.
After sending a query, the system waits much longer before starting to stream responses.
The slowdown appears even on simple queries.
Steps to Reproduce
Start Open WebUI v0.6.36 and load a local model.
Ask a question via the chat interface.
The system takes noticeably longer to respond compared to v0.6.31.
Logs & Screenshots
Additional Information
Windows 10
Ram 64GB
RTX 4070s 12GB vRam
Run with CPU not u GPU
Python 3.11