mirror of
https://github.com/open-webui/open-webui.git
synced 2026-05-07 19:38:46 -05:00
[GH-ISSUE #16938] issue: response latency increases by 6s when any file is attached (captions, pasted files, etc.) #18103
Reference in New Issue
Block a user
Delete Branch "%!s()"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
Originally created by @lucyknada on GitHub (Aug 26, 2025).
Original GitHub issue: https://github.com/open-webui/open-webui/issues/16938
Check Existing Issues
Installation Method
Docker
Open WebUI Version
v0.6.25
Ollama Version (if applicable)
No response
Operating System
debian 13
Browser (if applicable)
No response
Confirmation
README.md.Expected Behavior
Queries should be near-instant with or without file attachments, similar to when pasting the content directly into the prompt.
Actual Behavior
As soon as any file is attached (txt file, youtube caption file, etc.), each query takes ~6 seconds, even for simple prompts. Without files, responses are instant.
Steps to Reproduce
Logs & Screenshots
Additional Information
@tjbck commented on GitHub (Aug 26, 2025):
File processing will take extra time.
@lucyknada commented on GitHub (Aug 26, 2025):
@tjbck a file already has been uploaded, it has to re-process on every single turn?
@Classic298 commented on GitHub (Aug 27, 2025):
Yes
@lucyknada commented on GitHub (Aug 27, 2025):
@Classic298 whats causing this to take 6 seconds though? the 22m embedding model cant take that long on modern hardware especially on tiny amounts of text, is it the reranking?
@Classic298 commented on GitHub (Aug 27, 2025):
Depends on the file.
For me it rarely takes 6 seconds
Depends on the file type and file size how long content extraction will take
@rgaricano commented on GitHub (Aug 27, 2025):
mesured on my end (low & slow) in dev & DEBUG mode (this causes all embeds to be logged, json files generate thousands of entries, which also slows down the process and makes logging difficult because the journal bundles those excessive lines)