mirror of
https://github.com/open-webui/open-webui.git
synced 2026-05-11 08:22:09 -05:00
[GH-ISSUE #23057] issue: Files uploaded with "Use entire document" force context refresh every reply #19875
Reference in New Issue
Block a user
Delete Branch "%!s()"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
Originally created by @frenzybiscuit on GitHub (Mar 26, 2026).
Original GitHub issue: https://github.com/open-webui/open-webui/issues/23057
Check Existing Issues
Installation Method
Docker Cuda
Open WebUI Version
0.8.11
Ollama Version (if applicable)
N/A
Operating System
Linux
Browser (if applicable)
Firefox
Confirmation
README.md.Expected Behavior
Does not reprocess context every reply
Actual Behavior
Reprocesses context every reply
Steps to Reproduce
If you take a large script (say 50k context) and shove it into the system prompt of a model, it works fine. Context doesn't reprocess every reply.
If you take that same script and upload it into OWUI and use "Entire Document" it will force refresh the context every reply. This leads to very long wait times between replies. It's quite frustrating.
I have verified this happens on both vllm and llamacpp (ik_llamacpp).
This happens all the time with SillyTavern when you use lorebooks, but only if you inject the new context at a depth that causes reprocessing. I'm assuming the same thing happens here.
Logs & Screenshots
Additional Information
No response
@adhusch commented on GitHub (Mar 27, 2026):
Hi @frenzybiscuit ,
did you try setting RAG_SYSTEM_CONTEXT=True ?
Best
@frenzybiscuit commented on GitHub (Mar 27, 2026):
No, but I'm not using rag. I'm using the entire document in full context according to OWUI (entire document).
Would setting RAG_SYSTEM_CONTEXT=True fix this?
@frenzybiscuit commented on GitHub (Mar 27, 2026):
I mean yes I use rag, but that's not what this option does..
@frenzybiscuit commented on GitHub (Mar 27, 2026):
Will try and let you know.
@frenzybiscuit commented on GitHub (Mar 27, 2026):
That does fix it, yes...
Is full context injected into the prompt like regular rag?
Well, that solved my issue.