mirror of
https://github.com/open-webui/open-webui.git
synced 2026-05-06 10:58:17 -05:00
[PR #22678] [CLOSED] fix: optimize file handling for full context mode to reduce chat latency #49858
📋 Pull Request Information
Original PR: https://github.com/open-webui/open-webui/pull/22678
Author: @a86582751
Created: 3/14/2026
Status: ❌ Closed
Base: dev ← Head: fix-full-context-bypass-rag
📝 Commits (10+)
fe6783c Merge pull request #19030 from open-webui/dev
fc05e0a Merge pull request #19405 from open-webui/dev
e3faec6 Merge pull request #19416 from open-webui/dev
9899293 Merge pull request #19448 from open-webui/dev
140605e Merge pull request #19462 from open-webui/dev
6f1486f Merge pull request #19466 from open-webui/dev
d95f533 Merge pull request #19729 from open-webui/dev
a727153 0.6.43 (#20093)
6adde20 Merge pull request #20394 from open-webui/dev
f9b0534 Merge pull request #20522 from open-webui/dev
📊 Changes
1 file changed (+100 additions, -33 deletions)
📝 backend/open_webui/utils/middleware.py (+100 -33)
📄 Description
Pull Request Checklist
Before submitting, make sure you've checked the following:
dev branch. PRs targeting main will be immediately closed.
Files model.
fix: prefix for bug fix.
Changelog Entry
Description
Fixes the issue where sending messages with large files in 'Full Context' mode causes significant delays. When users select 'Full Context' mode for uploaded files, the system was still generating queries for RAG retrieval, serializing large file content in frontend JSON, and emitting unnecessary status updates. This resulted in noticeable delays when chatting with documents.
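The short-circuit this PR describes can be illustrated with a minimal, self-contained sketch. It is not the code from `middleware.py`: `FileItem`, `collect_sources`, and the `retrieve` callable are hypothetical stand-ins, and only the condition (`all_full_context` or `bypass_embedding`) is taken from the PR text.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass
class FileItem:
    # Hypothetical stand-in for a file attached to a chat message.
    name: str
    content: str
    full_context: bool = False  # True when the user selected "Full Context"


def collect_sources(
    files: list[FileItem],
    bypass_embedding: bool,
    retrieve: Callable[[list[FileItem]], list[dict]],
) -> list[dict]:
    """Return prompt sources, skipping retrieval when it is not needed."""
    # An empty file list is not treated as "all full context".
    all_full_context = bool(files) and all(f.full_context for f in files)

    if all_full_context or bypass_embedding:
        # Fast path (assumed behavior): pass file contents through directly,
        # with no query generation, retrieval call, or status events.
        return [{"name": f.name, "document": f.content} for f in files]

    # Slow path: hand off to the retrieval pipeline (stand-in callable).
    return retrieve(files)
```

Passing the retrieval step in as a callable keeps the sketch independent of the real pipeline and makes it easy to confirm that the fast path never invokes it.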
Fixed
Changed
chat_completion_files_handler in backend/open_webui/utils/middleware.py
Additional Information
Root Cause:
The original code checked all_full_context but still called get_sources_from_items with full_context=True, which triggered the entire RAG pipeline, including query generation and status emissions. Additionally, the frontend was serializing large file content in JSON, causing significant delays.
Solution:
When all_full_context is True OR bypass_embedding is True:
Testing Results:
Files Changed:
backend/open_webui/utils/middleware.py (+39 lines, -19 lines)
Contributor License Agreement
🔄 This issue represents a GitHub Pull Request. It cannot be merged through Gitea due to API limitations.