mirror of
https://github.com/open-webui/open-webui.git
synced 2026-05-06 19:08:59 -05:00
[PR #6122] [CLOSED] perf: Improve Speed of Document Insertion in Knowledge Bases #21826
📋 Pull Request Information
Original PR: https://github.com/open-webui/open-webui/pull/6122
Author: @RobinBially
Created: 10/11/2024
Status: ❌ Closed
Base: dev ← Head: main
📝 Commits (3)
e8babe6 refac
0165bff Merge branch 'open-webui:main' into main
57b2e62 reuse vectors when inserting file to collection
📊 Changes
4 files changed (+47 additions, -6 deletions)
📝 backend/open_webui/apps/retrieval/main.py (+36 -4)
📝 backend/open_webui/apps/retrieval/vector/dbs/qdrant.py (+7 -1)
📝 backend/open_webui/apps/retrieval/vector/main.py (+2 -0)
📝 backend/open_webui/apps/webui/routers/auths.py (+2 -1)
📄 Description
Description
This pull request speeds up document insertion into knowledge bases by eliminating a redundant embedding pass. Previously, each document was embedded twice: once when the file was processed and again when it was added to the collection. Now the vectors computed for the file are reused for the collection, which reduces overall processing time. So far, this optimization is implemented only for Qdrant.
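The idea can be illustrated with a minimal, self-contained sketch. It is not the code from this PR and does not use the Qdrant client: `InMemoryVectorStore`, `CountingEmbedder`, `index_file`, `add_file_to_knowledge`, and the `file-<id>` collection naming are all hypothetical stand-ins, chosen only to show vectors being copied from a per-file collection into a knowledge collection without a second embedding call.

```python
from dataclasses import dataclass, field


@dataclass
class Item:
    id: str
    text: str
    vector: list[float]


@dataclass
class InMemoryVectorStore:
    """Toy stand-in for a vector database (hypothetical, not the Qdrant client)."""
    collections: dict[str, list[Item]] = field(default_factory=dict)

    def insert(self, collection: str, items: list[Item]) -> None:
        self.collections.setdefault(collection, []).extend(items)

    def get(self, collection: str) -> list[Item]:
        # Return a copy so callers cannot mutate the stored list.
        return list(self.collections.get(collection, []))


class CountingEmbedder:
    """Fake embedding function that counts how many texts it has embedded."""

    def __init__(self) -> None:
        self.calls = 0

    def embed(self, texts: list[str]) -> list[list[float]]:
        self.calls += len(texts)
        # Deterministic dummy vectors; a real model would be called here.
        return [[float(len(t)), float(sum(map(ord, t)) % 97)] for t in texts]


def index_file(store, embedder, file_id: str, chunks: list[str]) -> None:
    """First (and only) embedding pass: store chunks in a per-file collection."""
    vectors = embedder.embed(chunks)
    items = [
        Item(id=f"{file_id}-{i}", text=chunk, vector=vec)
        for i, (chunk, vec) in enumerate(zip(chunks, vectors))
    ]
    store.insert(f"file-{file_id}", items)


def add_file_to_knowledge(store, embedder, file_id: str, knowledge_id: str) -> int:
    """Copy the file's stored vectors into the knowledge collection.

    The embedder is accepted only to show that it is never called here.
    """
    existing = store.get(f"file-{file_id}")
    if not existing:
        raise LookupError(f"no stored vectors for file {file_id!r}")
    store.insert(knowledge_id, existing)  # reuse vectors, no re-embedding
    return len(existing)
```

In this sketch, calling `index_file` and then `add_file_to_knowledge` leaves `embedder.calls` equal to the number of chunks, whereas re-embedding for the collection would double it.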
Changelog Entry
Changed
Performance
Additional Information
Because each document is now embedded once instead of twice, the saving is most noticeable for large files and when inserting a high volume of documents. The optimization is currently limited to Qdrant but may be extended to other vector databases in future updates.
Checklist
Testing
Additional Notes
🔄 This issue represents a GitHub Pull Request. It cannot be merged through Gitea due to API limitations.