mirror of
https://github.com/open-webui/open-webui.git
synced 2026-05-07 19:38:46 -05:00
[GH-ISSUE #23730] feat: Add configuration option RAG_RERANKING_BATCH_SIZE for reranker batch size #20055
Reference in New Issue
Block a user
Delete Branch "%!s()"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
Originally created by @oofnikj on GitHub (Apr 14, 2026).
Original GitHub issue: https://github.com/open-webui/open-webui/issues/23730
Check Existing Issues
Verify Feature Scope
Problem Description
Currently, the RAG reranker batch size in
ColBERT.predict()is hard-coded asbsize=32and cannot be changed without editing the code. This makes it difficult for users to tune performance or adapt to different hardware constraints, for example when running in a containerized environment without access to GPU. Making this option configurable will enable the use of SoTA rerankers in a fully offline RAG pipeline configuration.Desired Solution you'd like
Allow the internal reranker batch size to be set through a config value (such as
RAG_RERANKING_BATCH_SIZE), loaded from environment variables or config file, similar to howRAG_EMBEDDING_BATCH_SIZEworks. If not set, the current default (32) can be retained.Alternatives Considered
Additional Context
backend/open_webui/retrieval/models/colbert.pyline:bsize=32inpredict()RAG_EMBEDDING_BATCH_SIZEconfig already exists@Classic298 commented on GitHub (Apr 16, 2026):
4d2f189810