[PR #20103] [CLOSED] fix: pass batch_size to sentence-transformers encode method #48517

Closed
opened 2026-04-30 00:30:48 -05:00 by GiteaMirror · 0 comments

📋 Pull Request Information

Original PR: https://github.com/open-webui/open-webui/pull/20103
Author: @majiayu000
Created: 12/22/2025
Status: Closed

Base: dev ← Head: fix/embedding-batch-size-20053


📝 Commits (9)

📊 Changes

3 files changed (+101 additions, -1 deletions)

View changed files

📝 backend/open_webui/retrieval/utils.py (+3 -1)
backend/open_webui/test/retrieval/__init__.py (+0 -0)
backend/open_webui/test/retrieval/test_utils.py (+98 -0)

📄 Description

Pull Request Checklist

  • Target branch: Verify that the pull request targets the dev branch.
  • Description: Provided below.
  • Testing: Manually tested with large documents.
  • Agentic AI Code: This PR has gone through human review and manual testing.
  • Code review: Self-reviewed.

Changelog Entry

Description

  • Pass batch_size parameter to embedding_function.encode() for SentenceTransformers
  • Allow users to control memory usage by configuring Embedding Batch Size in Admin Settings

Fixed

  • Fixed high memory usage when embedding large documents by respecting the batch_size setting (#20053)
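The change described above can be sketched roughly as follows. This is a minimal illustration, not the code from the PR: `FakeSentenceEncoder` and `embed_texts` are hypothetical names, and the fake encoder only imitates the `encode(sentences, batch_size=...)` shape of a SentenceTransformers model so that the example runs without the library installed. The real `SentenceTransformer.encode()` accepts a `batch_size` argument; everything else here is an assumption made for illustration.

```python
from typing import List, Optional, Sequence


class FakeSentenceEncoder:
    """Hypothetical stand-in for a SentenceTransformers-style model.

    It copies only the `encode(sentences, batch_size=...)` shape and records
    how many texts each internal batch held.
    """

    def __init__(self) -> None:
        self.batch_lengths: List[int] = []

    def encode(self, sentences: Sequence[str], batch_size: int = 32) -> List[List[float]]:
        vectors: List[List[float]] = []
        for start in range(0, len(sentences), batch_size):
            batch = sentences[start:start + batch_size]
            self.batch_lengths.append(len(batch))
            # Toy "embedding": a single float per text (its character count).
            vectors.extend([[float(len(text))] for text in batch])
        return vectors


def embed_texts(model, texts: Sequence[str], batch_size: Optional[int] = None):
    """Forward the configured batch size to encode() when one is set."""
    if batch_size is None:
        # No setting supplied: the model's own default batch size applies.
        return model.encode(texts)
    return model.encode(texts, batch_size=batch_size)


encoder = FakeSentenceEncoder()
embeddings = embed_texts(encoder, [f"chunk {i}" for i in range(10)], batch_size=4)
print(encoder.batch_lengths)  # [4, 4, 2]
print(len(embeddings))        # 10
```

With a smaller `batch_size`, fewer texts are held in a single forward pass, which is the lever the description refers to for limiting RAM/VRAM use on large documents.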

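Additional Information

  • Fixes #20053
  • Users can configure Embedding Batch Size in Admin Settings > Documents to control RAM/VRAM usage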

Contributor License Agreement

By submitting this pull request, I confirm that I have read and fully agree to the Contributor License Agreement (CLA), and I am providing my contributions under its terms.


🔄 This issue represents a GitHub Pull Request. It cannot be merged through Gitea due to API limitations.

## 📋 Pull Request Information

**Original PR:** https://github.com/open-webui/open-webui/pull/20103
**Author:** [@majiayu000](https://github.com/majiayu000)
**Created:** 12/22/2025
**Status:** ❌ Closed

**Base:** `dev` ← **Head:** `fix/embedding-batch-size-20053`

---

### 📝 Commits (9)

- [`fe6783c`](https://github.com/open-webui/open-webui/commit/fe6783c16699911c7be17392596d579333fb110c) Merge pull request #19030 from open-webui/dev
- [`fc05e0a`](https://github.com/open-webui/open-webui/commit/fc05e0a6c5d39da60b603b4d520f800d6e36f748) Merge pull request #19405 from open-webui/dev
- [`e3faec6`](https://github.com/open-webui/open-webui/commit/e3faec62c58e3a83d89aa3df539feacefa125e0c) Merge pull request #19416 from open-webui/dev
- [`9899293`](https://github.com/open-webui/open-webui/commit/9899293f050ad50ae12024cbebee7e018acd851e) Merge pull request #19448 from open-webui/dev
- [`140605e`](https://github.com/open-webui/open-webui/commit/140605e660b8186a7d5c79fb3be6ffb147a2f498) Merge pull request #19462 from open-webui/dev
- [`6f1486f`](https://github.com/open-webui/open-webui/commit/6f1486ffd0cb288d0e21f41845361924e0d742b3) Merge pull request #19466 from open-webui/dev
- [`d95f533`](https://github.com/open-webui/open-webui/commit/d95f533214e3fe5beb5e41ec1f349940bc4c7043) Merge pull request #19729 from open-webui/dev
- [`a727153`](https://github.com/open-webui/open-webui/commit/a7271532f8a38da46785afcaa7e65f9a45e7d753) 0.6.43 (#20093)
- [`b440a95`](https://github.com/open-webui/open-webui/commit/b440a951a5fd369f4d4bc60679349d0ab753d812) fix: pass batch_size to sentence-transformers encode method

### 📊 Changes

**3 files changed** (+101 additions, -1 deletions)

<details>
<summary>View changed files</summary>

📝 `backend/open_webui/retrieval/utils.py` (+3 -1)
➕ `backend/open_webui/test/retrieval/__init__.py` (+0 -0)
➕ `backend/open_webui/test/retrieval/test_utils.py` (+98 -0)

</details>

### 📄 Description

# Pull Request Checklist

- [x] **Target branch:** Verify that the pull request targets the `dev` branch.
- [x] **Description:** Provided below.
- [x] **Testing:** Manually tested with large documents.
- [x] **Agentic AI Code:** This PR has gone through human review and manual testing.
- [x] **Code review:** Self-reviewed.

# Changelog Entry

### Description

- Pass `batch_size` parameter to `embedding_function.encode()` for SentenceTransformers
- Allow users to control memory usage by configuring Embedding Batch Size in Admin Settings

### Fixed

- Fixed high memory usage when embedding large documents by respecting the batch_size setting (#20053)

---

### Additional Information

- Fixes #20053
- Users can configure Embedding Batch Size in Admin Settings > Documents to control RAM/VRAM usage

### Contributor License Agreement

By submitting this pull request, I confirm that I have read and fully agree to the [Contributor License Agreement (CLA)](https://github.com/open-webui/open-webui/blob/main/CONTRIBUTOR_LICENSE_AGREEMENT), and I am providing my contributions under its terms.

---

<sub>🔄 This issue represents a GitHub Pull Request. It cannot be merged through Gitea due to API limitations.</sub>
GiteaMirror added the pull-request label 2026-04-30 00:30:48 -05:00

Reference: github-starred/open-webui#48517