[PR #12845] [CLOSED] feat: Add parameter BYPASS_WEB_LOADING_FOR_WEB_SEARCH to skip web page loading in search #23033

Closed
opened 2026-04-20 04:35:13 -05:00 by GiteaMirror · 0 comments
Owner

📋 Pull Request Information

Original PR: https://github.com/open-webui/open-webui/pull/12845
Author: @Youggls
Created: 4/14/2025
Status: Closed

Base: devHead: dev


📝 Commits (3)

  • cf1c43d feat: Add parameter to skip web page loading in search, add empty web loader.
  • c92165e fix: use BaseLoader as the SafeEmptyLoader baseclass.
  • 5051ea9 fix: fix the init functino of SaveEmptyLoader.

📊 Changes

5 files changed (+73 additions, -0 deletions)

View changed files

📝 backend/open_webui/config.py (+6 -0)
📝 backend/open_webui/main.py (+4 -0)
📝 backend/open_webui/retrieval/web/utils.py (+36 -0)
📝 backend/open_webui/routers/retrieval.py (+7 -0)
📝 src/lib/components/admin/Settings/WebSearch.svelte (+20 -0)

📄 Description

feat: Add parameter BYPASS_WEB_LOADING_FOR_WEB_SEARCH to skip web page loading in search, add empty web loader.

Pull Request Checklist

Before submitting, make sure you've checked the following:

  • Target branch: Please verify that the pull request targets the dev branch.
  • Description: Provide a concise description of the changes made in this pull request.
  • Changelog: Ensure a changelog entry following the format of Keep a Changelog is added at the bottom of the PR description.
  • Documentation: Have you updated relevant documentation Open WebUI Docs, or other documentation sources?
  • Dependencies: Are there any new dependencies? Have you updated the dependency versions in the documentation?
  • Testing: Have you written and run sufficient tests to validate the changes?
  • Code review: Have you performed a self-review of your code, addressing any coding standard issues and ensuring adherence to the project's coding standards?
  • Prefix: To clearly categorize this pull request, prefix the pull request title using one of the following:
    • BREAKING CHANGE: Significant changes that may affect compatibility
    • build: Changes that affect the build system or external dependencies
    • ci: Changes to our continuous integration processes or workflows
    • chore: Refactor, cleanup, or other non-functional code changes
    • docs: Documentation update or addition
    • feat: Introduces a new feature or enhancement to the codebase
    • fix: Bug fix or error correction
    • i18n: Internationalization or localization changes
    • perf: Performance improvement
    • refactor: Code restructuring for better maintainability, readability, or scalability
    • style: Changes that do not affect the meaning of the code (white space, formatting, missing semi-colons, etc.)
    • test: Adding missing tests or correcting existing tests
    • WIP: Work in progress, a temporary label for incomplete or ongoing work

Changelog Entry

Description

This PR implements a parameter to control web page loading functionality during online searches. The goal is to accelerate web search by skipping the actual page loading process in scenarios where it can become a bottleneck.

Added

  • Added a parameter BYPASS_WEB_LOADING_FOR_WEB_SEARCH to toggle web page loading during web searches
  • Implemented SafeEmptyLoader to initialize Document objects directly with search engine snippets

Changed

  • Modified the search process to optionally bypass web page loading and use search engine snippets directly as page content

Performance

  • Improved search performance by reducing network requests and processing time when full page content isn't necessary

Additional Information

Screenshots or Videos

image
image


🔄 This issue represents a GitHub Pull Request. It cannot be merged through Gitea due to API limitations.

## 📋 Pull Request Information **Original PR:** https://github.com/open-webui/open-webui/pull/12845 **Author:** [@Youggls](https://github.com/Youggls) **Created:** 4/14/2025 **Status:** ❌ Closed **Base:** `dev` ← **Head:** `dev` --- ### 📝 Commits (3) - [`cf1c43d`](https://github.com/open-webui/open-webui/commit/cf1c43da3760025ade2ca2fde3a86ea5949c5508) feat: Add parameter to skip web page loading in search, add empty web loader. - [`c92165e`](https://github.com/open-webui/open-webui/commit/c92165e85fe1bdad123eaac5eb5cd9b3b3b91d6c) fix: use BaseLoader as the SafeEmptyLoader baseclass. - [`5051ea9`](https://github.com/open-webui/open-webui/commit/5051ea91be09a45fde903efe6384f821ab35ad17) fix: fix the init functino of SaveEmptyLoader. ### 📊 Changes **5 files changed** (+73 additions, -0 deletions) <details> <summary>View changed files</summary> 📝 `backend/open_webui/config.py` (+6 -0) 📝 `backend/open_webui/main.py` (+4 -0) 📝 `backend/open_webui/retrieval/web/utils.py` (+36 -0) 📝 `backend/open_webui/routers/retrieval.py` (+7 -0) 📝 `src/lib/components/admin/Settings/WebSearch.svelte` (+20 -0) </details> ### 📄 Description feat: Add parameter `BYPASS_WEB_LOADING_FOR_WEB_SEARCH` to skip web page loading in search, add empty web loader. # Pull Request Checklist **Before submitting, make sure you've checked the following:** - [x] **Target branch:** Please verify that the pull request targets the `dev` branch. - [x] **Description:** Provide a concise description of the changes made in this pull request. - [x] **Changelog:** Ensure a changelog entry following the format of [Keep a Changelog](https://keepachangelog.com/) is added at the bottom of the PR description. - [ ] **Documentation:** Have you updated relevant documentation [Open WebUI Docs](https://github.com/open-webui/docs), or other documentation sources? - [ ] **Dependencies:** Are there any new dependencies? Have you updated the dependency versions in the documentation? - [x] **Testing:** Have you written and run sufficient tests to validate the changes? - [x] **Code review:** Have you performed a self-review of your code, addressing any coding standard issues and ensuring adherence to the project's coding standards? - [x] **Prefix:** To clearly categorize this pull request, prefix the pull request title using one of the following: - **BREAKING CHANGE**: Significant changes that may affect compatibility - **build**: Changes that affect the build system or external dependencies - **ci**: Changes to our continuous integration processes or workflows - **chore**: Refactor, cleanup, or other non-functional code changes - **docs**: Documentation update or addition - **feat**: Introduces a new feature or enhancement to the codebase - **fix**: Bug fix or error correction - **i18n**: Internationalization or localization changes - **perf**: Performance improvement - **refactor**: Code restructuring for better maintainability, readability, or scalability - **style**: Changes that do not affect the meaning of the code (white space, formatting, missing semi-colons, etc.) - **test**: Adding missing tests or correcting existing tests - **WIP**: Work in progress, a temporary label for incomplete or ongoing work # Changelog Entry ### Description This PR implements a parameter to control web page loading functionality during online searches. The goal is to accelerate web search by skipping the actual page loading process in scenarios where it can become a bottleneck. ### Added - Added a parameter BYPASS_WEB_LOADING_FOR_WEB_SEARCH to toggle web page loading during web searches - Implemented SafeEmptyLoader to initialize Document objects directly with search engine snippets ### Changed - Modified the search process to optionally bypass web page loading and use search engine snippets directly as page content ### Performance - Improved search performance by reducing network requests and processing time when full page content isn't necessary --- ### Additional Information - This implementation is inspired by Perplexica's approach: https://github.com/ItzCrazyKns/Perplexica/blob/master/src/lib/search/metaSearchAgent.ts#L213 - The feature addresses performance issues in web searches where page loading can significantly slow down the process - No new dependencies were added for this implementation ### Screenshots or Videos ![image](https://github.com/user-attachments/assets/cb30c4b0-dad0-4581-8993-621d5639c840) ![image](https://github.com/user-attachments/assets/1ecc9067-fced-4d27-84a2-1ad9a3c7d1da) --- <sub>🔄 This issue represents a GitHub Pull Request. It cannot be merged through Gitea due to API limitations.</sub>
GiteaMirror added the pull-request label 2026-04-20 04:35:13 -05:00
Sign in to join this conversation.
1 Participants
Notifications
Due Date
No due date set.
Dependencies

No dependencies set.

Reference: github-starred/open-webui#23033