mirror of
https://github.com/open-webui/open-webui.git
synced 2026-05-08 12:58:11 -05:00
[PR #16736] [CLOSED] **Fix** Hashtag webscrape doc handler #47269
Reference in New Issue
Block a user
Delete Branch "%!s()"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
📋 Pull Request Information
Original PR: https://github.com/open-webui/open-webui/pull/16736
Author: @biebiep
Created: 8/19/2025
Status: ❌ Closed
Base:
dev← Head:webscrape_fix📝 Commits (1)
644c786Fix webscrape doc handler📊 Changes
1 file changed (+1 additions, -1 deletions)
View changed files
📝
backend/open_webui/retrieval/utils.py(+1 -1)📄 Description
Pull Request Checklist
Before submitting, make sure you've checked the following:
devbranch.Changelog Entry
Description
web scraped documents have metadata "type" == "doc". There was no handler for the "doc" type. This patch makes scraped pages equivalent to "text".
Changed
Retrieval/utils.py now also handles "doc" types and treats them the same as "text" types.
Fixed
Webscraping on non-ollama models through # addition in chat
Additional Information
Discussion here: https://github.com/open-webui/open-webui/discussions/6189
Contributor License Agreement
By submitting this pull request, I confirm that I have read and fully agree to the Contributor License Agreement (CLA), and I am providing my contributions under its terms.
🔄 This issue represents a GitHub Pull Request. It cannot be merged through Gitea due to API limitations.