mirror of
https://github.com/open-webui/open-webui.git
synced 2026-05-06 10:58:17 -05:00
[PR #17938] [CLOSED] feat+FIX: Add website/webpage support to knowledge bases #24608
Reference in New Issue
Block a user
Delete Branch "%!s()"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
📋 Pull Request Information
Original PR: https://github.com/open-webui/open-webui/pull/17938
Author: @Classic298
Created: 9/30/2025
Status: ❌ Closed
Base:
dev← Head:add-website-knowledgebase📝 Commits (7)
b08fe86Add website to knowledge base (#21)0ff6cd1Update KnowledgeBase.svelte5cfc35dUpdate retrieval.pye3e5135Update retrieval.py199c0d2Merge branch 'dev' into add-website-knowledgebasec5d833dMerge branch 'dev' into add-website-knowledgebase5e40b67remove whitespace📊 Changes
3 files changed (+131 additions, -12 deletions)
View changed files
📝
backend/open_webui/routers/retrieval.py(+65 -12)📝
src/lib/components/workspace/Knowledge/KnowledgeBase.svelte(+46 -0)📝
src/lib/components/workspace/Knowledge/KnowledgeBase/AddContentMenu.svelte(+20 -0)📄 Description
Pull Request Checklist
Before submitting, make sure you've checked the following:
devbranch.Changelog Entry
Description
Added
uploadWebHandlerfunction to handle web scraping and file creation for knowledge basesprocessWebresponse to enable proper knowledge base associationChanged
process_webendpoint to create persistent file records with IDssave_docs_to_vector_dbto skip duplicate content check whenadd=True(adding to existing knowledge bases)process_weblogic to differentiate between chat context (auto-save to vector DB) and knowledge base context (defer toprocess_file)addFileHandlerto return boolean success statusFixed
"I upload files in the knowledgebase, even though I upload a new file with new content that is never uploaded, there is also this duplicate error, only one file can be uploaded to this knowledgebase."
Testing conducted:
5-> problem encountered was: **if you added a specific website in a chat - then created a new chat, the exact same website couldn't be added again, resulting in duplicate content warning just like the user reported in issue 17088 hereScreenshots
Contributor License Agreement
By submitting this pull request, I confirm that I have read and fully agree to the [Contributor License Agreement (CLA)](https://github.com/open-webui/open-webui/blob/main/CONTRIBUTOR_LICENSE_AGREEMENT), and I am providing my contributions under its terms.
🔄 This issue represents a GitHub Pull Request. It cannot be merged through Gitea due to API limitations.