mirror of
https://github.com/open-webui/open-webui.git
synced 2026-03-22 06:02:06 -05:00
[PR #1292] [MERGED] Add htm/html support for RAG documents #7424
📋 Pull Request Information
Original PR: https://github.com/open-webui/open-webui/pull/1292
Author: @ddanat-smm
Created: 3/25/2024
Status: ✅ Merged
Merged: 3/26/2024
Merged by: @tjbck
Base: dev ← Head: dev
📝 Commits (5)
784a6ec include html langchain loader for RAG
77f4ffd add htm/html to supported extensions in ui
c91a5d8 switch to using BeautifulSoup HTML loader so title is also captured
6307adf feat: better error handling
3688955 fix: encoding issue
📊 Changes
3 files changed (+64 additions, -43 deletions)
📝 backend/apps/rag/main.py (+59 -43)
📝 backend/constants.py (+2 -0)
📝 src/lib/constants.ts (+3 -0)
📄 Description
hey folks, here's a quick and dirty PR for HTML document support in RAG documents. If there's anything I missed, just let me know :)
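For readers without the diff at hand, here is a rough sketch of the idea, not the code from this PR. It uses only the Python standard library as a stand-in for the BeautifulSoup-based loader named in the commits. Every name in it (`HTML_EXTENSIONS`, `is_html_file`, `load_html`) is hypothetical, and decoding with `errors="replace"` is an assumption about how one might sidestep encoding problems, not a description of the actual fix.

```python
from html.parser import HTMLParser
from pathlib import Path

# Hypothetical extension set; the constants actually added by the PR are not shown here.
HTML_EXTENSIONS = {"htm", "html"}


class _TitleAndTextParser(HTMLParser):
    """Collects the <title> text and the visible text of an HTML document."""

    def __init__(self):
        super().__init__()
        self.title = ""
        self.text_parts = []
        self._in_title = False
        self._skip_depth = 0  # > 0 while inside <script> or <style>

    def handle_starttag(self, tag, attrs):
        if tag == "title":
            self._in_title = True
        elif tag in ("script", "style"):
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag == "title":
            self._in_title = False
        elif tag in ("script", "style") and self._skip_depth:
            self._skip_depth -= 1

    def handle_data(self, data):
        if self._in_title:
            self.title += data
        elif not self._skip_depth and data.strip():
            self.text_parts.append(data.strip())


def is_html_file(filename):
    """Return True if the file extension is htm or html (case-insensitive)."""
    return Path(filename).suffix.lower().lstrip(".") in HTML_EXTENSIONS


def load_html(raw, encoding="utf-8"):
    """Decode raw bytes leniently (an assumption) and return title plus text."""
    parser = _TitleAndTextParser()
    parser.feed(raw.decode(encoding, errors="replace"))
    parser.close()
    return {"title": parser.title.strip(), "text": "\n".join(parser.text_parts)}
```

A caller would check `is_html_file(name)` before dispatching to `load_html(data)`, then keep the returned title as document metadata alongside the extracted text.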
Pull Request Checklist
Description
Add htm/html support for RAG documents
Changelog Entry
Added
Fixed
n/a
Changed
Removed
n/a
🔄 This issue represents a GitHub Pull Request. It cannot be merged through Gitea due to API limitations.