mirror of
https://github.com/open-webui/open-webui.git
synced 2026-05-06 10:58:17 -05:00
[PR #22321] feat: table-aware RAG ingestion for CSV, TSV, and Excel files #49660
Reference in New Issue
Block a user
Delete Branch "%!s()"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
📋 Pull Request Information
Original PR: https://github.com/open-webui/open-webui/pull/22321
Author: @salim4n
Created: 3/6/2026
Status: 🔄 Open
Base:
dev← Head:feat/table-aware-rag-ingestion📝 Commits (10+)
fe6783cMerge pull request #19030 from open-webui/devfc05e0aMerge pull request #19405 from open-webui/deve3faec6Merge pull request #19416 from open-webui/dev9899293Merge pull request #19448 from open-webui/dev140605eMerge pull request #19462 from open-webui/dev6f1486fMerge pull request #19466 from open-webui/devd95f533Merge pull request #19729 from open-webui/deva7271530.6.43 (#20093)6adde20Merge pull request #20394 from open-webui/devf9b0534Merge pull request #20522 from open-webui/dev📊 Changes
7 files changed (+626 additions, -54 deletions)
View changed files
📝
backend/open_webui/config.py(+6 -0)📝
backend/open_webui/retrieval/loaders/main.py(+35 -5)➕
backend/open_webui/retrieval/loaders/table.py(+213 -0)📝
backend/open_webui/routers/retrieval.py(+70 -49)➕
backend/open_webui/test/retrieval/__init__.py(+0 -0)➕
backend/open_webui/test/retrieval/loaders/__init__.py(+0 -0)➕
backend/open_webui/test/retrieval/loaders/test_table.py(+302 -0)📄 Description
Summary
Closes discussion #22319
Replace
CSVLoaderandUnstructuredExcelLoaderwith custom table-aware loaders that preserve row integrity for better RAG retrieval on tabular data.TABLE_ROWS_PER_CHUNKenv var / config (default: 1)Changed files
backend/open_webui/retrieval/loaders/table.py— newTableAwareCSVLoader,TableAwareExcelLoaderbackend/open_webui/retrieval/loaders/main.py— route CSV/TSV/Excel to new loadersbackend/open_webui/routers/retrieval.py— bypass text splitting forfile_type="table"docsbackend/open_webui/config.py— addTABLE_ROWS_PER_CHUNKTesting
pytest)Contributor License Agreement
I have read and agree to the Contributor License Agreement.
🤖 Generated with Claude Code
Co-Authored-By: Claude Opus 4.6 noreply@anthropic.com
🔄 This issue represents a GitHub Pull Request. It cannot be merged through Gitea due to API limitations.