mirror of
https://github.com/open-webui/open-webui.git
synced 2026-06-03 07:24:59 -05:00
[PR #19086] refactor+feat+breaking: Make markdown header splitting a configurable preprocessing step #11882
Reference in New Issue
Block a user
Delete Branch "%!s()"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
📋 Pull Request Information
Original PR: https://github.com/open-webui/open-webui/pull/19086
Author: @Classic298
Created: 11/10/2025
Status: 🔄 Open
Base:
dev← Head:markdown-chunking-refac📝 Commits (4)
7ee9b00Implement message cleaning before API call070a6c6Filter out empty assistant messages before cleaningc88147frefac+feat+breaking: Make markdown header splitting a configurable preprocessing step (#27)192d81dUpdate Chat.svelte📊 Changes
5 files changed (+62 additions, -50 deletions)
View changed files
📝
backend/open_webui/config.py(+6 -0)📝
backend/open_webui/main.py(+2 -0)📝
backend/open_webui/routers/retrieval.py(+35 -42)📝
src/lib/components/admin/Settings/Documents.svelte(+14 -1)📝
src/lib/components/notes/NoteEditor/Chat.svelte(+5 -7)📄 Description
devbranch. Not targeting thedevbranch will lead to immediate closure of the PR.Changelog Entry
Description
Key changes:
Benefits:
Why This Architecture?
The two-stage approach provides several benefits:
Related: #18715
Changed
Removed
Breaking Changes
BREAKING CHANGE: The TEXT_SPLITTER config value "markdown_header" is no longer supported. Users who previously selected "Markdown (Header)" from the dropdown will need to:
This provides equivalent functionality with better flexibility, but requires manual reconfiguration for existing users using the markdown header option.
Screenshots or Videos
Contributor License Agreement
By submitting this pull request, I confirm that I have read and fully agree to the Contributor License Agreement (CLA), and I am providing my contributions under its terms.
🔄 This issue represents a GitHub Pull Request. It cannot be merged through Gitea due to API limitations.