mirror of
https://github.com/open-webui/open-webui.git
synced 2026-05-07 19:38:46 -05:00
[GH-ISSUE #22158] feat: dynamic chunk size and overlap values #35174
Originally created by @rawaha-e on GitHub (Mar 2, 2026).
Original GitHub issue: https://github.com/open-webui/open-webui/issues/22158
Problem Description
OpenWebUI currently only supports fixed CHUNK_SIZE and CHUNK_OVERLAP values when processing documents. This works for standard documents but is problematic in practice:
Desired Solution you'd like
Implement dynamic chunk sizing based on document length or token count, as follows:
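A minimal sketch of what length-based sizing could look like with plain character splitting. Everything here is an assumption for illustration: the tier thresholds and the names `pick_chunk_params` and `split_text` are hypothetical and are not taken from this issue or from Open WebUI's code.

```python
def pick_chunk_params(doc_length: int) -> tuple[int, int]:
    """Return (chunk_size, chunk_overlap) in characters for a document length."""
    # Hypothetical tiers: (max document length, chunk size, overlap).
    tiers = [
        (5_000, 500, 50),
        (50_000, 1_000, 100),
    ]
    for max_len, size, overlap in tiers:
        if doc_length <= max_len:
            return size, overlap
    # Hypothetical fallback for very long documents.
    return 2_000, 200


def split_text(text: str) -> list[str]:
    """Character-based splitting whose size and overlap depend on len(text)."""
    if not text:
        return []
    size, overlap = pick_chunk_params(len(text))
    step = size - overlap  # positive for every tier defined above
    # Stop once the remaining tail is already covered by the previous chunk.
    stop = max(len(text) - overlap, 1)
    return [text[i:i + size] for i in range(0, stop, step)]


if __name__ == "__main__":
    chunks = split_text("a" * 1200)
    print(len(chunks), [len(c) for c in chunks])
```

A token-based variant would follow the same shape, with the length measured by a tokenizer instead of `len(text)`.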
Alternatives Considered
No response
Additional Context
No response
@Classic298 commented on GitHub (Mar 2, 2026):
Why not use Markdown header splitting with minimum chunk size merging? That's exactly what you're describing.
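For readers unfamiliar with the idea, header splitting with minimum-size merging can be sketched roughly as below. This is an illustrative approximation, not Open WebUI's implementation; the function name `split_by_headers` and the default `min_chunk_size` are hypothetical.

```python
import re


def split_by_headers(text: str, min_chunk_size: int = 200) -> list[str]:
    """Split on Markdown headers, merging sections until each chunk is large enough."""
    # Zero-width split before every line that starts with 1-6 '#' and a space.
    sections = [
        s for s in re.split(r"^(?=#{1,6} )", text, flags=re.MULTILINE) if s.strip()
    ]
    merged: list[str] = []
    buffer = ""
    for section in sections:
        buffer += section
        if len(buffer) >= min_chunk_size:
            merged.append(buffer)
            buffer = ""
    if buffer:
        # A small trailing remainder is attached to the previous chunk, if any.
        if merged:
            merged[-1] += buffer
        else:
            merged.append(buffer)
    return merged


if __name__ == "__main__":
    doc = "# A\n" + "a" * 300 + "\n# B\nb\n# C\n" + "c" * 300 + "\n"
    for chunk in split_by_headers(doc):
        print(repr(chunk[:12]), len(chunk))
```

Note that a document without any headers comes back as a single chunk, regardless of its length.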
@rawaha-e commented on GitHub (Mar 2, 2026):
@Classic298 Some documents have no Markdown headers, for example when I want to process an unstructured table. My feature request makes more sense when using character splitting.
@Classic298 commented on GitHub (Mar 2, 2026):
Aha, in this case this should be discussed first. This would be a massive feature.