mirror of
https://github.com/open-webui/open-webui.git
synced 2026-05-07 19:38:46 -05:00
issue: Tika is unable to extract text from .rtf when it's attached from Windows browser #6606
Reference in New Issue
Block a user
Delete Branch "%!s()"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
Originally created by @mkhludnev on GitHub (Oct 7, 2025).
Check Existing Issues
Installation Method
Git Clone
Open WebUI Version
0.6.26
Ollama Version (if applicable)
No response
Operating System
Windows
Browser (if applicable)
Chrome
Confirmation
README.md.Expected Behavior
Tika loader extracts text from rtf for browser running in Windows.
Actual Behavior
For Tika loader (at least) attaching .rtf file from Windows (Browser OS) leads to submitting wrong
Content-Type: application/ms-word, which is passed to Tika, and causes an extraction error.Content-Type: application/rtfor use a browser in Linux it works fine. It's known Windows issue.Steps to Reproduce
Enable Tika for content extraction.
Attach .rtf from Windows browser.
Logs & Screenshots
Additional Information
No response
@mkhludnev commented on GitHub (Oct 7, 2025):
Quickcheck: it works if we just don't pass Content-type to tika/text, since it might be incorrect.
@tjbck commented on GitHub (Oct 28, 2025):
Open to PRs!