[PR #18765] [MERGED] fix: Don't missguide Tika with mime-type #40542

Closed
opened 2026-04-25 13:00:46 -05:00 by GiteaMirror · 0 comments
Owner

📋 Pull Request Information

Original PR: https://github.com/open-webui/open-webui/pull/18765
Author: @mkhludnev
Created: 10/31/2025
Status: Merged
Merged: 10/31/2025
Merged by: @tjbck

Base: devHead: patch-2


📝 Commits (1)

  • 24aeec9 Don't missguide Tika with mime-type

📊 Changes

1 file changed (+0 additions, -1 deletions)

View changed files

📝 backend/open_webui/retrieval/loaders/main.py (+0 -1)

📄 Description

Fix #18683

  • Tika is smart enough to detect content type.
  • Windows browsers just misguides Tika providing application/ms-word for .rtf files.

Pull Request Checklist

Before submitting, make sure you've checked the following:

  • [v] Target branch: Verify that the pull request targets the dev branch. Not targeting the dev branch may lead to immediate closure of the PR.
  • [v] Description: Provide a concise description of the changes made in this pull request.
  • [v] Changelog: Ensure a changelog entry following the format of Keep a Changelog is added at the bottom of the PR description.
  • [-] Documentation: If necessary, update relevant documentation Open WebUI Docs like environment variables, the tutorials, or other documentation sources.
  • [-] Dependencies: Are there any new dependencies? Have you updated the dependency versions in the documentation?
  • [-] Testing: Perform manual tests to verify the implemented fix/feature works as intended AND does not break any other functionality. Take this as an opportunity to make screenshots of the feature/fix and include it in the PR description.
  • [v] Agentic AI Code:: Confirm this Pull Request is not written by any AI Agent or has at least gone through additional human review and manual testing. If any AI Agent is the co-author of this PR, it may lead to immediate closure of the PR.
  • [v] Code review: Have you performed a self-review of your code, addressing any coding standard issues and ensuring adherence to the project's coding standards?
  • [v] Title Prefix: To clearly categorize this pull request, prefix the pull request title using one of the following:

Changelog Entry

Description

  • Don't missguide Tika with potentially wrong mime-type, eg Windows browsers erroneously pass application/ms-word for .rtf

Fixed

  • Don't pass mime-type to Tika.

Additional Information

Screenshots or Videos

Contributor License Agreement

By submitting this pull request, I confirm that I have read and fully agree to the Contributor License Agreement (CLA), and I am providing my contributions under its terms.


🔄 This issue represents a GitHub Pull Request. It cannot be merged through Gitea due to API limitations.

## 📋 Pull Request Information **Original PR:** https://github.com/open-webui/open-webui/pull/18765 **Author:** [@mkhludnev](https://github.com/mkhludnev) **Created:** 10/31/2025 **Status:** ✅ Merged **Merged:** 10/31/2025 **Merged by:** [@tjbck](https://github.com/tjbck) **Base:** `dev` ← **Head:** `patch-2` --- ### 📝 Commits (1) - [`24aeec9`](https://github.com/open-webui/open-webui/commit/24aeec9120cffb1969b0d4e374cfa1765749c9f8) Don't missguide Tika with mime-type ### 📊 Changes **1 file changed** (+0 additions, -1 deletions) <details> <summary>View changed files</summary> 📝 `backend/open_webui/retrieval/loaders/main.py` (+0 -1) </details> ### 📄 Description Fix #18683 - Tika is smart enough to detect content type. - Windows browsers just misguides Tika providing `application/ms-word` for .rtf files. # Pull Request Checklist **Before submitting, make sure you've checked the following:** - [v] **Target branch:** Verify that the pull request targets the `dev` branch. Not targeting the `dev` branch may lead to immediate closure of the PR. - [v] **Description:** Provide a concise description of the changes made in this pull request. - [v] **Changelog:** Ensure a changelog entry following the format of [Keep a Changelog](https://keepachangelog.com/) is added at the bottom of the PR description. - [-] **Documentation:** If necessary, update relevant documentation [Open WebUI Docs](https://github.com/open-webui/docs) like environment variables, the tutorials, or other documentation sources. - [-] **Dependencies:** Are there any new dependencies? Have you updated the dependency versions in the documentation? - [-] **Testing:** Perform manual tests to verify the implemented fix/feature works as intended AND does not break any other functionality. Take this as an opportunity to make screenshots of the feature/fix and include it in the PR description. - [v] **Agentic AI Code:**: Confirm this Pull Request is **not written by any AI Agent** or has at least gone through additional human review **and** manual testing. If any AI Agent is the co-author of this PR, it may lead to immediate closure of the PR. - [v] **Code review:** Have you performed a self-review of your code, addressing any coding standard issues and ensuring adherence to the project's coding standards? - [v] **Title Prefix:** To clearly categorize this pull request, prefix the pull request title using one of the following: # Changelog Entry ### Description - Don't missguide Tika with potentially wrong mime-type, eg Windows browsers erroneously pass `application/ms-word` for `.rtf` ### Fixed - Don't pass mime-type to Tika. --- ### Additional Information ### Screenshots or Videos ### Contributor License Agreement By submitting this pull request, I confirm that I have read and fully agree to the [Contributor License Agreement (CLA)](https://github.com/open-webui/open-webui/blob/main/CONTRIBUTOR_LICENSE_AGREEMENT), and I am providing my contributions under its terms. --- <sub>🔄 This issue represents a GitHub Pull Request. It cannot be merged through Gitea due to API limitations.</sub>
GiteaMirror added the pull-request label 2026-04-25 13:00:46 -05:00
Sign in to join this conversation.
1 Participants
Notifications
Due Date
No due date set.
Dependencies

No dependencies set.

Reference: github-starred/open-webui#40542