Improving Multilingual Support and Image Processing with Tika Engine in Open WebUI: Challenges and Recommendations #2798

New Issue

GiteaMirror · 2025-11-11T15:14:40-06:00

GiteaMirror commented

2025-11-11 15:14:40 -06:00

Originally created by @ahm631 on GitHub (Nov 23, 2024).

I would like to express my sincere appreciation for your outstanding efforts in developing Open WebUI. I have been utilizing Tika as a file-reading engine, and it performs excellently with English text. However, I have encountered some challenges when working with other languages, such as Arabic.

OCR Challenges with Arabic Language

After installing the tesseract-ocr-ara package on Tika, I did not observe any effect within the Open WebUI interface. Through testing with Postman, I discovered that it is necessary to specify the OCR language of the image by passing it via the X-Tika-OCRLanguage parameter. Currently, Open WebUI does not provide an option to set this language, and it seems that the system defaults to sending the primary language as English (eng).

I attempted to modify Tika by replacing tesseract-ocr-eng with tesseract-ocr-ara, which resulted in successful operation within Open WebUI. However, when the file contains English text, the system is unable to read it. Additionally, there is an issue when multiple languages are present in the file.

Therefore, I suggest implementing a feature in Open WebUI that allows users to specify the languages present in files or images before sending them to the Tika server, ensuring more accurate results.

Processing Standalone Images

When uploading an image directly to Open WebUI and querying the language model about it, the system is unable to read the content. In contrast, if the image is embedded within a PDF file, it is processed successfully with OCR, particularly for English text.

I recommend enhancing the support for processing standalone images in Open WebUI, enabling the system to apply OCR directly to images without the need to embed them in PDF files.

Thank you once again for your dedication, and I look forward to future improvements that will enhance multilingual support and image processing capabilities in Open WebUI.

Originally created by @ahm631 on GitHub (Nov 23, 2024). I would like to express my sincere appreciation for your outstanding efforts in developing Open WebUI. I have been utilizing Tika as a file-reading engine, and it performs excellently with English text. However, I have encountered some challenges when working with other languages, such as Arabic. 1. OCR Challenges with Arabic Language After installing the tesseract-ocr-ara package on Tika, I did not observe any effect within the Open WebUI interface. Through testing with Postman, I discovered that it is necessary to specify the OCR language of the image by passing it via the X-Tika-OCRLanguage parameter. Currently, Open WebUI does not provide an option to set this language, and it seems that the system defaults to sending the primary language as English (eng). I attempted to modify Tika by replacing tesseract-ocr-eng with tesseract-ocr-ara, which resulted in successful operation within Open WebUI. However, when the file contains English text, the system is unable to read it. Additionally, there is an issue when multiple languages are present in the file. Therefore, I suggest implementing a feature in Open WebUI that allows users to specify the languages present in files or images before sending them to the Tika server, ensuring more accurate results. 2. Processing Standalone Images When uploading an image directly to Open WebUI and querying the language model about it, the system is unable to read the content. In contrast, if the image is embedded within a PDF file, it is processed successfully with OCR, particularly for English text. I recommend enhancing the support for processing standalone images in Open WebUI, enabling the system to apply OCR directly to images without the need to embed them in PDF files. Thank you once again for your dedication, and I look forward to future improvements that will enhance multilingual support and image processing capabilities in Open WebUI.

GiteaMirror closed this issue