[GH-ISSUE #6510] Scanned PDF Not Uploading or Extracting Text in App – HTTPException 400: "The content provided is empty" #14391

Closed
opened 2026-04-19 20:46:11 -05:00 by GiteaMirror · 0 comments
Owner

Originally created by @medrpa on GitHub (Oct 28, 2024).
Original GitHub issue: https://github.com/open-webui/open-webui/issues/6510

I'm encountering an issue where scanned PDFs are not uploading correctly, nor is the text being extracted to create searchable PDFs in my application. I am running the application directly on Windows, without Docker. Although I am using the Tika server for text extraction, it is not functioning as expected. The document uploads successfully to the Tika server, but the text is not being extracted.

Thank you in advance please help me in. I will be thank ful to you.

Originally created by @medrpa on GitHub (Oct 28, 2024). Original GitHub issue: https://github.com/open-webui/open-webui/issues/6510 I'm encountering an issue where scanned PDFs are not uploading correctly, nor is the text being extracted to create searchable PDFs in my application. I am running the application directly on Windows, without Docker. Although I am using the Tika server for text extraction, it is not functioning as expected. The document uploads successfully to the Tika server, but the text is not being extracted. Thank you in advance please help me in. I will be thank ful to you.
Sign in to join this conversation.
1 Participants
Notifications
Due Date
No due date set.
Dependencies

No dependencies set.

Reference: github-starred/open-webui#14391