mirror of
https://github.com/open-webui/open-webui.git
synced 2026-05-06 19:08:59 -05:00
[GH-ISSUE #12295] issue: PDFs Blank after uploading (but text is showing) #55205
Reference in New Issue
Block a user
Delete Branch "%!s()"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
Originally created by @wm-ek on GitHub (Apr 1, 2025).
Original GitHub issue: https://github.com/open-webui/open-webui/issues/12295
Check Existing Issues
Installation Method
Git Clone
Open WebUI Version
v0.6.0(and v0.5.20)
Ollama Version (if applicable)
No response
Operating System
Debian 12
Browser (if applicable)
Chrome Version 134.0.6998.165, Firefox 136.0.3
Confirmation
README.md.Expected Behavior
PDF Content showing up
Actual Behavior
PDF Content is blank
Steps to Reproduce
Or include a pdf in a prompt and click on the source of the pdf
Logs & Screenshots
Docker Container Logs
2025-04-01T14:33:12.779801414Z 2025-04-01 14:33:12.779 | WARNING | python_multipart.multipart:_internal_write:1401 - Skipping data after last boundary - {} 2025-04-01T14:33:12.781440580Z 2025-04-01 14:33:12.781 | INFO | open_webui.routers.files:upload_file:42 - file.content_type: application/pdf - {} 2025-04-01T14:33:12.841663820Z 2025-04-01 14:33:12.841 | INFO | open_webui.routers.retrieval:save_docs_to_vector_db:782 - save_docs_to_vector_db: document colors.pdf file-b56ccccf-6102-4123-8040-0017e00c0df3 - {} 2025-04-01T14:33:12.844974591Z 2025-04-01 14:33:12.844 | INFO | open_webui.routers.retrieval:save_docs_to_vector_db:865 - adding to collection file-b56ccccf-6102-4123-8040-0017e00c0df3 - {} 2025-04-01T14:33:13.310931796Z 2025-04-01 14:33:13.310 | INFO | uvicorn.protocols.http.httptools_impl:send:476 - 162.158.148.232:0 - "POST /api/v1/files/ HTTP/1.1" 200 - {} 2025-04-01T14:33:25.402515921Z 2025-04-01 14:33:25.402 | INFO | uvicorn.protocols.http.httptools_impl:send:476 - 162.158.148.232:0 - "GET /api/v1/files/b56ccccf-6102-4123-8040-0017e00c0df3/content HTTP/1.1" 200 - {}Additional Information
apache tika 3.1.0.0-full as ocr-server-
not with full context mode
-openwebui v0.5.20
-the pdf does not exceed the upload limit
-it happens with both small, single page pdfs and large multi-page pdfs
it happens
@wm-ek commented on GitHub (Apr 1, 2025):
we used a cloudflare worker to inject a customer feedback tool; deactivating the worker resolved the problem. we will investigate the worker.