[GH-ISSUE #5164] PDF Extract Images (OCR) Fails with 'Unsupported Filter' on Scanned Documents #29408

Closed
opened 2026-04-25 03:49:53 -05:00 by GiteaMirror · 0 comments
Owner

Originally created by @Genai-labs on GitHub (Sep 5, 2024).
Original GitHub issue: https://github.com/open-webui/open-webui/issues/5164

Is your feature request related to a problem? Please describe.

In the Admin Dashboard, under the Settings > Document section, when using the "PDF Extract Images (OCR)" option, the process fails for scanned documents and returns the error message: "Unsupported filter". This issue specifically occurs with documents that have been printed and then scanned back into PDF format.

Describe alternatives you've considered

It could be beneficial to have an extraction method that supports this type of document.

Additional context

version : 3.18

image

Originally created by @Genai-labs on GitHub (Sep 5, 2024). Original GitHub issue: https://github.com/open-webui/open-webui/issues/5164 **Is your feature request related to a problem? Please describe.** In the Admin Dashboard, under the Settings > Document section, when using the "PDF Extract Images (OCR)" option, the process fails for scanned documents and returns the error message: "Unsupported filter". This issue specifically occurs with documents that have been printed and then scanned back into PDF format. **Describe alternatives you've considered** It could be beneficial to have an extraction method that supports this type of document. **Additional context** version : 3.18 ![image](https://github.com/user-attachments/assets/2453f667-f9c3-49ef-ba4d-95fe3076b178)
Sign in to join this conversation.
1 Participants
Notifications
Due Date
No due date set.
Dependencies

No dependencies set.

Reference: github-starred/open-webui#29408