[GH-ISSUE #5164] PDF Extract Images (OCR) Fails with 'Unsupported Filter' on Scanned Documents #29408

New Issue

GiteaMirror · 2026-04-25T03:49:53-05:00

GiteaMirror commented

2026-04-25 03:49:53 -05:00

Originally created by @Genai-labs on GitHub (Sep 5, 2024).
Original GitHub issue: https://github.com/open-webui/open-webui/issues/5164

Is your feature request related to a problem? Please describe.

In the Admin Dashboard, under the Settings > Document section, when using the "PDF Extract Images (OCR)" option, the process fails for scanned documents and returns the error message: "Unsupported filter". This issue specifically occurs with documents that have been printed and then scanned back into PDF format.

Describe alternatives you've considered

It could be beneficial to have an extraction method that supports this type of document.

Additional context

version : 3.18

Originally created by @Genai-labs on GitHub (Sep 5, 2024). Original GitHub issue: https://github.com/open-webui/open-webui/issues/5164 **Is your feature request related to a problem? Please describe.** In the Admin Dashboard, under the Settings > Document section, when using the "PDF Extract Images (OCR)" option, the process fails for scanned documents and returns the error message: "Unsupported filter". This issue specifically occurs with documents that have been printed and then scanned back into PDF format. **Describe alternatives you've considered** It could be beneficial to have an extraction method that supports this type of document. **Additional context** version : 3.18 ![image](https://github.com/user-attachments/assets/2453f667-f9c3-49ef-ba4d-95fe3076b178)

GiteaMirror closed this issue

2026-04-25 03:49:54 -05:00

Sign in to join this conversation.

Branches Tags

1 Participants

Notifications

Due Date

No due date set.

Dependencies

No dependencies set.

Reference: github-starred/open-webui#29408