mirror of
https://github.com/open-webui/open-webui.git
synced 2026-05-06 10:58:17 -05:00
[GH-ISSUE #5164] PDF Extract Images (OCR) Fails with 'Unsupported Filter' on Scanned Documents #13880
Originally created by @Genai-labs on GitHub (Sep 5, 2024).
Original GitHub issue: https://github.com/open-webui/open-webui/issues/5164
Is your feature request related to a problem? Please describe.
In the Admin Dashboard, under Settings > Document, enabling the "PDF Extract Images (OCR)" option causes processing to fail for scanned documents with the error message "Unsupported filter". The failure occurs specifically with documents that were printed and then scanned back into PDF format.
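To help narrow this down, here is a minimal diagnostic sketch. It rests on an assumption the report does not confirm: that "Unsupported filter" refers to a PDF stream compression filter (the `/Filter` entry of an image stream), since scanners often emit filters such as `JBIG2Decode` or `CCITTFaxDecode` that some extraction libraries cannot decode. The function name `count_stream_filters` is hypothetical, and the script only scans raw bytes with the standard library, so it will miss filters given as indirect references or hidden by encryption.

```python
import re
from collections import Counter

# Matches "/Filter /Name" as well as "/Filter [/A /B]" in raw PDF bytes.
_FILTER_RE = re.compile(rb"/Filter\s*(\[[^\]]*\]|/[A-Za-z0-9]+)")
# Pulls the individual filter names out of a matched value.
_NAME_RE = re.compile(rb"/([A-Za-z0-9]+)")


def count_stream_filters(pdf_bytes: bytes) -> Counter:
    """Count the stream filter names that appear in a PDF's raw bytes.

    This is a heuristic scan, not a PDF parser: it does not resolve
    indirect references and cannot read encrypted files.
    """
    counts = Counter()
    for match in _FILTER_RE.finditer(pdf_bytes):
        for name in _NAME_RE.findall(match.group(1)):
            counts[name.decode("ascii")] += 1
    return counts


# Example usage (the path "scan.pdf" is a placeholder):
#     with open("scan.pdf", "rb") as f:
#         print(count_stream_filters(f.read()))
```

If a failing scan reports filters that a working PDF does not, attaching that list to the issue would make the problem easier to reproduce.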
Describe alternatives you've considered
An extraction method that supports this type of document would be beneficial.
Additional context
Version: 3.18