mirror of
https://github.com/open-webui/open-webui.git
synced 2026-05-06 02:48:13 -05:00
feat Unsupported Filter: /JBIG2Decode in PDF Files #774
Reference in New Issue
Block a user
Delete Branch "%!s()"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
Originally created by @Yanyutin753 on GitHub (May 1, 2024).
Is your feature request related to a problem? Please describe.
I have encountered a problem when attempting to load PDF files that contain images which are using the /JBIG2Decode filter. JBIG2 is a compression method for monochrome (black and white) images which is especially important when dealing with scanned documents that require high compression rates.
Specifically, the error that is raised is as follows:
Copy Code
NotImplementedError: unsupported filter /JBIG2Decode
This problem stems from the pypdf library not currently supporting the /JBIG2Decode filter. When it tries to decode the image data that uses this filter, it throws the aforementioned error.
Additional context
This issue hampers the processing and manipulation of many PDFs that utilize this common compression technique. It would be greatly appreciated if support for /JBIG2Decode filter could be considered for introduction in a future update.
In the meantime, if you could provide any guidance or workarounds to properly load these types of PDFs, it would be very helpful.
Thank you for your time and consideration.