mirror of
https://github.com/open-webui/open-webui.git
synced 2026-05-07 11:28:35 -05:00
[GH-ISSUE #11479] feat: convert documents to images for vision models #31775
Originally created by @funnycups on GitHub (Mar 10, 2025).
Original GitHub issue: https://github.com/open-webui/open-webui/issues/11479
Check Existing Issues
Problem Description
Currently, Open WebUI always runs OCR on uploaded documents.
However, for models that support vision, passing the document pages as images may work better, especially when a document contains pictures that OCR cannot handle.
Desired Solution you'd like
Provide an option in the document settings that lets Open WebUI convert uploaded documents, such as PDF or DOCX files, to images and pass those images to the model instead of the extracted text when the model supports vision. For models that do not support vision, keep using the existing OCR pipeline.
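To make the request concrete, here is a rough sketch of the routing I have in mind. It is not Open WebUI code: `ModelInfo`, `supports_vision`, `images_enabled`, `render_pages`, and `extract_text` are all hypothetical names, and the page renderer and OCR step are passed in as callables so the sketch does not assume any particular library. The `image_url` parts follow the OpenAI-style chat message format with base64 data URLs, which is also an assumption about how the images would be delivered.

```python
import base64
from dataclasses import dataclass
from typing import Any, Callable, Dict, List


@dataclass
class ModelInfo:
    """Hypothetical model descriptor; not an Open WebUI type."""
    name: str
    supports_vision: bool


def build_document_content(
    doc_bytes: bytes,
    model: ModelInfo,
    render_pages: Callable[[bytes], List[bytes]],  # hypothetical: document -> one PNG per page
    extract_text: Callable[[bytes], str],          # hypothetical: existing OCR / text extraction
    images_enabled: bool = True,                   # hypothetical user-facing option
) -> List[Dict[str, Any]]:
    """Return message content parts for an uploaded document.

    Vision-capable models get one image part per page when the option is on;
    every other case falls back to the extracted text.
    """
    if images_enabled and model.supports_vision:
        parts: List[Dict[str, Any]] = []
        for png in render_pages(doc_bytes):
            encoded = base64.b64encode(png).decode("ascii")
            parts.append(
                {
                    "type": "image_url",
                    "image_url": {"url": f"data:image/png;base64,{encoded}"},
                }
            )
        return parts

    # Non-vision model, or the option is disabled: keep the current behaviour.
    return [{"type": "text", "text": extract_text(doc_bytes)}]
```

With this shape the option only changes what is sent to vision models, so models without vision support would behave exactly as they do today.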
Alternatives Considered
No response
Additional Context
No response