mirror of
https://github.com/open-webui/open-webui.git
synced 2026-05-07 03:18:23 -05:00
issue: Multimodal models cannot recognize larger-sized images #6107
Reference in New Issue
Block a user
Delete Branch "%!s()"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
Originally created by @AXuanCreator on GitHub (Aug 15, 2025).
Check Existing Issues
Installation Method
Docker
Open WebUI Version
v0.6.22
Ollama Version (if applicable)
No response
Operating System
Windows 11
Browser (if applicable)
No response
Confirmation
README.md.Expected Behavior
In multimodal model conversations, large-sized images should be recognized
Actual Behavior
In reality, no content will be output, while smaller-sized images can be recognized normally
Using the image compression in the settings is still ineffective
In the official online inference provided by the model, images can be recognized normally
Tested on
Image size: 2560 * 1440
Steps to Reproduce
copy the image to [dialog]
enter "summarize the image content" and press Enter
Logs & Screenshots
origin image:
Additional Information
No response
@tjbck commented on GitHub (Aug 16, 2025):
Model inference issue.