[GH-ISSUE #12950] app: vision models claim to not support images for before downloading and 30s after #55098

Open
opened 2026-04-29 08:20:04 -05:00 by GiteaMirror · 3 comments
Owner

Originally created by @jmorganca on GitHub (Nov 4, 2025).
Original GitHub issue: https://github.com/ollama/ollama/issues/12950

What is the issue?

When adding an image to a non-downloaded vision model, the app claims the model does not support images.

Relevant log output


OS

No response

GPU

No response

CPU

No response

Ollama version

No response

Originally created by @jmorganca on GitHub (Nov 4, 2025). Original GitHub issue: https://github.com/ollama/ollama/issues/12950 ### What is the issue? When adding an image to a non-downloaded vision model, the app claims the model does not support images. ### Relevant log output ```shell ``` ### OS _No response_ ### GPU _No response_ ### CPU _No response_ ### Ollama version _No response_
GiteaMirror added the appbug labels 2026-04-29 08:20:04 -05:00
Author
Owner

@abcbarryn commented on GitHub (Nov 5, 2025):

I had this issue and worked around it by including this text in the system prompt:
You can process multiple images. Check for images. If images are present,
process all. Then process the text prompt.

<!-- gh-comment-id:3494013847 --> @abcbarryn commented on GitHub (Nov 5, 2025): I had this issue and worked around it by including this text in the system prompt: You can process multiple images. Check for images. If images are present, process all. Then process the text prompt.
Author
Owner

@Wheeeeeeeeezer commented on GitHub (Nov 13, 2025):

I had this issue and worked around it by including this text in the system prompt: You can process multiple images. Check for images. If images are present, process all. Then process the text prompt.

worked for me, I used this guide

<!-- gh-comment-id:3524792771 --> @Wheeeeeeeeezer commented on GitHub (Nov 13, 2025): > I had this issue and worked around it by including this text in the system prompt: You can process multiple images. Check for images. If images are present, process all. Then process the text prompt. worked for me, I used [this guide](https://medium.com/@sumudithalanz/unlocking-the-power-of-large-language-models-a-guide-to-customization-with-ollama-6c0da1e756d9)
Author
Owner

@hoyyeva commented on GitHub (Nov 20, 2025):

Hi everyone, I am sorry that you are experiencing this - after some investigation, I believe that there are two bugs causing this issue:

  1. Capabilities are not updated after the download is complete.
  2. /api/show does not support models that have not yet been downloaded.

Bug number 1 is easy to fix (PR #13179 is already open). However, we will need further discussion for bug number 2 since it will require a larger change involving the Ollama JS SDK.

<!-- gh-comment-id:3560080118 --> @hoyyeva commented on GitHub (Nov 20, 2025): Hi everyone, I am sorry that you are experiencing this - after some investigation, I believe that there are two bugs causing this issue: 1. Capabilities are not updated after the download is complete. 2. `/api/show` does not support models that have not yet been downloaded. Bug number 1 is easy to fix (PR #13179 is already open). However, we will need further discussion for bug number 2 since it will require a larger change involving the Ollama JS SDK.
Sign in to join this conversation.
1 Participants
Notifications
Due Date
No due date set.
Dependencies

No dependencies set.

Reference: github-starred/ollama#55098