[GH-ISSUE #10312] The image recognition effect is poor: gemma3 #6771

Open
opened 2026-04-12 18:31:59 -05:00 by GiteaMirror · 4 comments

Originally created by @Jaleel-zhu on GitHub (Apr 17, 2025).
Original GitHub issue: https://github.com/ollama/ollama/issues/10312

What is the issue?

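gemma3 is unable to correctly identify the contents of the attached image.
Image: https://github.com/user-attachments/assets/ae1ad78b-6e59-4ed8-9488-9ecf6055828c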

Relevant log output


OS

Linux, Docker

GPU

Nvidia

CPU

Intel

Ollama version

0.6.5

GiteaMirror added the bug label 2026-04-12 18:31:59 -05:00

@quaggalinux commented on GitHub (Apr 17, 2025):

Image: https://github.com/user-attachments/assets/d89b7efa-7b6a-403a-b465-19c4d22933dd

OS: ubuntu 24.04.2 server
GPU: Nvidia
CPU: Intel
RAM: 256GB
NVRAM: 12GB
Ollama version: 0.6.5
tool: Page Assist Extension in chrome

gemma3:27b

It seems to work well in my environment.


@Li-Hikinghat commented on GitHub (Apr 20, 2025):

Your model does not have enough parameters; switching to a model with more parameters should fix it.


@Jaleel-zhu commented on GitHub (Apr 23, 2025):

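> Image: https://github.com/user-attachments/assets/d89b7efa-7b6a-403a-b465-19c4d22933dd
>
> OS: ubuntu 24.04.2 server GPU: Nvidia CPU: Intel RAM: 256GB NVRAM: 12GB Ollama version: 0.6.5 tool: Page Assist Extension in chrome
>
> gemma3:27b
>
> It seems to work well in my environment.

I tested gemma3:27b, and its recognition quality on large images is not very good either.
Test image
Image: https://github.com/user-attachments/assets/0c9abe5c-f8f8-42dd-a579-3a2447359341
Result
Image: https://github.com/user-attachments/assets/825a48e7-2ab8-4251-9bb3-ef97677c6304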


@egg1234 commented on GitHub (Apr 24, 2025):

Image: https://github.com/user-attachments/assets/ed38b4f5-bb51-4284-b03e-9a2e240db966

As you can see here, even with Gemini 2.5 Pro the result is the same as with gemma3:27b.

Reference: github-starred/ollama#6771