[GH-ISSUE #9418] Granite Vision model granite3.2-vision:2b-q8_0 #6142

Closed
opened 2026-04-12 17:29:35 -05:00 by GiteaMirror · 2 comments
Owner

Originally created by @paulstoicasebastian on GitHub (Feb 28, 2025).
Original GitHub issue: https://github.com/ollama/ollama/issues/9418

What is the issue?

I try OCR on local images from Ollama console (Windows). The response for the second image (and so on) contains the text from the first processed image.

The model granite3.2-vision:latest seams to work fine.

Example (3 different images)

extract text from G:\test4.jpg
Added image 'G:\test4.jpg'

POWER SUPPLY
THE CALCULATOR IS POWERED BY
SOLAR CELL AND A BATTERY FOR
BACKUP.THE DISPLAY WILL DIM WHEN
THE BATTERY NEEDS TO BE REPLACED.
THE " SIGN OF EACH BATTERY MUST

extract text from G:\text2.png
Added image 'G:\text2.png'

POWER SUPPLY
SOLAR CELL AND A BATTERY FOR
BACKUP.THE DISPLAY WILL DIM WHEN
THE BATTERY NEEDS TO BE REPLACED.
THE " SIGN OF EACH BATTERY MUST
SHOW UPWARD WHEN INSERTED.
This is the first line of
this text example.
This is the second line
of the same text.
This is the first line of
this text example.
This is the second line
of the same text.

extract text from G:\text1.png
Added image 'G:\text1.png'

POWER SUPPLY
SOLAR CELL AND A BATTERY MUST
SHOW UPWARD WHEN INSERTED.
This is the first line of
this text example.
This is the second line
of the same text.
It was the best of
times, it was the worst
of wisdom, it was the
age of foolishness...

Relevant log output


OS

Windows

GPU

Nvidia

CPU

Intel

Ollama version

0.5.13-rc1

Originally created by @paulstoicasebastian on GitHub (Feb 28, 2025). Original GitHub issue: https://github.com/ollama/ollama/issues/9418 ### What is the issue? I try OCR on local images from Ollama console (Windows). The response for the second image (and so on) contains the text from the first processed image. The model granite3.2-vision:latest seams to work fine. Example (3 different images) >>> extract text from G:\test4.jpg Added image 'G:\test4.jpg' POWER SUPPLY THE CALCULATOR IS POWERED BY SOLAR CELL AND A BATTERY FOR BACKUP.THE DISPLAY WILL DIM WHEN THE BATTERY NEEDS TO BE REPLACED. THE " SIGN OF EACH BATTERY MUST >>> extract text from G:\text2.png Added image 'G:\text2.png' <doc> POWER SUPPLY SOLAR CELL AND A BATTERY FOR BACKUP.THE DISPLAY WILL DIM WHEN THE BATTERY NEEDS TO BE REPLACED. THE " SIGN OF EACH BATTERY MUST SHOW UPWARD WHEN INSERTED. This is the first line of this text example. This is the second line of the same text. This is the first line of this text example. This is the second line of the same text. </doc> >>> extract text from G:\text1.png Added image 'G:\text1.png' <doc> POWER SUPPLY SOLAR CELL AND A BATTERY MUST SHOW UPWARD WHEN INSERTED. This is the first line of this text example. This is the second line of the same text. It was the best of times, it was the worst of wisdom, it was the age of foolishness... </doc> ### Relevant log output ```shell ``` ### OS Windows ### GPU Nvidia ### CPU Intel ### Ollama version 0.5.13-rc1
GiteaMirror added the bug label 2026-04-12 17:29:35 -05:00
Author
Owner

@jmorganca commented on GitHub (Feb 28, 2025):

Hi @paulstoicasebastian this may be because follow-on images also contain the chat history from the previous ones. Would it be try by creating a new chat for each image? In the meantime, we'll work on improving this so follow-on images don't impact future ones. This may be somewhat model-dependent, but I totally agree having context from one image leak into the next one isn't deseriable.

<!-- gh-comment-id:2691725179 --> @jmorganca commented on GitHub (Feb 28, 2025): Hi @paulstoicasebastian this may be because follow-on images also contain the chat history from the previous ones. Would it be try by creating a new chat for each image? In the meantime, we'll work on improving this so follow-on images don't impact future ones. This may be somewhat model-dependent, but I totally agree having context from one image leak into the next one isn't deseriable.
Author
Owner

@paulstoicasebastian commented on GitHub (Mar 1, 2025):

Yes, new chat for every pic do resolve this problem.

<!-- gh-comment-id:2692176063 --> @paulstoicasebastian commented on GitHub (Mar 1, 2025): Yes, new chat for every pic do resolve this problem.
Sign in to join this conversation.
1 Participants
Notifications
Due Date
No due date set.
Dependencies

No dependencies set.

Reference: github-starred/ollama#6142