[PR #10388] Fixes for Mistral 3 Small #13228

Closed
opened 2026-04-13 00:21:30 -05:00 by GiteaMirror · 0 comments
Owner

Original Pull Request: https://github.com/ollama/ollama/pull/10388

State: closed
Merged: No


These patches are requires for Mistral 3 Small to work with (larger?) images.

The first prevents an overflow in in the CUDA scale function, by chunking the work into blocks of at most INT_MAX.

The second squishes an assertion that is triggered by Mistral 3 Small where on layer 27, we copy to a view that is backed by memory that is a different (larger) size to the view. The size of the view matches the source, so this operation is safe.

These patches fix https://github.com/ollama/ollama/issues/10234

**Original Pull Request:** https://github.com/ollama/ollama/pull/10388 **State:** closed **Merged:** No --- These patches are requires for Mistral 3 Small to work with (larger?) images. The first prevents an overflow in in the CUDA scale function, by chunking the work into blocks of at most INT_MAX. The second squishes an assertion that is triggered by Mistral 3 Small where on layer 27, we copy to a view that is backed by memory that is a different (larger) size to the view. The size of the view matches the source, so this operation is safe. These patches fix https://github.com/ollama/ollama/issues/10234
GiteaMirror added the pull-request label 2026-04-13 00:21:30 -05:00
Sign in to join this conversation.
1 Participants
Notifications
Due Date
No due date set.
Dependencies

No dependencies set.

Reference: github-starred/ollama#13228