[PR #10674] Multi-modal worst case graph #13319

Closed
opened 2026-04-13 00:23:43 -05:00 by GiteaMirror · 0 comments
Owner

Original Pull Request: https://github.com/ollama/ollama/pull/10674

State: closed
Merged: Yes


We currently preallocate compute graph memory for the worst case batch of text tokens. This adds support for doing the same for images.

Note that image models are more complicated than text models in how they process their inputs so there may be cases where this approach isn't completely generic for all models. It covers all currently supported models though.

**Original Pull Request:** https://github.com/ollama/ollama/pull/10674 **State:** closed **Merged:** Yes --- We currently preallocate compute graph memory for the worst case batch of text tokens. This adds support for doing the same for images. Note that image models are more complicated than text models in how they process their inputs so there may be cases where this approach isn't completely generic for all models. It covers all currently supported models though.
GiteaMirror added the pull-request label 2026-04-13 00:23:43 -05:00
Sign in to join this conversation.
1 Participants
Notifications
Due Date
No due date set.
Dependencies

No dependencies set.

Reference: github-starred/ollama#13319