[PR #10468] lower default num parallel to 2 #13250

Closed
opened 2026-04-13 00:22:03 -05:00 by GiteaMirror · 0 comments
Owner

Original Pull Request: https://github.com/ollama/ollama/pull/10468

State: closed
Merged: Yes


this is in part to "pay" for #10452, which doubled the default context length. The combination isn't fully neutral though, because even though the old 4x2k limit and the new 2x4k limit are memory equivalent, the 1x fallback is larger with 4k

**Original Pull Request:** https://github.com/ollama/ollama/pull/10468 **State:** closed **Merged:** Yes --- this is in part to "pay" for #10452, which doubled the default context length. The combination isn't fully neutral though, because even though the old 4x2k limit and the new 2x4k limit are memory equivalent, the 1x fallback is larger with 4k
GiteaMirror added the pull-request label 2026-04-13 00:22:03 -05:00
Sign in to join this conversation.
1 Participants
Notifications
Due Date
No due date set.
Dependencies

No dependencies set.

Reference: github-starred/ollama#13250