[PR #11383] ggml: Disable unused pipeline parallelism #13524

Closed
opened 2026-04-13 00:29:27 -05:00 by GiteaMirror · 0 comments
Owner

Original Pull Request: https://github.com/ollama/ollama/pull/11383

State: closed
Merged: Yes


We're not currently using pipeline parallelism, even in cases where we could. Disabling it improves generation performance by 10-30% with multiple GPUs.

GiteaMirror added the pull-request label 2026-04-13 00:29:27 -05:00

Reference: github-starred/ollama#13524