[GH-ISSUE #3753] 多卡推理块开始单卡快? #64353

Closed
opened 2026-05-03 17:15:01 -05:00 by GiteaMirror · 0 comments
Owner

Originally created by @papandadj on GitHub (Apr 19, 2024).
Original GitHub issue: https://github.com/ollama/ollama/issues/3753

What is the issue?

我不太懂推理,假设我有4个gpu,每个24g。我的模型运行起来也需要24g,此时我应该选择多卡还是单卡。单卡使用24g快还是多卡每个使用6g快?

OS

No response

GPU

No response

CPU

No response

Ollama version

No response

Originally created by @papandadj on GitHub (Apr 19, 2024). Original GitHub issue: https://github.com/ollama/ollama/issues/3753 ### What is the issue? 我不太懂推理,假设我有4个gpu,每个24g。我的模型运行起来也需要24g,此时我应该选择多卡还是单卡。单卡使用24g快还是多卡每个使用6g快? ### OS _No response_ ### GPU _No response_ ### CPU _No response_ ### Ollama version _No response_
GiteaMirror added the bug label 2026-05-03 17:15:01 -05:00
Sign in to join this conversation.
1 Participants
Notifications
Due Date
No due date set.
Dependencies

No dependencies set.

Reference: github-starred/ollama#64353