[GH-ISSUE #4996] Apple Silicon macs with 8GB or 16GB slow down when loading larger models #49671

Open
opened 2026-04-28 12:36:09 -05:00 by GiteaMirror · 0 comments
Owner

Originally created by @jmorganca on GitHub (Jun 12, 2024).
Original GitHub issue: https://github.com/ollama/ollama/issues/4996

What is the issue?

Less of the model should be loaded to Metal to avoid causing lag

OS

macOS

GPU

No response

CPU

No response

Ollama version

No response

Originally created by @jmorganca on GitHub (Jun 12, 2024). Original GitHub issue: https://github.com/ollama/ollama/issues/4996 ### What is the issue? Less of the model should be loaded to Metal to avoid causing lag ### OS macOS ### GPU _No response_ ### CPU _No response_ ### Ollama version _No response_
GiteaMirror added the bug label 2026-04-28 12:36:09 -05:00
Sign in to join this conversation.
1 Participants
Notifications
Due Date
No due date set.
Dependencies

No dependencies set.

Reference: github-starred/ollama#49671