[GH-ISSUE #9821] Models have different weights in process status #32190

Closed
opened 2026-04-22 13:13:55 -05:00 by GiteaMirror · 2 comments

Originally created by @DavidePrati99 on GitHub (Mar 17, 2025).
Original GitHub issue: https://github.com/ollama/ollama/issues/9821

What is the issue?

Why does phi4 14b (which weighs 9.1 GB) show a size of 13 GB in ps, while gemma3:12b, which weighs less (8.1 GB), shows a larger size in ps (16 GB)?

Relevant log output


OS

No response

GPU

No response

CPU

No response

Ollama version

No response

GiteaMirror added the question label 2026-04-22 13:13:55 -05:00

@rick-github commented on GitHub (Mar 17, 2025):

#9791


@pdevine commented on GitHub (Mar 17, 2025):

While most of gemma3 is quantized, there is also a non-quantized vision component, which is what causes the consumed memory to be larger.

There is also a memory leak with gemma3, which will be patched later today and released with 0.6.2; that is what @rick-github was alluding to.

I'll go ahead and close the issue as answered.
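
For readers comparing the numbers, here is a minimal sketch of the arithmetic behind the question. It uses only the four figures quoted in the issue; the function and variable names are made up for illustration and are not part of Ollama. It only computes the gap between the on-disk size and the size shown in ps, and does not break that gap down into its causes.

```python
# Figures quoted in the issue, in GB: (size on disk, size reported by ps).
# The dictionary and function names are hypothetical, for illustration only.
REPORTED = {
    "phi4 14b": (9.1, 13.0),
    "gemma3:12b": (8.1, 16.0),
}


def overhead_gb(on_disk_gb, loaded_gb):
    """Return how much larger the loaded size is than the on-disk size."""
    # Round to one decimal place to hide floating-point noise.
    return round(loaded_gb - on_disk_gb, 1)


def overhead_table(reported):
    """Map each model name to its loaded-minus-on-disk difference in GB."""
    return {
        name: overhead_gb(disk, loaded)
        for name, (disk, loaded) in reported.items()
    }


if __name__ == "__main__":
    for name, extra in overhead_table(REPORTED).items():
        print(f"{name}: {extra} GB beyond the on-disk size")
```

By these figures gemma3:12b carries roughly twice the extra memory of phi4 14b, which is the gap the reply above attributes to the non-quantized vision component and the memory leak.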


Reference: github-starred/ollama#32190