[GH-ISSUE #10494] Entire model size being allocated to pagefile even with 100% GPU offload #68962

Closed
opened 2026-05-04 16:28:27 -05:00 by GiteaMirror · 1 comment

Originally created by @ethanblake4 on GitHub (Apr 30, 2025).
Original GitHub issue: https://github.com/ollama/ollama/issues/10494

What is the issue?

Using Ollama v0.6.6 with ROCm backend on Windows 11 (gfx1151).
When a model is loaded with 100% GPU offload, Ollama still commits the model's entire size in system memory, which causes Windows to create massive pagefiles unnecessarily. The committed space does not seem to be used for anything: if it were, the memory would show up as in use rather than merely paged. The only effect is that it takes up extra SSD space.
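One way to check the gap between committed and resident memory is to compare a process's commit charge with its working set. The following is a minimal sketch, not part of Ollama, assuming Windows and Python's standard `ctypes` module. It calls the Win32 `GetProcessMemoryInfo` function, where `PagefileUsage` is the commit charge and `WorkingSetSize` is the memory resident in RAM. The helper names (`committed_not_resident_gib`, `read_memory_counters`) and the PID passed on the command line are hypothetical.

```python
import ctypes
import sys


class ProcessMemoryCounters(ctypes.Structure):
    """Mirror of the Win32 PROCESS_MEMORY_COUNTERS structure."""
    _fields_ = [
        ("cb", ctypes.c_uint32),
        ("PageFaultCount", ctypes.c_uint32),
        ("PeakWorkingSetSize", ctypes.c_size_t),
        ("WorkingSetSize", ctypes.c_size_t),       # bytes resident in RAM
        ("QuotaPeakPagedPoolUsage", ctypes.c_size_t),
        ("QuotaPagedPoolUsage", ctypes.c_size_t),
        ("QuotaPeakNonPagedPoolUsage", ctypes.c_size_t),
        ("QuotaNonPagedPoolUsage", ctypes.c_size_t),
        ("PagefileUsage", ctypes.c_size_t),        # commit charge in bytes
        ("PeakPagefileUsage", ctypes.c_size_t),
    ]


def committed_not_resident_gib(commit_bytes, working_set_bytes):
    """Return how much memory is committed but not resident, in GiB."""
    return max(commit_bytes - working_set_bytes, 0) / 2**30


def read_memory_counters(pid):
    """Return (commit_bytes, working_set_bytes) for a PID. Windows only."""
    kernel32 = ctypes.WinDLL("kernel32", use_last_error=True)
    psapi = ctypes.WinDLL("psapi", use_last_error=True)
    kernel32.OpenProcess.restype = ctypes.c_void_p
    kernel32.OpenProcess.argtypes = [ctypes.c_uint32, ctypes.c_int, ctypes.c_uint32]
    kernel32.CloseHandle.argtypes = [ctypes.c_void_p]
    psapi.GetProcessMemoryInfo.restype = ctypes.c_int
    psapi.GetProcessMemoryInfo.argtypes = [
        ctypes.c_void_p,
        ctypes.POINTER(ProcessMemoryCounters),
        ctypes.c_uint32,
    ]

    # PROCESS_QUERY_LIMITED_INFORMATION (0x1000) | PROCESS_VM_READ (0x0010)
    handle = kernel32.OpenProcess(0x1000 | 0x0010, False, pid)
    if not handle:
        raise ctypes.WinError(ctypes.get_last_error())
    try:
        counters = ProcessMemoryCounters()
        counters.cb = ctypes.sizeof(counters)
        ok = psapi.GetProcessMemoryInfo(handle, ctypes.byref(counters), counters.cb)
        if not ok:
            raise ctypes.WinError(ctypes.get_last_error())
        return counters.PagefileUsage, counters.WorkingSetSize
    finally:
        kernel32.CloseHandle(handle)


if __name__ == "__main__" and sys.platform == "win32" and len(sys.argv) > 1:
    commit, working_set = read_memory_counters(int(sys.argv[1]))
    print(f"commit charge: {commit / 2**30:.2f} GiB")
    print(f"working set:   {working_set / 2**30:.2f} GiB")
    print(f"committed but not resident: "
          f"{committed_not_resident_gib(commit, working_set):.2f} GiB")
```

Running the script with the PID of the Ollama process that holds the model (visible in Task Manager) would show whether a commit charge close to the model size coincides with a much smaller working set, which is the behavior described above.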

Relevant log output

(base) PS C:\Users\home> ollama ps
NAME                     ID              SIZE      PROCESSOR    UNTIL
qwen2.5-coder:7b-base    bd8755145f1c    7.0 GB    100% GPU     24 minutes from now

OS

Windows 11

GPU

Radeon (TM) 8050S Graphics

CPU

AMD Ryzen AI Max PRO 390

Ollama version

0.6.6

GiteaMirror added the bug label 2026-05-04 16:28:27 -05:00

@rick-github commented on GitHub (Apr 30, 2025):

Server logs (https://github.com/ollama/ollama/blob/main/docs/troubleshooting.md#how-to-troubleshoot-issues) may aid in debugging.

Reference: github-starred/ollama#68962