[GH-ISSUE #5852] Ollama download model with cause my Hard Drive to always 100% Usage #65688

New Issue

GiteaMirror · 2026-05-03T22:14:42-05:00

GiteaMirror commented

2026-05-03 22:14:42 -05:00

Originally created by @rentianxiang on GitHub (Jul 22, 2024).
Original GitHub issue: https://github.com/ollama/ollama/issues/5852

Originally assigned to: @dhiltgen on GitHub.

What is the issue?

When I want to download new models, for example I run: ollama run gemma2:27b

The model download will stuck, and according to my task manager, my C: drive SSD is always 100% usage.

I cannot kill ollama's process and only option for me is to force restart my PC

OS

Windows

GPU

Nvidia

CPU

Intel

Ollama version

0.2.7

Originally created by @rentianxiang on GitHub (Jul 22, 2024). Original GitHub issue: https://github.com/ollama/ollama/issues/5852 Originally assigned to: @dhiltgen on GitHub. ### What is the issue? When I want to download new models, for example I run: ollama run gemma2:27b The model download will stuck, and according to my task manager, my C: drive SSD is always 100% usage. I cannot kill ollama's process and only option for me is to force restart my PC ### OS Windows ### GPU Nvidia ### CPU Intel ### Ollama version 0.2.7

GiteaMirror added the bug windows labels 2026-05-03 22:14:42 -05:00

GiteaMirror closed this issue

2026-05-03 22:14:45 -05:00

GiteaMirror commented

2026-05-03 22:14:46 -05:00

@rick-github commented on GitHub (Jul 22, 2024):

Delete stuff to make room for the model.

@rick-github commented on GitHub (Jul 22, 2024): Delete stuff to make room for the model.

GiteaMirror commented

2026-05-03 22:14:47 -05:00

@rentianxiang commented on GitHub (Jul 22, 2024):

Delete stuff to make room for the model.

Hi Rick, I have 800G in C:/ and 3TB in D:/, no room should not be the issue here

@rentianxiang commented on GitHub (Jul 22, 2024): > Delete stuff to make room for the model. Hi Rick, I have 800G in C:/ and 3TB in D:/, no room should not be the issue here

GiteaMirror commented

2026-05-03 22:14:48 -05:00

@rick-github commented on GitHub (Jul 22, 2024):

What does 100% mean?

@rick-github commented on GitHub (Jul 22, 2024): What does 100% mean?

GiteaMirror commented

2026-05-03 22:14:48 -05:00

@rentianxiang commented on GitHub (Jul 22, 2024):

What does 100% mean?

Thank you for looking into it!
This is what happened when I start to run: ollama run gemma2:27b
The writing speed is crazy high, and I believe my network is not that good.
Even I stop the downloading by pressing ctrl+c, it still remains 100%.

@rentianxiang commented on GitHub (Jul 22, 2024): > What does 100% mean? Thank you for looking into it! This is what happened when I start to run: ollama run gemma2:27b The writing speed is crazy high, and I believe my network is not that good. Even I stop the downloading by pressing ctrl+c, it still remains 100%. ![image](https://github.com/user-attachments/assets/457e56d4-e16a-435c-a552-886a2ce122e1)

GiteaMirror commented

2026-05-03 22:14:49 -05:00

@rick-github commented on GitHub (Jul 22, 2024):

Add server logs, it may make it easier to debug.

Also look in the server logs for OLLAMA_MODELS. Then do a dir /s of that directory and report what you see.

@rick-github commented on GitHub (Jul 22, 2024): Add server logs, it may make it easier to debug. Also look in the server logs for `OLLAMA_MODELS`. Then do a `dir /s` of that directory and report what you see.

GiteaMirror commented

2026-05-03 22:14:49 -05:00

@rentianxiang commented on GitHub (Jul 22, 2024):

Server logs for my last failed run:
2024/07/22 22:24:55 routes.go:1096: INFO server config env="map[CUDA_VISIBLE_DEVICES: GPU_DEVICE_ORDINAL: HIP_VISIBLE_DEVICES: HSA_OVERRIDE_GFX_VERSION: OLLAMA_DEBUG:false OLLAMA_FLASH_ATTENTION:false OLLAMA_HOST:http://127.0.0.1:11434 OLLAMA_INTEL_GPU:false OLLAMA_KEEP_ALIVE:5m0s OLLAMA_LLM_LIBRARY: OLLAMA_MAX_LOADED_MODELS:0 OLLAMA_MAX_QUEUE:512 OLLAMA_MAX_VRAM:0 OLLAMA_MODELS:C:\Users\rtx\.ollama\models OLLAMA_NOHISTORY:false OLLAMA_NOPRUNE:false OLLAMA_NUM_PARALLEL:0 OLLAMA_ORIGINS:[http://localhost https://localhost http://localhost:* https://localhost:* http://127.0.0.1 https://127.0.0.1 http://127.0.0.1:* https://127.0.0.1:* http://0.0.0.0 https://0.0.0.0 http://0.0.0.0:* https://0.0.0.0:* app://* file://* tauri://*] OLLAMA_RUNNERS_DIR:C:\Users\rtx\AppData\Local\Programs\Ollama\ollama_runners OLLAMA_SCHED_SPREAD:false OLLAMA_TMPDIR: ROCR_VISIBLE_DEVICES:]"
time=2024-07-22T22:24:55.726+08:00 level=INFO source=images.go:778 msg="total blobs: 5"
time=2024-07-22T22:24:55.732+08:00 level=INFO source=images.go:785 msg="total unused blobs removed: 0"
time=2024-07-22T22:24:55.734+08:00 level=INFO source=routes.go:1143 msg="Listening on 127.0.0.1:11434 (version 0.2.7)"
time=2024-07-22T22:24:55.736+08:00 level=INFO source=payload.go:44 msg="Dynamic LLM libraries [cpu_avx cpu_avx2 cuda_v11.3 rocm_v6.1 cpu]"
time=2024-07-22T22:24:55.737+08:00 level=INFO source=gpu.go:205 msg="looking for compatible GPUs"
time=2024-07-22T22:24:55.877+08:00 level=INFO source=gpu.go:287 msg="detected OS VRAM overhead" id=GPU-e3ce22d3-ac09-e72f-5795-3c3f0a60b4d2 library=cuda compute=8.9 driver=12.5 name="NVIDIA GeForce RTX 4080 SUPER" overhead="410.6 MiB"
time=2024-07-22T22:24:55.878+08:00 level=INFO source=types.go:105 msg="inference compute" id=GPU-e3ce22d3-ac09-e72f-5795-3c3f0a60b4d2 library=cuda compute=8.9 driver=12.5 name="NVIDIA GeForce RTX 4080 SUPER" total="16.0 GiB" available="14.7 GiB"

==========

dir -s on OLLAMA_MODELS
dir -s

目录: C:\Users\rtx\.ollama\models