Mirror of https://github.com/ollama/ollama.git (synced 2026-05-05)
Closed · opened 2026-04-12 21:02:15 -05:00 by GiteaMirror · 24 comments
Originally created by @samialisayed on GitHub (Oct 15, 2025).
Original GitHub issue: https://github.com/ollama/ollama/issues/12640
Originally assigned to: @dhiltgen on GitHub.
What is the issue?
Ollama hangs when I send a message and never returns a response. This happens with all models. Any idea how to solve this? Ollama was working fine a month ago; I do not know what happened!
Relevant log output
OS: Windows
GPU: Tesla T4
CPU: No response
Ollama version: v0.12.5
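A quick way to tell whether the server itself is still responsive (as opposed to a stuck model load) is a direct API call; for example, from PowerShell against the default port, substituting any installed model for llama3:
# Minimal generate request; if this also hangs, the problem is server-side, not in the client or GUI
Invoke-RestMethod -Uri "http://localhost:11434/api/generate" -Method Post -ContentType "application/json" -Body '{"model":"llama3","prompt":"hi","stream":false}'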
@jmorganca commented on GitHub (Oct 15, 2025):
@samialisayed may I ask which model you are running? Sorry about that!
@samialisayed commented on GitHub (Oct 15, 2025):
@jmorganca
I tried many models such as llama3, tinyllama, phi
@kappa8219 commented on GitHub (Oct 17, 2025):
See #12660; fixed in 0.12.6.
@samialisayed commented on GitHub (Oct 17, 2025):
@kappa8219 It is not fixed; it is still hanging.
@kappa8219 commented on GitHub (Oct 17, 2025):
Strange, I have the same GPU; maybe the node is different.
@samialisayed commented on GitHub (Oct 17, 2025):
@kappa8219 do you recommend any solution?
@kappa8219 commented on GitHub (Oct 17, 2025):
No, none for Windows. On Linux I would try to get the system logs.
@katmandoo212 commented on GitHub (Oct 17, 2025):
Same issue. Tried several models including gpt-oss:20b downloaded to my PC today.
ollama --version
ollama version is 0.12.6
ollama run qwen3:1.7b "What is the equation to calculate the area of a square?"
ollama serve
time=2025-10-17T17:39:29.977-04:00 level=INFO source=routes.go:1511 msg="server config" env="map[CUDA_VISIBLE_DEVICES: GGML_VK_VISIBLE_DEVICES: GPU_DEVICE_ORDINAL: HIP_VISIBLE_DEVICES: HSA_OVERRIDE_GFX_VERSION: HTTPS_PROXY: HTTP_PROXY: NO_PROXY: OLLAMA_CONTEXT_LENGTH:4096 OLLAMA_DEBUG:INFO OLLAMA_FLASH_ATTENTION:false OLLAMA_GPU_OVERHEAD:0 OLLAMA_HOST:http://127.0.0.1:11434 OLLAMA_INTEL_GPU:false OLLAMA_KEEP_ALIVE:5m0s OLLAMA_KV_CACHE_TYPE: OLLAMA_LLM_LIBRARY: OLLAMA_LOAD_TIMEOUT:5m0s OLLAMA_MAX_LOADED_MODELS:0 OLLAMA_MAX_QUEUE:512 OLLAMA_MODELS:G:\OllamaFiles\models OLLAMA_MULTIUSER_CACHE:false OLLAMA_NEW_ENGINE:false OLLAMA_NOHISTORY:false OLLAMA_NOPRUNE:false OLLAMA_NUM_PARALLEL:1 OLLAMA_ORIGINS:[http://localhost https://localhost http://localhost:* https://localhost:* http://127.0.0.1 https://127.0.0.1 http://127.0.0.1:* https://127.0.0.1:* http://0.0.0.0 https://0.0.0.0 http://0.0.0.0:* https://0.0.0.0:* app://* file://* tauri://* vscode-webview://* vscode-file://*] OLLAMA_REMOTES:[ollama.com] OLLAMA_SCHED_SPREAD:false ROCR_VISIBLE_DEVICES:]"
time=2025-10-17T17:39:30.041-04:00 level=INFO source=images.go:522 msg="total blobs: 330"
time=2025-10-17T17:39:30.075-04:00 level=INFO source=images.go:529 msg="total unused blobs removed: 0"
time=2025-10-17T17:39:30.087-04:00 level=INFO source=routes.go:1564 msg="Listening on 127.0.0.1:11434 (version 0.12.6)"
time=2025-10-17T17:39:30.090-04:00 level=INFO source=runner.go:80 msg="discovering available GPUs..."
time=2025-10-17T17:39:30.878-04:00 level=INFO source=types.go:129 msg="inference compute" id=cpu library=cpu compute="" name=cpu description=cpu libdirs=ollama driver="" pci_id="" type="" total="11.9 GiB" available="8.1 GiB"
time=2025-10-17T17:39:30.878-04:00 level=INFO source=routes.go:1605 msg="entering low vram mode" "total vram"="0 B" threshold="20.0 GiB"
[GIN] 2025/10/17 - 17:39:45 | 200 | 603.2µs | 127.0.0.1 | HEAD "/"
[GIN] 2025/10/17 - 17:39:45 | 200 | 221.7137ms | 127.0.0.1 | POST "/api/show"
@mkrostm commented on GitHub (Oct 17, 2025):
Same issue with all models: deepseek-r1:1.5b, llama3.2, qwen2.5.
ollama version is 0.12.6
Windows 10
@Panican-Whyasker commented on GitHub (Oct 19, 2025):
Same issue here. Ollama 0.12.6 on Windows 11; GPU nVidia GeForce 840M (CUDA compute capability 5.0, the minimum required by Ollama) with 2 GB of VRAM; CPU Intel Core i7-5600, 6 GB system RAM. I have therefore tried only rather small LLMs like gemma3:270m and qwen3:1.7b. Starting the LLMs from either PowerShell or Ollama's GUI gives the same result: the model start takes forever. At the same time, one ollama.exe process keeps running at 25% CPU (one of four logical cores at 100%). I looked at the process's threads with SysInternals Process Explorer; see the attached screenshot.
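For reference, the pegged process can also be spot-checked from PowerShell without Process Explorer (the CPU column is cumulative processor seconds, so running it twice shows it climbing):
# Lists every ollama process with its accumulated CPU time in seconds
Get-Process ollama | Select-Object Id, ProcessName, CPU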
Here is my server log:
time=2025-10-19T12:39:08.441+02:00 level=INFO source=routes.go:1511 msg="server config" env="map[CUDA_VISIBLE_DEVICES: GGML_VK_VISIBLE_DEVICES: GPU_DEVICE_ORDINAL: HIP_VISIBLE_DEVICES: HSA_OVERRIDE_GFX_VERSION: HTTPS_PROXY: HTTP_PROXY: NO_PROXY: OLLAMA_CONTEXT_LENGTH:4096 OLLAMA_DEBUG:INFO OLLAMA_FLASH_ATTENTION:false OLLAMA_GPU_OVERHEAD:0 OLLAMA_HOST:http://127.0.0.1:11434 OLLAMA_INTEL_GPU:false OLLAMA_KEEP_ALIVE:5m0s OLLAMA_KV_CACHE_TYPE: OLLAMA_LLM_LIBRARY: OLLAMA_LOAD_TIMEOUT:5m0s OLLAMA_MAX_LOADED_MODELS:0 OLLAMA_MAX_QUEUE:512 OLLAMA_MODELS:C:\Users\Joro\.ollama\models OLLAMA_MULTIUSER_CACHE:false OLLAMA_NEW_ENGINE:false OLLAMA_NOHISTORY:false OLLAMA_NOPRUNE:false OLLAMA_NUM_PARALLEL:1 OLLAMA_ORIGINS:[http://localhost https://localhost http://localhost:* https://localhost:* http://127.0.0.1 https://127.0.0.1 http://127.0.0.1:* https://127.0.0.1:* http://0.0.0.0 https://0.0.0.0 http://0.0.0.0:* https://0.0.0.0:* app://* file://* tauri://* vscode-webview://* vscode-file://*] OLLAMA_REMOTES:[ollama.com] OLLAMA_SCHED_SPREAD:false ROCR_VISIBLE_DEVICES:]"
time=2025-10-19T12:39:08.447+02:00 level=INFO source=images.go:522 msg="total blobs: 12"
time=2025-10-19T12:39:08.447+02:00 level=INFO source=images.go:529 msg="total unused blobs removed: 0"
time=2025-10-19T12:39:08.450+02:00 level=INFO source=routes.go:1564 msg="Listening on 127.0.0.1:11434 (version 0.12.6)"
time=2025-10-19T12:39:08.453+02:00 level=INFO source=runner.go:80 msg="discovering available GPUs..."
time=2025-10-19T12:39:10.284+02:00 level=INFO source=types.go:112 msg="inference compute" id=GPU-838173db-880d-6d37-e6ad-d4b277cdde5a library=CUDA compute=5.0 name=CUDA0 description="NVIDIA GeForce 840M" libdirs=ollama,cuda_v12 driver=13.0 pci_id=03:00.0 type=discrete total="2.0 GiB" available="2.0 GiB"
time=2025-10-19T12:39:10.285+02:00 level=INFO source=routes.go:1605 msg="entering low vram mode" "total vram"="2.0 GiB" threshold="20.0 GiB"
[GIN] 2025/10/19 - 12:39:10 | 200 | 522.6µs | 127.0.0.1 | HEAD "/"
[GIN] 2025/10/19 - 12:39:10 | 200 | 3.7394ms | 127.0.0.1 | GET "/api/tags"
[GIN] 2025/10/19 - 12:43:09 | 200 | 0s | 127.0.0.1 | HEAD "/"
[GIN] 2025/10/19 - 12:43:09 | 200 | 117.6405ms | 127.0.0.1 | POST "/api/show"
And that's it. Ollama.exe (one of two copies in memory) keeps running at 25% CPU (100% of one logical core).
When trying to run the same model from within Ollama's GUI, the server log gets a few extra lines:
time=2025-10-19T12:54:34.628+02:00 level=INFO source=routes.go:1511 msg="server config" env="map[CUDA_VISIBLE_DEVICES: GGML_VK_VISIBLE_DEVICES: GPU_DEVICE_ORDINAL: HIP_VISIBLE_DEVICES: HSA_OVERRIDE_GFX_VERSION: HTTPS_PROXY: HTTP_PROXY: NO_PROXY: OLLAMA_CONTEXT_LENGTH:4096 OLLAMA_DEBUG:INFO OLLAMA_FLASH_ATTENTION:false OLLAMA_GPU_OVERHEAD:0 OLLAMA_HOST:http://127.0.0.1:11434 OLLAMA_INTEL_GPU:false OLLAMA_KEEP_ALIVE:5m0s OLLAMA_KV_CACHE_TYPE: OLLAMA_LLM_LIBRARY: OLLAMA_LOAD_TIMEOUT:5m0s OLLAMA_MAX_LOADED_MODELS:0 OLLAMA_MAX_QUEUE:512 OLLAMA_MODELS:C:\Users\Joro\.ollama\models OLLAMA_MULTIUSER_CACHE:false OLLAMA_NEW_ENGINE:false OLLAMA_NOHISTORY:false OLLAMA_NOPRUNE:false OLLAMA_NUM_PARALLEL:1 OLLAMA_ORIGINS:[http://localhost https://localhost http://localhost:* https://localhost:* http://127.0.0.1 https://127.0.0.1 http://127.0.0.1:* https://127.0.0.1:* http://0.0.0.0 https://0.0.0.0 http://0.0.0.0:* https://0.0.0.0:* app://* file://* tauri://* vscode-webview://* vscode-file://*] OLLAMA_REMOTES:[ollama.com] OLLAMA_SCHED_SPREAD:false ROCR_VISIBLE_DEVICES:]"
time=2025-10-19T12:54:34.643+02:00 level=INFO source=images.go:522 msg="total blobs: 12"
time=2025-10-19T12:54:34.657+02:00 level=INFO source=images.go:529 msg="total unused blobs removed: 0"
time=2025-10-19T12:54:34.673+02:00 level=INFO source=routes.go:1564 msg="Listening on 127.0.0.1:11434 (version 0.12.6)"
time=2025-10-19T12:54:34.675+02:00 level=INFO source=runner.go:80 msg="discovering available GPUs..."
time=2025-10-19T12:54:37.090+02:00 level=INFO source=types.go:112 msg="inference compute" id=GPU-838173db-880d-6d37-e6ad-d4b277cdde5a library=CUDA compute=5.0 name=CUDA0 description="NVIDIA GeForce 840M" libdirs=ollama,cuda_v12 driver=13.0 pci_id=03:00.0 type=discrete total="2.0 GiB" available="2.0 GiB"
time=2025-10-19T12:54:37.091+02:00 level=INFO source=routes.go:1605 msg="entering low vram mode" "total vram"="2.0 GiB" threshold="20.0 GiB"
[GIN] 2025/10/19 - 12:54:37 | 200 | 525.5µs | 127.0.0.1 | GET "/api/version"
[GIN] 2025/10/19 - 12:54:37 | 200 | 525.5µs | 127.0.0.1 | GET "/api/version"
[GIN] 2025/10/19 - 12:54:37 | 200 | 5.7471ms | 127.0.0.1 | GET "/api/tags"
[GIN] 2025/10/19 - 12:54:37 | 200 | 160.8987ms | 127.0.0.1 | POST "/api/show"
[GIN] 2025/10/19 - 12:54:44 | 200 | 0s | 127.0.0.1 | GET "/api/version"
[GIN] 2025/10/19 - 12:54:44 | 200 | 6.6484ms | 127.0.0.1 | GET "/api/tags"
[GIN] 2025/10/19 - 12:54:56 | 200 | 124.4086ms | 127.0.0.1 | POST "/api/show"
[GIN] 2025/10/19 - 12:54:56 | 200 | 100.1837ms | 127.0.0.1 | POST "/api/show"
And here's my app.log:
time=2025-10-19T12:54:32.717+02:00 level=INFO source=app_windows.go:272 msg="starting Ollama" app=C:\Users\Joro\AppData\Local\Programs\Ollama version=0.12.6 OS=Windows/10.0.26100
time=2025-10-19T12:54:32.721+02:00 level=INFO source=app.go:232 msg="initialized tools registry" tool_count=0
time=2025-10-19T12:54:32.735+02:00 level=INFO source=app.go:247 msg="starting ollama server"
time=2025-10-19T12:54:33.173+02:00 level=INFO source=app.go:279 msg="starting ui server" port=56109
time=2025-10-19T12:54:35.437+02:00 level=INFO source=ui.go:171 msg=site.serveHTTP http.method=GET http.path=/api/v1/settings http.pattern="GET /api/v1/settings" http.status=200 http.d=997.1µs request_id=1760871275436712400 version=0.12.6
time=2025-10-19T12:54:35.471+02:00 level=INFO source=ui.go:171 msg=site.serveHTTP http.method=GET http.path=/api/v1/chats http.pattern="GET /api/v1/chats" http.status=200 http.d=1.0016ms request_id=1760871275470420900 version=0.12.6
time=2025-10-19T12:54:35.490+02:00 level=INFO source=ui.go:171 msg=site.serveHTTP http.method=GET http.path=/api/v1/me http.pattern="GET /api/v1/me" http.status=200 http.d=121.4562ms request_id=1760871275368774800 version=0.12.6
time=2025-10-19T12:54:36.039+02:00 level=ERROR source=ui.go:1617 msg="failed to get inference compute" error="timeout scanning server log for inference compute details"
time=2025-10-19T12:54:36.039+02:00 level=ERROR source=ui.go:171 msg=site.serveHTTP error="failed to get inference compute: timeout scanning server log for inference compute details" http.method=GET http.path=/api/v1/inference-compute http.pattern="GET /api/v1/inference-compute" http.status=500 http.d=592.3808ms request_id=1760871275447595400 version=0.12.6
time=2025-10-19T12:54:36.174+02:00 level=INFO source=updater.go:252 msg="beginning update checker" interval=1h0m0s
time=2025-10-19T12:54:37.091+02:00 level=INFO source=ui.go:171 msg=site.serveHTTP http.method=GET http.path=/api/v1/health http.pattern="GET /api/v1/health" http.status=200 http.d=1.6225602s request_id=1760871275469340400 version=0.12.6
time=2025-10-19T12:54:37.098+02:00 level=INFO source=ui.go:171 msg=site.serveHTTP http.method=GET http.path=/api/v1/models http.pattern="GET /api/v1/models" http.status=200 http.d=1.6539439s request_id=1760871275444706100 version=0.12.6
time=2025-10-19T12:54:37.187+02:00 level=INFO source=server.go:343 msg=Matched "inference compute"="{Library:CUDA Variant: Compute:5.0 Driver:13.0 Name:CUDA0 VRAM:2.0 GiB}"
time=2025-10-19T12:54:37.187+02:00 level=INFO source=ui.go:171 msg=site.serveHTTP http.method=GET http.path=/api/v1/inference-compute http.pattern="GET /api/v1/inference-compute" http.status=200 http.d=128.9369ms request_id=1760871277058569100 version=0.12.6
time=2025-10-19T12:54:37.253+02:00 level=INFO source=ui.go:171 msg=site.serveHTTP http.method=GET http.path=/api/v1/model/gemma3:270m/capabilities http.pattern="GET /api/v1/model/{model}/capabilities" http.status=200 http.d=1.7391883s request_id=1760871275514116200 version=0.12.6
time=2025-10-19T12:54:37.293+02:00 level=INFO source=ui.go:171 msg=site.serveHTTP http.method=POST http.path=/api/v1/model/upstream http.pattern="POST /api/v1/model/upstream" http.status=200 http.d=161.3473ms request_id=1760871277131674100 version=0.12.6
time=2025-10-19T12:54:40.146+02:00 level=INFO source=ui.go:171 msg=site.serveHTTP http.method=GET http.path=/api/v1/chat/0199fbf4-fbea-7735-85af-904746282010 http.pattern="GET /api/v1/chat/{id}" http.status=200 http.d=15.5374ms request_id=1760871280131187300 version=0.12.6
time=2025-10-19T12:54:42.095+02:00 level=INFO source=ui.go:171 msg=site.serveHTTP http.method=GET http.path=/api/v1/chat/0199fbf4-fbea-7735-85af-904746282010 http.pattern="GET /api/v1/chat/{id}" http.status=200 http.d=507.5µs request_id=1760871282095350400 version=0.12.6
time=2025-10-19T12:54:44.635+02:00 level=INFO source=ui.go:171 msg=site.serveHTTP http.method=GET http.path=/api/v1/settings http.pattern="GET /api/v1/settings" http.status=200 http.d=950.6µs request_id=1760871284634242300 version=0.12.6
time=2025-10-19T12:54:44.639+02:00 level=INFO source=ui.go:171 msg=site.serveHTTP http.method=GET http.path=/api/v1/chat/0199fbf4-fbea-7735-85af-904746282010 http.pattern="GET /api/v1/chat/{id}" http.status=200 http.d=1.9311ms request_id=1760871284637210500 version=0.12.6
time=2025-10-19T12:54:44.646+02:00 level=INFO source=ui.go:171 msg=site.serveHTTP http.method=GET http.path=/api/v1/models http.pattern="GET /api/v1/models" http.status=200 http.d=10.3327ms request_id=1760871284636351800 version=0.12.6
time=2025-10-19T12:57:50.812+02:00 level=ERROR source=ui.go:1178 msg="chat stream error" error="Post "http://127.0.0.1:11434/api/chat": context canceled"
time=2025-10-19T12:57:50.814+02:00 level=INFO source=ui.go:171 msg=site.serveHTTP http.method=POST http.path=/api/v1/chat/0199fbf4-fbea-7735-85af-904746282010 http.pattern="POST /api/v1/chat/{id}" http.status=200 http.d=2m54.1014868s request_id=1760871296712971700 version=0.12.6
time=2025-10-19T12:57:50.878+02:00 level=INFO source=app.go:323 msg="shutting down desktop server"
time=2025-10-19T12:57:50.881+02:00 level=INFO source=app.go:328 msg="shutting down ollama server"
time=2025-10-19T12:57:55.888+02:00 level=WARN source=server.go:132 msg="timeout waiting for graceful shutdown; killing" pid=3832
@Panican-Whyasker commented on GitHub (Oct 19, 2025):
Regarding my earlier comment: That was with a new Ollama installation, on a freshly installed (and updated) Windows 11.
My main laptop with Ollama 0.12.6 (also Windows 11; Intel Core i9-9880H; nVidia Quadro RTX3000 w. 6 GB of VRAM; 128 GB of system RAM) runs the above LLMs normally. There, Ollama was updated to v0.12.6 shortly after that update became available.
@Panican-Whyasker commented on GitHub (Oct 19, 2025):
Update: The issue is not present in Ollama v0.12.3; however, updating to 0.12.6 breaks it.
@samialisayed you may want to download and install version 0.12.3 (it will replace v0.12.6; just don't apply the update back to 0.12.6, which Ollama downloads and offers very soon after installation).
(https://github.com/ollama/ollama/releases)
(https://github.com/ollama/ollama/releases/download/v0.12.3/OllamaSetup.exe)
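For example, a minimal PowerShell sketch using the release URL above (downloading the installer by hand from the releases page works just as well):
# Fetch the v0.12.3 installer and launch it; it replaces the currently installed version
Invoke-WebRequest -Uri "https://github.com/ollama/ollama/releases/download/v0.12.3/OllamaSetup.exe" -OutFile "$env:TEMP\OllamaSetup-0.12.3.exe"
Start-Process "$env:TEMP\OllamaSetup-0.12.3.exe"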
@Panican-Whyasker commented on GitHub (Oct 20, 2025):
@samialisayed: as @rick-github explained in https://github.com/ollama/ollama/issues/12699, Ollama 0.12.3 is not affected; the issue likely starts with 0.12.4.
@Panican-Whyasker commented on GitHub (Oct 20, 2025):
Interestingly, there is no problem with Ollama 0.12.6 on Windows Server 2016 Datacenter (GPU-less).
@rick-github commented on GitHub (Oct 20, 2025):
Thanks for the data point. Based on the initial reports I thought it was due to the lack of GPU, and not widely reported because most users have a GPU. But it seems like there is something else at play.
@samialisayed commented on GitHub (Oct 22, 2025):
@Panican-Whyasker I have installed the lower version and it works. Thank you!
@dhiltgen commented on GitHub (Oct 29, 2025):
Please give 0.12.7 a try and let us know if the issues are resolved. If not, please share an updated log with OLLAMA_DEBUG=2 set so we can take a look.
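On Windows a quick way to do that is to quit the tray app (so port 11434 is free) and run a foreground server from PowerShell, for example:
# Debug/trace logging for this session only; logs print to this console
$env:OLLAMA_DEBUG = "2"
ollama serve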
@katmandoo212 commented on GitHub (Oct 30, 2025):
I gave 0.12.7 a try with OLLAMA_DEBUG=2. I could not get local models to load, but cloud models do work (gpt-oss:20b-cloud, for example). This is my server.log, followed by my app.log. I hope this helps.
time=2025-10-29T21:44:11.691-04:00 level=INFO source=routes.go:1524 msg="server config" env="map[CUDA_VISIBLE_DEVICES: GGML_VK_VISIBLE_DEVICES: GPU_DEVICE_ORDINAL: HIP_VISIBLE_DEVICES: HSA_OVERRIDE_GFX_VERSION: HTTPS_PROXY: HTTP_PROXY: NO_PROXY: OLLAMA_CONTEXT_LENGTH:16384 OLLAMA_DEBUG:DEBUG-4 OLLAMA_FLASH_ATTENTION:false OLLAMA_GPU_OVERHEAD:0 OLLAMA_HOST:http://127.0.0.1:11434 OLLAMA_INTEL_GPU:false OLLAMA_KEEP_ALIVE:5m0s OLLAMA_KV_CACHE_TYPE: OLLAMA_LLM_LIBRARY: OLLAMA_LOAD_TIMEOUT:5m0s OLLAMA_MAX_LOADED_MODELS:0 OLLAMA_MAX_QUEUE:512 OLLAMA_MODELS:G:\OllamaFiles\models OLLAMA_MULTIUSER_CACHE:false OLLAMA_NEW_ENGINE:false OLLAMA_NOHISTORY:false OLLAMA_NOPRUNE:false OLLAMA_NUM_PARALLEL:1 OLLAMA_ORIGINS:[http://localhost https://localhost http://localhost:* https://localhost:* http://127.0.0.1 https://127.0.0.1 http://127.0.0.1:* https://127.0.0.1:* http://0.0.0.0 https://0.0.0.0 http://0.0.0.0:* https://0.0.0.0:* app://* file://* tauri://* vscode-webview://* vscode-file://*] OLLAMA_REMOTES:[ollama.com] OLLAMA_SCHED_SPREAD:false ROCR_VISIBLE_DEVICES:]"
time=2025-10-29T21:44:11.745-04:00 level=INFO source=images.go:522 msg="total blobs: 349"
time=2025-10-29T21:44:11.776-04:00 level=INFO source=images.go:529 msg="total unused blobs removed: 0"
time=2025-10-29T21:44:11.786-04:00 level=INFO source=routes.go:1577 msg="Listening on 127.0.0.1:11434 (version 0.12.7)"
time=2025-10-29T21:44:11.786-04:00 level=DEBUG source=sched.go:120 msg="starting llm scheduler"
time=2025-10-29T21:44:11.788-04:00 level=INFO source=runner.go:76 msg="discovering available GPUs..."
time=2025-10-29T21:44:11.788-04:00 level=TRACE source=runner.go:471 msg="starting runner for device discovery" libDirs="[C:\Users\User\AppData\Local\Programs\Ollama\lib\ollama C:\Users\User\AppData\Local\Programs\Ollama\lib\ollama\cuda_v12]" extraEnvs=map[]
time=2025-10-29T21:44:11.801-04:00 level=INFO source=server.go:385 msg="starting runner" cmd="C:\Users\User\AppData\Local\Programs\Ollama\ollama.exe runner --ollama-engine --port 60946"
time=2025-10-29T21:44:11.802-04:00 level=DEBUG source=server.go:386 msg=subprocess OLLAMA_MODELS=G:\OllamaFiles\models OLLAMA_DEBUG=2 OLLAMA_CONTEXT_LENGTH=16384 OLLAMA_AUTO_UPDATE=false OLLAMA_RAGTEMP=C:\OllamaRAGTemp PATH="C:\Users\User\AppData\Local\Programs\Ollama\lib\ollama;C:\Users\User\AppData\Local\Programs\Ollama\lib\ollama\cuda_v12;C:\Program Files\PowerShell\7;C:\Program Files (x86)\oh-my-posh\bin\;C:\Users\User\Downloads\vcmake\vcpkg\installed\x64_windows;C:\Program Files\Common Files\Oracle\Java\javapath;C:\ActiveTcl\bin;C:\Program Files\Microsoft MPI\Bin\;C:\Users\User\AppData\Local\Programs\Python\Python314\Scripts;C:\Users\User\AppData\Local\Programs\Python\Python314;C:\Users\User\AppData\Local\Programs\Python\Python314\tct\tcl8.6;C:\Users\User\AppData\Local\Programs\Python\Python314\tcl;C:\Users\User\AppData\Local\Programs\Python\Python313\Scripts;C:\Users\User\AppData\Local\Programs\Python\Python313;C:\Users\User\AppData\Local\Programs\Python\Python313\tcl\tcl8.6;C:\Users\User\AppData\Local\Programs\Python\Python313\tcl;C:\Program Files\OpenSSL\bin;C:\Users\User\AppData\Roaming\ActiveState\bin;C:\Windows\system32;C:\Windows;C:\Windows\System32\Wbem;C:\Windows\System32\WindowsPowerShell\v1.0\;C:\Program Files\Graphviz\bin;C:\Windows\System32\OpenSSH\;C:\Program Files\Microsoft SQL Server\130\Tools\Binn\;C:\Program Files\Microsoft SQL Server\Client SDK\ODBC\170\Tools\Binn\;C:\Program Files\dotnet\;C:\Program Files\Microsoft SQL Server\150\Tools\Binn\;C:\Program Files (x86)\Microsoft SQL Server\150\DTS\Binn\;C:\Program Files\Microsoft SQL Server\150\DTS\Binn\;C:\Program Files (x86)\Microsoft SQL Server\150\Tools\Binn\;C:\Program Files\Azure Data Studio\bin;C:\WINDOWS\system32;C:\WINDOWS;C:\WINDOWS\System32\Wbem;C:\WINDOWS\System32\WindowsPowerShell\v1.0\;C:\WINDOWS\System32\OpenSSH\;C:\ProgramData\chocolatey\bin;C:\Program Files\Java\jdk-21\bin;C:\Program Files\NASM;C:\Program Files\Microsoft VS Code\bin;C:\Program Files\gs\gs10.03.0\bin;C:\Program Files (x86)\Microsoft SQL Server\160\DTS\Binn\;C:\Program Files\PuTTY\;C:\Program Files\RedHat\Podman\;C:\TDM-GCC-64\bin;D:\home\blt\github\vcpkg;C:\Program Files\CMake\bin;C:\Program Files\nodejs\;C:\Program Files\Go\bin;C:\Program Files\Pandoc\;C:\Program Files\Docker\Docker\resources\bin;C:\Program Files\PowerShell\7\;C:\Program Files (x86)\Windows Kits\10\Windows Performance Toolkit\;C:\Program Files\Git\cmd;C:\Users\User\AppData\Local\Programs\oh-my-posh\bin\;C:\Users\User\.local\bin;C:\Users\User\AppData\Local\Programs\Ollama;C:\Users\User\Downloads\vcmake\vcpkg\installed\x64_windows;C:\Users\User\.cargo\bin;C:\Program Files\OpenSSL\bin;C:\Users\User\AppData\Local\Microsoft\WindowsApps;C:\Program Files\Azure Data Studio\bin;C:\Program Files\PostgreSQL\15\bin;C:\Users\User\AppData\Local\GitHubDesktop\bin;C:\Users\User\Downloads\ffmpeg-master-latest-win64-gpl\ffmpeg-master-latest-win64-gpl\bin;C:\Program Files\Graphviz\bin;c:\Program Files\zig;c:\users\user\.local\bin;C:\Program Files (x86)\Intel\oneAPI;C:\Users\User\go\bin;C:\Users\User\.lmstudio\bin;C:\Users\User\.dotnet\tools;C:\Users\User\AppData\Local\Programs\Windsurf\bin;C:\Users\User\AppData\Local\reflex\bun\bin;C:\Users\User\AppData\Local\Programs\MiKTeX\miktex\bin\x64\;C:\Users\User\AppData\Roaming\npm;C:\Users\User\go\bin;C:\Users\User\AppData\Local\PowerToys\" OLLAMA_LIBRARY_PATH=C:\Users\User\AppData\Local\Programs\Ollama\lib\ollama;C:\Users\User\AppData\Local\Programs\Ollama\lib\ollama\cuda_v12
time=2025-10-29T21:44:11.867-04:00 level=INFO source=runner.go:1337 msg="starting ollama engine"
time=2025-10-29T21:44:11.868-04:00 level=INFO source=runner.go:1372 msg="Server listening on 127.0.0.1:60946"
time=2025-10-29T21:44:11.879-04:00 level=DEBUG source=gguf.go:590 msg=general.architecture type=string
time=2025-10-29T21:44:11.879-04:00 level=DEBUG source=gguf.go:590 msg=tokenizer.ggml.model type=string
time=2025-10-29T21:44:11.879-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=general.alignment default=32
time=2025-10-29T21:44:11.880-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=general.alignment default=32
time=2025-10-29T21:44:11.881-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=general.file_type default=0
time=2025-10-29T21:44:11.881-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=general.name default=""
time=2025-10-29T21:44:11.881-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=general.description default=""
time=2025-10-29T21:44:11.881-04:00 level=INFO source=ggml.go:135 msg="" architecture=llama file_type=unknown name="" description="" num_tensors=0 num_key_values=3
time=2025-10-29T21:44:11.881-04:00 level=DEBUG source=ggml.go:94 msg="ggml backend load all from path" path=C:\Users\User\AppData\Local\Programs\Ollama\lib\ollama
load_backend: loaded CPU backend from C:\Users\User\AppData\Local\Programs\Ollama\lib\ollama\ggml-cpu-haswell.dll
time=2025-10-29T21:44:11.913-04:00 level=DEBUG source=ggml.go:94 msg="ggml backend load all from path" path=C:\Users\User\AppData\Local\Programs\Ollama\lib\ollama\cuda_v12
dl_load_library unable to load library C:\Users\User\AppData\Local\Programs\Ollama\lib\ollama\cuda_v12\ggml-cuda.dll: The specified module could not be found.
time=2025-10-29T21:44:12.094-04:00 level=INFO source=ggml.go:104 msg=system CPU.0.SSE3=1 CPU.0.SSSE3=1 CPU.0.AVX=1 CPU.0.AVX2=1 CPU.0.F16C=1 CPU.0.FMA=1 CPU.0.BMI2=1 CPU.0.LLAMAFILE=1 CPU.1.LLAMAFILE=1 compiler=cgo(clang)
time=2025-10-29T21:44:12.094-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=llama.block_count default=0
time=2025-10-29T21:44:12.094-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=llama.pooling_type default=0
time=2025-10-29T21:44:12.094-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=llama.expert_count default=0
time=2025-10-29T21:44:12.094-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=tokenizer.ggml.tokens default="&{size:0 values:[]}"
time=2025-10-29T21:44:12.094-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=tokenizer.ggml.scores default="&{size:0 values:[]}"
time=2025-10-29T21:44:12.094-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=tokenizer.ggml.token_type default="&{size:0 values:[]}"
time=2025-10-29T21:44:12.094-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=tokenizer.ggml.merges default="&{size:0 values:[]}"
time=2025-10-29T21:44:12.094-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=tokenizer.ggml.add_bos_token default=true
time=2025-10-29T21:44:12.094-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=tokenizer.ggml.bos_token_id default=0
time=2025-10-29T21:44:12.094-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=tokenizer.ggml.add_eos_token default=false
time=2025-10-29T21:44:12.094-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=tokenizer.ggml.eos_token_id default=0
time=2025-10-29T21:44:12.094-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=tokenizer.ggml.eos_token_ids default="&{size:0 values:[]}"
time=2025-10-29T21:44:12.094-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=tokenizer.ggml.pre default=""
time=2025-10-29T21:44:12.094-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=llama.block_count default=0
time=2025-10-29T21:44:12.094-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=llama.embedding_length default=0
time=2025-10-29T21:44:12.094-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=llama.attention.head_count default=0
time=2025-10-29T21:44:12.094-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=llama.attention.head_count_kv default=0
time=2025-10-29T21:44:12.094-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=llama.attention.key_length default=0
time=2025-10-29T21:44:12.094-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=llama.rope.dimension_count default=0
time=2025-10-29T21:44:12.094-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=llama.attention.layer_norm_rms_epsilon default=0
time=2025-10-29T21:44:12.094-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=llama.rope.freq_base default=100000
time=2025-10-29T21:44:12.094-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=llama.rope.scaling.factor default=1
time=2025-10-29T21:44:12.094-04:00 level=DEBUG source=runner.go:1312 msg="dummy model load took" duration=216.4957ms
time=2025-10-29T21:44:12.094-04:00 level=DEBUG source=runner.go:1317 msg="gathering device infos took" duration=0s
time=2025-10-29T21:44:12.096-04:00 level=TRACE source=runner.go:498 msg="runner enumerated devices" OLLAMA_LIBRARY_PATH="[C:\Users\User\AppData\Local\Programs\Ollama\lib\ollama C:\Users\User\AppData\Local\Programs\Ollama\lib\ollama\cuda_v12]" devices=[]
time=2025-10-29T21:44:12.096-04:00 level=DEBUG source=runner.go:468 msg="bootstrap discovery took" duration=308.1172ms OLLAMA_LIBRARY_PATH="[C:\Users\User\AppData\Local\Programs\Ollama\lib\ollama C:\Users\User\AppData\Local\Programs\Ollama\lib\ollama\cuda_v12]" extra_envs=map[]
time=2025-10-29T21:44:12.096-04:00 level=TRACE source=runner.go:471 msg="starting runner for device discovery" libDirs="[C:\Users\User\AppData\Local\Programs\Ollama\lib\ollama C:\Users\User\AppData\Local\Programs\Ollama\lib\ollama\cuda_v13]" extraEnvs=map[]
time=2025-10-29T21:44:12.099-04:00 level=INFO source=server.go:385 msg="starting runner" cmd="C:\Users\User\AppData\Local\Programs\Ollama\ollama.exe runner --ollama-engine --port 60955"
time=2025-10-29T21:44:12.099-04:00 level=DEBUG source=server.go:386 msg=subprocess OLLAMA_MODELS=G:\OllamaFiles\models OLLAMA_DEBUG=2 OLLAMA_CONTEXT_LENGTH=16384 OLLAMA_AUTO_UPDATE=false OLLAMA_RAGTEMP=C:\OllamaRAGTemp PATH="C:\Users\User\AppData\Local\Programs\Ollama\lib\ollama;C:\Users\User\AppData\Local\Programs\Ollama\lib\ollama\cuda_v13;C:\Program Files\PowerShell\7;C:\Program Files (x86)\oh-my-posh\bin\;C:\Users\User\Downloads\vcmake\vcpkg\installed\x64_windows;C:\Program Files\Common Files\Oracle\Java\javapath;C:\ActiveTcl\bin;C:\Program Files\Microsoft MPI\Bin\;C:\Users\User\AppData\Local\Programs\Python\Python314\Scripts;C:\Users\User\AppData\Local\Programs\Python\Python314;C:\Users\User\AppData\Local\Programs\Python\Python314\tct\tcl8.6;C:\Users\User\AppData\Local\Programs\Python\Python314\tcl;C:\Users\User\AppData\Local\Programs\Python\Python313\Scripts;C:\Users\User\AppData\Local\Programs\Python\Python313;C:\Users\User\AppData\Local\Programs\Python\Python313\tcl\tcl8.6;C:\Users\User\AppData\Local\Programs\Python\Python313\tcl;C:\Program Files\OpenSSL\bin;C:\Users\User\AppData\Roaming\ActiveState\bin;C:\Windows\system32;C:\Windows;C:\Windows\System32\Wbem;C:\Windows\System32\WindowsPowerShell\v1.0\;C:\Program Files\Graphviz\bin;C:\Windows\System32\OpenSSH\;C:\Program Files\Microsoft SQL Server\130\Tools\Binn\;C:\Program Files\Microsoft SQL Server\Client SDK\ODBC\170\Tools\Binn\;C:\Program Files\dotnet\;C:\Program Files\Microsoft SQL Server\150\Tools\Binn\;C:\Program Files (x86)\Microsoft SQL Server\150\DTS\Binn\;C:\Program Files\Microsoft SQL Server\150\DTS\Binn\;C:\Program Files (x86)\Microsoft SQL Server\150\Tools\Binn\;C:\Program Files\Azure Data Studio\bin;C:\WINDOWS\system32;C:\WINDOWS;C:\WINDOWS\System32\Wbem;C:\WINDOWS\System32\WindowsPowerShell\v1.0\;C:\WINDOWS\System32\OpenSSH\;C:\ProgramData\chocolatey\bin;C:\Program Files\Java\jdk-21\bin;C:\Program Files\NASM;C:\Program Files\Microsoft VS Code\bin;C:\Program Files\gs\gs10.03.0\bin;C:\Program Files (x86)\Microsoft SQL Server\160\DTS\Binn\;C:\Program Files\PuTTY\;C:\Program Files\RedHat\Podman\;C:\TDM-GCC-64\bin;D:\home\blt\github\vcpkg;C:\Program Files\CMake\bin;C:\Program Files\nodejs\;C:\Program Files\Go\bin;C:\Program Files\Pandoc\;C:\Program Files\Docker\Docker\resources\bin;C:\Program Files\PowerShell\7\;C:\Program Files (x86)\Windows Kits\10\Windows Performance Toolkit\;C:\Program Files\Git\cmd;C:\Users\User\AppData\Local\Programs\oh-my-posh\bin\;C:\Users\User\.local\bin;C:\Users\User\AppData\Local\Programs\Ollama;C:\Users\User\Downloads\vcmake\vcpkg\installed\x64_windows;C:\Users\User\.cargo\bin;C:\Program Files\OpenSSL\bin;C:\Users\User\AppData\Local\Microsoft\WindowsApps;C:\Program Files\Azure Data Studio\bin;C:\Program Files\PostgreSQL\15\bin;C:\Users\User\AppData\Local\GitHubDesktop\bin;C:\Users\User\Downloads\ffmpeg-master-latest-win64-gpl\ffmpeg-master-latest-win64-gpl\bin;C:\Program Files\Graphviz\bin;c:\Program Files\zig;c:\users\user\.local\bin;C:\Program Files (x86)\Intel\oneAPI;C:\Users\User\go\bin;C:\Users\User\.lmstudio\bin;C:\Users\User\.dotnet\tools;C:\Users\User\AppData\Local\Programs\Windsurf\bin;C:\Users\User\AppData\Local\reflex\bun\bin;C:\Users\User\AppData\Local\Programs\MiKTeX\miktex\bin\x64\;C:\Users\User\AppData\Roaming\npm;C:\Users\User\go\bin;C:\Users\User\AppData\Local\PowerToys\" OLLAMA_LIBRARY_PATH=C:\Users\User\AppData\Local\Programs\Ollama\lib\ollama;C:\Users\User\AppData\Local\Programs\Ollama\lib\ollama\cuda_v13
time=2025-10-29T21:44:12.161-04:00 level=INFO source=runner.go:1337 msg="starting ollama engine"
time=2025-10-29T21:44:12.162-04:00 level=INFO source=runner.go:1372 msg="Server listening on 127.0.0.1:60955"
time=2025-10-29T21:44:12.166-04:00 level=DEBUG source=gguf.go:590 msg=general.architecture type=string
time=2025-10-29T21:44:12.166-04:00 level=DEBUG source=gguf.go:590 msg=tokenizer.ggml.model type=string
time=2025-10-29T21:44:12.167-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=general.alignment default=32
time=2025-10-29T21:44:12.168-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=general.alignment default=32
time=2025-10-29T21:44:12.168-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=general.file_type default=0
time=2025-10-29T21:44:12.168-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=general.name default=""
time=2025-10-29T21:44:12.168-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=general.description default=""
time=2025-10-29T21:44:12.169-04:00 level=INFO source=ggml.go:135 msg="" architecture=llama file_type=unknown name="" description="" num_tensors=0 num_key_values=3
time=2025-10-29T21:44:12.169-04:00 level=DEBUG source=ggml.go:94 msg="ggml backend load all from path" path=C:\Users\User\AppData\Local\Programs\Ollama\lib\ollama
load_backend: loaded CPU backend from C:\Users\User\AppData\Local\Programs\Ollama\lib\ollama\ggml-cpu-haswell.dll
time=2025-10-29T21:44:12.197-04:00 level=DEBUG source=ggml.go:94 msg="ggml backend load all from path" path=C:\Users\User\AppData\Local\Programs\Ollama\lib\ollama\cuda_v13
ggml_cuda_init: failed to initialize CUDA: (null)
load_backend: loaded CUDA backend from C:\Users\User\AppData\Local\Programs\Ollama\lib\ollama\cuda_v13\ggml-cuda.dll
time=2025-10-29T21:44:12.283-04:00 level=INFO source=ggml.go:104 msg=system CPU.0.SSE3=1 CPU.0.SSSE3=1 CPU.0.AVX=1 CPU.0.AVX2=1 CPU.0.F16C=1 CPU.0.FMA=1 CPU.0.BMI2=1 CPU.0.LLAMAFILE=1 CPU.1.LLAMAFILE=1 compiler=cgo(clang)
time=2025-10-29T21:44:12.284-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=llama.block_count default=0
time=2025-10-29T21:44:12.284-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=llama.pooling_type default=0
time=2025-10-29T21:44:12.284-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=llama.expert_count default=0
time=2025-10-29T21:44:12.284-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=tokenizer.ggml.tokens default="&{size:0 values:[]}"
time=2025-10-29T21:44:12.284-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=tokenizer.ggml.scores default="&{size:0 values:[]}"
time=2025-10-29T21:44:12.284-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=tokenizer.ggml.token_type default="&{size:0 values:[]}"
time=2025-10-29T21:44:12.284-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=tokenizer.ggml.merges default="&{size:0 values:[]}"
time=2025-10-29T21:44:12.284-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=tokenizer.ggml.add_bos_token default=true
time=2025-10-29T21:44:12.284-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=tokenizer.ggml.bos_token_id default=0
time=2025-10-29T21:44:12.284-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=tokenizer.ggml.add_eos_token default=false
time=2025-10-29T21:44:12.284-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=tokenizer.ggml.eos_token_id default=0
time=2025-10-29T21:44:12.284-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=tokenizer.ggml.eos_token_ids default="&{size:0 values:[]}"
time=2025-10-29T21:44:12.284-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=tokenizer.ggml.pre default=""
time=2025-10-29T21:44:12.285-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=llama.block_count default=0
time=2025-10-29T21:44:12.285-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=llama.embedding_length default=0
time=2025-10-29T21:44:12.285-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=llama.attention.head_count default=0
time=2025-10-29T21:44:12.285-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=llama.attention.head_count_kv default=0
time=2025-10-29T21:44:12.285-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=llama.attention.key_length default=0
time=2025-10-29T21:44:12.285-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=llama.rope.dimension_count default=0
time=2025-10-29T21:44:12.285-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=llama.attention.layer_norm_rms_epsilon default=0
time=2025-10-29T21:44:12.285-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=llama.rope.freq_base default=100000
time=2025-10-29T21:44:12.285-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=llama.rope.scaling.factor default=1
time=2025-10-29T21:44:12.285-04:00 level=DEBUG source=runner.go:1312 msg="dummy model load took" duration=119.2606ms
time=2025-10-29T21:44:12.285-04:00 level=DEBUG source=runner.go:1317 msg="gathering device infos took" duration=0s
time=2025-10-29T21:44:12.285-04:00 level=TRACE source=runner.go:498 msg="runner enumerated devices" OLLAMA_LIBRARY_PATH="[C:\Users\User\AppData\Local\Programs\Ollama\lib\ollama C:\Users\User\AppData\Local\Programs\Ollama\lib\ollama\cuda_v13]" devices=[]
time=2025-10-29T21:44:12.286-04:00 level=DEBUG source=runner.go:468 msg="bootstrap discovery took" duration=189.4849ms OLLAMA_LIBRARY_PATH="[C:\Users\User\AppData\Local\Programs\Ollama\lib\ollama C:\Users\User\AppData\Local\Programs\Ollama\lib\ollama\cuda_v13]" extra_envs=map[]
time=2025-10-29T21:44:12.286-04:00 level=TRACE source=runner.go:471 msg="starting runner for device discovery" libDirs="[C:\Users\User\AppData\Local\Programs\Ollama\lib\ollama C:\Users\User\AppData\Local\Programs\Ollama\lib\ollama\rocm]" extraEnvs=map[]
time=2025-10-29T21:44:12.288-04:00 level=INFO source=server.go:385 msg="starting runner" cmd="C:\Users\User\AppData\Local\Programs\Ollama\ollama.exe runner --ollama-engine --port 60962"
time=2025-10-29T21:44:12.288-04:00 level=DEBUG source=server.go:386 msg=subprocess OLLAMA_MODELS=G:\OllamaFiles\models OLLAMA_DEBUG=2 OLLAMA_CONTEXT_LENGTH=16384 OLLAMA_AUTO_UPDATE=false OLLAMA_RAGTEMP=C:\OllamaRAGTemp PATH="C:\Users\User\AppData\Local\Programs\Ollama\lib\ollama;C:\Users\User\AppData\Local\Programs\Ollama\lib\ollama\rocm;C:\Program Files\PowerShell\7;C:\Program Files (x86)\oh-my-posh\bin\;C:\Users\User\Downloads\vcmake\vcpkg\installed\x64_windows;C:\Program Files\Common Files\Oracle\Java\javapath;C:\ActiveTcl\bin;C:\Program Files\Microsoft MPI\Bin\;C:\Users\User\AppData\Local\Programs\Python\Python314\Scripts;C:\Users\User\AppData\Local\Programs\Python\Python314;C:\Users\User\AppData\Local\Programs\Python\Python314\tct\tcl8.6;C:\Users\User\AppData\Local\Programs\Python\Python314\tcl;C:\Users\User\AppData\Local\Programs\Python\Python313\Scripts;C:\Users\User\AppData\Local\Programs\Python\Python313;C:\Users\User\AppData\Local\Programs\Python\Python313\tcl\tcl8.6;C:\Users\User\AppData\Local\Programs\Python\Python313\tcl;C:\Program Files\OpenSSL\bin;C:\Users\User\AppData\Roaming\ActiveState\bin;C:\Windows\system32;C:\Windows;C:\Windows\System32\Wbem;C:\Windows\System32\WindowsPowerShell\v1.0\;C:\Program Files\Graphviz\bin;C:\Windows\System32\OpenSSH\;C:\Program Files\Microsoft SQL Server\130\Tools\Binn\;C:\Program Files\Microsoft SQL Server\Client SDK\ODBC\170\Tools\Binn\;C:\Program Files\dotnet\;C:\Program Files\Microsoft SQL Server\150\Tools\Binn\;C:\Program Files (x86)\Microsoft SQL Server\150\DTS\Binn\;C:\Program Files\Microsoft SQL Server\150\DTS\Binn\;C:\Program Files (x86)\Microsoft SQL Server\150\Tools\Binn\;C:\Program Files\Azure Data Studio\bin;C:\WINDOWS\system32;C:\WINDOWS;C:\WINDOWS\System32\Wbem;C:\WINDOWS\System32\WindowsPowerShell\v1.0\;C:\WINDOWS\System32\OpenSSH\;C:\ProgramData\chocolatey\bin;C:\Program Files\Java\jdk-21\bin;C:\Program Files\NASM;C:\Program Files\Microsoft VS Code\bin;C:\Program Files\gs\gs10.03.0\bin;C:\Program Files (x86)\Microsoft SQL Server\160\DTS\Binn\;C:\Program Files\PuTTY\;C:\Program Files\RedHat\Podman\;C:\TDM-GCC-64\bin;D:\home\blt\github\vcpkg;C:\Program Files\CMake\bin;C:\Program Files\nodejs\;C:\Program Files\Go\bin;C:\Program Files\Pandoc\;C:\Program Files\Docker\Docker\resources\bin;C:\Program Files\PowerShell\7\;C:\Program Files (x86)\Windows Kits\10\Windows Performance Toolkit\;C:\Program Files\Git\cmd;C:\Users\User\AppData\Local\Programs\oh-my-posh\bin\;C:\Users\User\.local\bin;C:\Users\User\AppData\Local\Programs\Ollama;C:\Users\User\Downloads\vcmake\vcpkg\installed\x64_windows;C:\Users\User\.cargo\bin;C:\Program Files\OpenSSL\bin;C:\Users\User\AppData\Local\Microsoft\WindowsApps;C:\Program Files\Azure Data Studio\bin;C:\Program Files\PostgreSQL\15\bin;C:\Users\User\AppData\Local\GitHubDesktop\bin;C:\Users\User\Downloads\ffmpeg-master-latest-win64-gpl\ffmpeg-master-latest-win64-gpl\bin;C:\Program Files\Graphviz\bin;c:\Program Files\zig;c:\users\user\.local\bin;C:\Program Files (x86)\Intel\oneAPI;C:\Users\User\go\bin;C:\Users\User\.lmstudio\bin;C:\Users\User\.dotnet\tools;C:\Users\User\AppData\Local\Programs\Windsurf\bin;C:\Users\User\AppData\Local\reflex\bun\bin;C:\Users\User\AppData\Local\Programs\MiKTeX\miktex\bin\x64\;C:\Users\User\AppData\Roaming\npm;C:\Users\User\go\bin;C:\Users\User\AppData\Local\PowerToys\" OLLAMA_LIBRARY_PATH=C:\Users\User\AppData\Local\Programs\Ollama\lib\ollama;C:\Users\User\AppData\Local\Programs\Ollama\lib\ollama\rocm
time=2025-10-29T21:44:12.364-04:00 level=INFO source=runner.go:1337 msg="starting ollama engine"
time=2025-10-29T21:44:12.366-04:00 level=INFO source=runner.go:1372 msg="Server listening on 127.0.0.1:60962"
time=2025-10-29T21:44:12.375-04:00 level=DEBUG source=gguf.go:590 msg=general.architecture type=string
time=2025-10-29T21:44:12.375-04:00 level=DEBUG source=gguf.go:590 msg=tokenizer.ggml.model type=string
time=2025-10-29T21:44:12.375-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=general.alignment default=32
time=2025-10-29T21:44:12.377-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=general.alignment default=32
time=2025-10-29T21:44:12.377-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=general.file_type default=0
time=2025-10-29T21:44:12.377-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=general.name default=""
time=2025-10-29T21:44:12.377-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=general.description default=""
time=2025-10-29T21:44:12.377-04:00 level=INFO source=ggml.go:135 msg="" architecture=llama file_type=unknown name="" description="" num_tensors=0 num_key_values=3
time=2025-10-29T21:44:12.377-04:00 level=DEBUG source=ggml.go:94 msg="ggml backend load all from path" path=C:\Users\User\AppData\Local\Programs\Ollama\lib\ollama
load_backend: loaded CPU backend from C:\Users\User\AppData\Local\Programs\Ollama\lib\ollama\ggml-cpu-haswell.dll
time=2025-10-29T21:44:12.406-04:00 level=DEBUG source=ggml.go:94 msg="ggml backend load all from path" path=C:\Users\User\AppData\Local\Programs\Ollama\lib\ollama\rocm
ggml_cuda_init: failed to initialize ROCm: no ROCm-capable device is detected
load_backend: loaded ROCm backend from C:\Users\User\AppData\Local\Programs\Ollama\lib\ollama\rocm\ggml-hip.dll
time=2025-10-29T21:44:12.446-04:00 level=INFO source=ggml.go:104 msg=system CPU.0.SSE3=1 CPU.0.SSSE3=1 CPU.0.AVX=1 CPU.0.AVX2=1 CPU.0.F16C=1 CPU.0.FMA=1 CPU.0.BMI2=1 CPU.0.LLAMAFILE=1 CPU.1.LLAMAFILE=1 compiler=cgo(clang)
time=2025-10-29T21:44:12.446-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=llama.block_count default=0
time=2025-10-29T21:44:12.446-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=llama.pooling_type default=0
time=2025-10-29T21:44:12.446-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=llama.expert_count default=0
time=2025-10-29T21:44:12.446-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=tokenizer.ggml.tokens default="&{size:0 values:[]}"
time=2025-10-29T21:44:12.447-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=tokenizer.ggml.scores default="&{size:0 values:[]}"
time=2025-10-29T21:44:12.447-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=tokenizer.ggml.token_type default="&{size:0 values:[]}"
time=2025-10-29T21:44:12.447-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=tokenizer.ggml.merges default="&{size:0 values:[]}"
time=2025-10-29T21:44:12.447-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=tokenizer.ggml.add_bos_token default=true
time=2025-10-29T21:44:12.447-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=tokenizer.ggml.bos_token_id default=0
time=2025-10-29T21:44:12.447-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=tokenizer.ggml.add_eos_token default=false
time=2025-10-29T21:44:12.447-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=tokenizer.ggml.eos_token_id default=0
time=2025-10-29T21:44:12.447-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=tokenizer.ggml.eos_token_ids default="&{size:0 values:[]}"
time=2025-10-29T21:44:12.447-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=tokenizer.ggml.pre default=""
time=2025-10-29T21:44:12.447-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=llama.block_count default=0
time=2025-10-29T21:44:12.447-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=llama.embedding_length default=0
time=2025-10-29T21:44:12.447-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=llama.attention.head_count default=0
time=2025-10-29T21:44:12.447-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=llama.attention.head_count_kv default=0
time=2025-10-29T21:44:12.447-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=llama.attention.key_length default=0
time=2025-10-29T21:44:12.447-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=llama.rope.dimension_count default=0
time=2025-10-29T21:44:12.447-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=llama.attention.layer_norm_rms_epsilon default=0
time=2025-10-29T21:44:12.447-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=llama.rope.freq_base default=100000
time=2025-10-29T21:44:12.447-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=llama.rope.scaling.factor default=1
time=2025-10-29T21:44:12.447-04:00 level=DEBUG source=runner.go:1312 msg="dummy model load took" duration=72.7312ms
time=2025-10-29T21:44:12.447-04:00 level=DEBUG source=runner.go:1317 msg="gathering device infos took" duration=0s
time=2025-10-29T21:44:12.448-04:00 level=TRACE source=runner.go:498 msg="runner enumerated devices" OLLAMA_LIBRARY_PATH="[C:\Users\User\AppData\Local\Programs\Ollama\lib\ollama C:\Users\User\AppData\Local\Programs\Ollama\lib\ollama\rocm]" devices=[]
time=2025-10-29T21:44:12.448-04:00 level=DEBUG source=runner.go:468 msg="bootstrap discovery took" duration=162.4718ms OLLAMA_LIBRARY_PATH="[C:\Users\User\AppData\Local\Programs\Ollama\lib\ollama C:\Users\User\AppData\Local\Programs\Ollama\lib\ollama\rocm]" extra_envs=map[]
time=2025-10-29T21:44:12.448-04:00 level=DEBUG source=runner.go:120 msg="evluating which if any devices to filter out" initial_count=0
time=2025-10-29T21:44:12.448-04:00 level=TRACE source=runner.go:179 msg="supported GPU library combinations before filtering" supported=map[]
time=2025-10-29T21:44:12.448-04:00 level=DEBUG source=runner.go:41 msg="GPU bootstrap discovery took" duration=661.722ms
time=2025-10-29T21:44:12.449-04:00 level=INFO source=types.go:60 msg="inference compute" id=cpu library=cpu compute="" name=cpu description=cpu libdirs=ollama driver="" pci_id="" type="" total="11.9 GiB" available="6.0 GiB"
time=2025-10-29T21:44:12.449-04:00 level=INFO source=routes.go:1618 msg="entering low vram mode" "total vram"="0 B" threshold="20.0 GiB"
[GIN] 2025/10/29 - 21:44:12 | 200 | 541µs | 127.0.0.1 | HEAD "/"
[GIN] 2025/10/29 - 21:44:12 | 200 | 57.0626ms | 127.0.0.1 | GET "/api/tags"
[GIN] 2025/10/29 - 21:44:24 | 200 | 0s | 127.0.0.1 | GET "/api/version"
[GIN] 2025/10/29 - 21:44:24 | 200 | 0s | 127.0.0.1 | GET "/api/version"
[GIN] 2025/10/29 - 21:44:24 | 200 | 185.0917ms | 127.0.0.1 | GET "/api/tags"
time=2025-10-29T21:44:25.105-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=general.alignment default=32
[GIN] 2025/10/29 - 21:44:25 | 200 | 254.2493ms | 127.0.0.1 | POST "/api/show"
[GIN] 2025/10/29 - 21:44:40 | 200 | 0s | 127.0.0.1 | GET "/api/version"
time=2025-10-29T21:44:40.843-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=general.alignment default=32
[GIN] 2025/10/29 - 21:44:40 | 200 | 250.6383ms | 127.0.0.1 | POST "/api/show"
[GIN] 2025/10/29 - 21:44:40 | 200 | 106.9248ms | 127.0.0.1 | GET "/api/tags"
time=2025-10-29T21:44:41.009-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=general.alignment default=32
[GIN] 2025/10/29 - 21:44:41 | 200 | 166.299ms | 127.0.0.1 | POST "/api/show"
time=2025-10-29T21:44:41.196-04:00 level=DEBUG source=runner.go:267 msg="refreshing free memory"
time=2025-10-29T21:44:41.196-04:00 level=DEBUG source=runner.go:41 msg="overall device VRAM discovery took" duration=0s
[GIN] 2025/10/29 - 21:45:10 | 200 | 0s | 127.0.0.1 | GET "/api/version"
[GIN] 2025/10/29 - 21:45:11 | 200 | 78.7139ms | 127.0.0.1 | GET "/api/tags"
[GIN] 2025/10/29 - 21:45:41 | 200 | 0s | 127.0.0.1 | GET "/api/version"
[GIN] 2025/10/29 - 21:45:41 | 200 | 52.8417ms | 127.0.0.1 | GET "/api/tags"
[GIN] 2025/10/29 - 21:46:11 | 200 | 0s | 127.0.0.1 | GET "/api/version"
[GIN] 2025/10/29 - 21:46:11 | 200 | 74.5929ms | 127.0.0.1 | GET "/api/tags"
[GIN] 2025/10/29 - 21:46:41 | 200 | 0s | 127.0.0.1 | GET "/api/version"
[GIN] 2025/10/29 - 21:46:41 | 200 | 199.5415ms | 127.0.0.1 | GET "/api/tags"
[GIN] 2025/10/29 - 21:47:11 | 200 | 0s | 127.0.0.1 | GET "/api/version"
[GIN] 2025/10/29 - 21:47:11 | 200 | 70.797ms | 127.0.0.1 | GET "/api/tags"
[GIN] 2025/10/29 - 21:47:14 | 200 | 31.5551ms | 127.0.0.1 | POST "/api/show"
[GIN] 2025/10/29 - 21:47:23 | 200 | 33.352ms | 127.0.0.1 | POST "/api/show"
[GIN] 2025/10/29 - 21:47:23 | 200 | 35.7424ms | 127.0.0.1 | POST "/api/show"
[GIN] 2025/10/29 - 21:47:25 | 200 | 1.3314975s | 127.0.0.1 | POST "/api/chat"
[GIN] 2025/10/29 - 21:47:27 | 200 | 814.5542ms | 127.0.0.1 | POST "/api/chat"
[GIN] 2025/10/29 - 21:47:28 | 200 | 1.1260067s | 127.0.0.1 | POST "/api/chat"
[GIN] 2025/10/29 - 21:47:30 | 200 | 1.6694057s | 127.0.0.1 | POST "/api/chat"
[GIN] 2025/10/29 - 21:47:41 | 200 | 0s | 127.0.0.1 | GET "/api/version"
[GIN] 2025/10/29 - 21:47:41 | 200 | 55.0317ms | 127.0.0.1 | GET "/api/tags"
[GIN] 2025/10/29 - 21:48:11 | 200 | 0s | 127.0.0.1 | GET "/api/version"
[GIN] 2025/10/29 - 21:48:11 | 200 | 52.7654ms | 127.0.0.1 | GET "/api/tags"
[GIN] 2025/10/29 - 21:48:41 | 200 | 0s | 127.0.0.1 | GET "/api/version"
[GIN] 2025/10/29 - 21:48:41 | 200 | 49.1216ms | 127.0.0.1 | GET "/api/tags"
[GIN] 2025/10/29 - 21:49:11 | 200 | 0s | 127.0.0.1 | GET "/api/version"
[GIN] 2025/10/29 - 21:49:11 | 200 | 60.5865ms | 127.0.0.1 | GET "/api/tags"
[GIN] 2025/10/29 - 21:49:41 | 200 | 0s | 127.0.0.1 | GET "/api/version"
[GIN] 2025/10/29 - 21:49:41 | 200 | 54.8271ms | 127.0.0.1 | GET "/api/tags"
[GIN] 2025/10/29 - 21:50:11 | 200 | 0s | 127.0.0.1 | GET "/api/version"
[GIN] 2025/10/29 - 21:50:11 | 200 | 58.8494ms | 127.0.0.1 | GET "/api/tags"
[GIN] 2025/10/29 - 21:50:42 | 200 | 0s | 127.0.0.1 | GET "/api/version"
[GIN] 2025/10/29 - 21:50:42 | 200 | 53.4373ms | 127.0.0.1 | GET "/api/tags"
[GIN] 2025/10/29 - 21:51:12 | 200 | 0s | 127.0.0.1 | GET "/api/version"
[GIN] 2025/10/29 - 21:51:12 | 200 | 54.2343ms | 127.0.0.1 | GET "/api/tags"
[GIN] 2025/10/29 - 21:51:42 | 200 | 0s | 127.0.0.1 | GET "/api/version"
[GIN] 2025/10/29 - 21:51:42 | 200 | 59.3495ms | 127.0.0.1 | GET "/api/tags"
app.log:
time=2025-10-29T21:44:10.544-04:00 level=INFO source=app_windows.go:270 msg="starting Ollama" app=C:\Users\User\AppData\Local\Programs\Ollama version=0.12.7 OS=Windows/10.0.19045
time=2025-10-29T21:44:10.546-04:00 level=INFO source=app.go:231 msg="initialized tools registry" tool_count=0
time=2025-10-29T21:44:10.587-04:00 level=INFO source=app.go:246 msg="starting ollama server"
time=2025-10-29T21:44:10.931-04:00 level=INFO source=app.go:275 msg="starting ui server" port=60942
time=2025-10-29T21:44:10.953-04:00 level=INFO source=app.go:336 msg="deferring pending update for fast startup"
time=2025-10-29T21:44:13.931-04:00 level=INFO source=updater.go:252 msg="beginning update checker" interval=1h0m0s
time=2025-10-29T21:44:24.714-04:00 level=INFO source=ui.go:168 msg=site.serveHTTP http.method=GET http.path=/api/v1/settings http.pattern="GET /api/v1/settings" http.status=200 http.d=0s request_id=1761788664714383500 version=0.12.7
time=2025-10-29T21:44:24.719-04:00 level=INFO source=ui.go:168 msg=site.serveHTTP http.method=GET http.path=/api/v1/me http.pattern="GET /api/v1/me" http.status=200 http.d=84.5648ms request_id=1761788664634994100 version=0.12.7
time=2025-10-29T21:44:24.733-04:00 level=INFO source=ui.go:168 msg=site.serveHTTP http.method=GET http.path=/api/v1/health http.pattern="GET /api/v1/health" http.status=200 http.d=11.0824ms request_id=1761788664722425600 version=0.12.7
time=2025-10-29T21:44:24.749-04:00 level=INFO source=ui.go:168 msg=site.serveHTTP http.method=GET http.path=/api/v1/chats http.pattern="GET /api/v1/chats" http.status=200 http.d=24.2975ms request_id=1761788664725422700 version=0.12.7
time=2025-10-29T21:44:24.762-04:00 level=INFO source=server.go:343 msg=Matched "inference compute"="{Library:cpu Variant: Compute: Driver: Name:cpu VRAM:11.9 GiB}"
time=2025-10-29T21:44:24.762-04:00 level=INFO source=ui.go:168 msg=site.serveHTTP http.method=GET http.path=/api/v1/inference-compute http.pattern="GET /api/v1/inference-compute" http.status=200 http.d=43.6747ms request_id=1761788664718542400 version=0.12.7
time=2025-10-29T21:44:24.915-04:00 level=INFO source=ui.go:168 msg=site.serveHTTP http.method=GET http.path=/api/v1/models http.pattern="GET /api/v1/models" http.status=200 http.d=199.1878ms request_id=1761788664716368200 version=0.12.7
time=2025-10-29T21:44:24.984-04:00 level=INFO source=ui.go:168 msg=site.serveHTTP http.method=GET http.path=/api/v1/chats http.pattern="GET /api/v1/chats" http.status=200 http.d=25.1705ms request_id=1761788664959127700 version=0.12.7
time=2025-10-29T21:44:25.117-04:00 level=INFO source=ui.go:168 msg=site.serveHTTP http.method=GET http.path=/api/v1/model/qwen3:4b/capabilities http.pattern="GET /api/v1/model/{model}/capabilities" http.status=200 http.d=264.7715ms request_id=1761788664853051300 version=0.12.7
time=2025-10-29T21:44:25.399-04:00 level=INFO source=ui.go:168 msg=site.serveHTTP http.method=POST http.path=/api/v1/model/upstream http.pattern="POST /api/v1/model/upstream" http.status=200 http.d=162.7553ms request_id=1761788665236699300 version=0.12.7
time=2025-10-29T21:44:40.658-04:00 level=INFO source=ui.go:168 msg=site.serveHTTP http.method=GET http.path=/api/v1/chats http.pattern="GET /api/v1/chats" http.status=200 http.d=23.821ms request_id=1761788680635163500 version=0.12.7
time=2025-10-29T21:44:40.767-04:00 level=INFO source=ui.go:168 msg=site.serveHTTP http.method=GET http.path=/api/v1/settings http.pattern="GET /api/v1/settings" http.status=200 http.d=1.3556ms request_id=1761788680766282800 version=0.12.7
time=2025-10-29T21:44:40.878-04:00 level=INFO source=ui.go:168 msg=site.serveHTTP http.method=GET http.path=/api/v1/models http.pattern="GET /api/v1/models" http.status=200 http.d=111.7008ms request_id=1761788680767101300 version=0.12.7
time=2025-10-29T21:45:11.007-04:00 level=INFO source=ui.go:168 msg=site.serveHTTP http.method=GET http.path=/api/v1/models http.pattern="GET /api/v1/models" http.status=200 http.d=86.2728ms request_id=1761788710921244800 version=0.12.7
time=2025-10-29T21:45:41.072-04:00 level=INFO source=ui.go:168 msg=site.serveHTTP http.method=GET http.path=/api/v1/models http.pattern="GET /api/v1/models" http.status=200 http.d=58.9496ms request_id=1761788741013530600 version=0.12.7
time=2025-10-29T21:46:11.160-04:00 level=INFO source=ui.go:168 msg=site.serveHTTP http.method=GET http.path=/api/v1/models http.pattern="GET /api/v1/models" http.status=200 http.d=81.4457ms request_id=1761788771078929500 version=0.12.7
time=2025-10-29T21:46:41.391-04:00 level=INFO source=ui.go:168 msg=site.serveHTTP http.method=GET http.path=/api/v1/models http.pattern="GET /api/v1/models" http.status=200 http.d=212.4071ms request_id=1761788801178638100 version=0.12.7
time=2025-10-29T21:47:08.047-04:00 level=ERROR source=ui.go:1179 msg="chat stream error" error="Post "http://127.0.0.1:11434/api/chat": context canceled"
time=2025-10-29T21:47:08.047-04:00 level=INFO source=ui.go:168 msg=site.serveHTTP http.method=POST http.path=/api/v1/chat/new http.pattern="POST /api/v1/chat/{id}" http.status=200 http.d=2m27.4542528s request_id=1761788680592933600 version=0.12.7
time=2025-10-29T21:47:11.535-04:00 level=INFO source=ui.go:168 msg=site.serveHTTP http.method=GET http.path=/api/v1/models http.pattern="GET /api/v1/models" http.status=200 http.d=79.1006ms request_id=1761788831456019700 version=0.12.7
time=2025-10-29T21:47:14.028-04:00 level=INFO source=ui.go:168 msg=site.serveHTTP http.method=POST http.path=/api/v1/settings http.pattern="POST /api/v1/settings" http.status=200 http.d=594µs request_id=1761788834028069900 version=0.12.7
time=2025-10-29T21:47:14.035-04:00 level=INFO source=ui.go:168 msg=site.serveHTTP http.method=GET http.path=/api/v1/settings http.pattern="GET /api/v1/settings" http.status=200 http.d=0s request_id=1761788834035388500 version=0.12.7
time=2025-10-29T21:47:14.084-04:00 level=INFO source=ui.go:168 msg=site.serveHTTP http.method=GET http.path=/api/v1/model/gpt-oss:20b-cloud/capabilities http.pattern="GET /api/v1/model/{model}/capabilities" http.status=200 http.d=31.9895ms request_id=1761788834052226600 version=0.12.7
time=2025-10-29T21:47:14.396-04:00 level=INFO source=ui.go:168 msg=site.serveHTTP http.method=POST http.path=/api/v1/model/upstream http.pattern="POST /api/v1/model/upstream" http.status=200 http.d=351.1287ms request_id=1761788834045034700 version=0.12.7
time=2025-10-29T21:47:30.128-04:00 level=INFO source=ui.go:168 msg=site.serveHTTP http.method=POST http.path=/api/v1/chat/019a32c9-d991-770c-882b-2ab0187daa95 http.pattern="POST /api/v1/chat/{id}" http.status=200 http.d=6.3121519s request_id=1761788843815980500 version=0.12.7
time=2025-10-29T21:47:30.138-04:00 level=INFO source=ui.go:168 msg=site.serveHTTP http.method=GET http.path=/api/v1/chat/019a32c9-d991-770c-882b-2ab0187daa95 http.pattern="GET /api/v1/chat/{id}" http.status=200 http.d=9.1821ms request_id=1761788850129584200 version=0.12.7
time=2025-10-29T21:47:41.608-04:00 level=INFO source=ui.go:168 msg=site.serveHTTP http.method=GET http.path=/api/v1/models http.pattern="GET /api/v1/models" http.status=200 http.d=61.4188ms request_id=1761788861546597500 version=0.12.7
time=2025-10-29T21:48:11.680-04:00 level=INFO source=ui.go:168 msg=site.serveHTTP http.method=GET http.path=/api/v1/models http.pattern="GET /api/v1/models" http.status=200 http.d=58.3517ms request_id=1761788891622309800 version=0.12.7
time=2025-10-29T21:48:41.752-04:00 level=INFO source=ui.go:168 msg=site.serveHTTP http.method=GET http.path=/api/v1/models http.pattern="GET /api/v1/models" http.status=200 http.d=51.5636ms request_id=1761788921700600300 version=0.12.7
time=2025-10-29T21:49:11.828-04:00 level=INFO source=ui.go:168 msg=site.serveHTTP http.method=GET http.path=/api/v1/models http.pattern="GET /api/v1/models" http.status=200 http.d=66.4161ms request_id=1761788951762197100 version=0.12.7
time=2025-10-29T21:49:41.898-04:00 level=INFO source=ui.go:168 msg=site.serveHTTP http.method=GET http.path=/api/v1/models http.pattern="GET /api/v1/models" http.status=200 http.d=56.9438ms request_id=1761788981841884900 version=0.12.7
time=2025-10-29T21:50:11.983-04:00 level=INFO source=ui.go:168 msg=site.serveHTTP http.method=GET http.path=/api/v1/models http.pattern="GET /api/v1/models" http.status=200 http.d=65.2983ms request_id=1761789011918117000 version=0.12.7
time=2025-10-29T21:50:42.065-04:00 level=INFO source=ui.go:168 msg=site.serveHTTP http.method=GET http.path=/api/v1/models http.pattern="GET /api/v1/models" http.status=200 http.d=57.7472ms request_id=1761789042007958300 version=0.12.7
time=2025-10-29T21:51:12.137-04:00 level=INFO source=ui.go:168 msg=site.serveHTTP http.method=GET http.path=/api/v1/models http.pattern="GET /api/v1/models" http.status=200 http.d=56.72ms request_id=1761789072080460500 version=0.12.7
time=2025-10-29T21:51:42.216-04:00 level=INFO source=ui.go:168 msg=site.serveHTTP http.method=GET http.path=/api/v1/models http.pattern="GET /api/v1/models" http.status=200 http.d=63.3792ms request_id=1761789102152736300 version=0.12.7
time=2025-10-29T21:52:12.303-04:00 level=INFO source=ui.go:168 msg=site.serveHTTP http.method=GET http.path=/api/v1/models http.pattern="GET /api/v1/models" http.status=200 http.d=71.2135ms request_id=1761789132232153000 version=0.12.7
time=2025-10-29T21:52:42.392-04:00 level=INFO source=ui.go:168 msg=site.serveHTTP http.method=GET http.path=/api/v1/models http.pattern="GET /api/v1/models" http.status=200 http.d=65.8005ms request_id=1761789162327049800 version=0.12.7
@Panican-Whyasker commented on GitHub (Oct 31, 2025):
Waiting forever to load a 0.6B model with Ollama 0.12.7 on a Windows 11 Pro laptop with a 5th-gen Core i7 CPU, 8 GB of RAM, and an nVidia GeForce M840 GPU with 2 GB of dedicated VRAM. OLLAMA_DEBUG=2 was added to Environment Variables. As of 08:23 AM, here is the app.log:
time=2025-10-31T08:08:00.575+01:00 level=INFO source=app_windows.go:270 msg="starting Ollama" app=C:\Users\Joro\AppData\Local\Programs\Ollama version=0.12.7 OS=Windows/10.0.26100
time=2025-10-31T08:08:00.583+01:00 level=INFO source=app.go:231 msg="initialized tools registry" tool_count=0
time=2025-10-31T08:08:00.616+01:00 level=INFO source=app.go:246 msg="starting ollama server"
time=2025-10-31T08:08:00.965+01:00 level=INFO source=app.go:275 msg="starting ui server" port=53147
time=2025-10-31T08:08:03.966+01:00 level=INFO source=updater.go:252 msg="beginning update checker" interval=1h0m0s
...as well as the server.log:
time=2025-10-31T08:08:02.049+01:00 level=INFO source=routes.go:1524 msg="server config" env="map[CUDA_VISIBLE_DEVICES: GGML_VK_VISIBLE_DEVICES: GPU_DEVICE_ORDINAL: HIP_VISIBLE_DEVICES: HSA_OVERRIDE_GFX_VERSION: HTTPS_PROXY: HTTP_PROXY: NO_PROXY: OLLAMA_CONTEXT_LENGTH:4096 OLLAMA_DEBUG:DEBUG-4 OLLAMA_FLASH_ATTENTION:false OLLAMA_GPU_OVERHEAD:0 OLLAMA_HOST:http://127.0.0.1:11434 OLLAMA_INTEL_GPU:false OLLAMA_KEEP_ALIVE:5m0s OLLAMA_KV_CACHE_TYPE: OLLAMA_LLM_LIBRARY: OLLAMA_LOAD_TIMEOUT:5m0s OLLAMA_MAX_LOADED_MODELS:0 OLLAMA_MAX_QUEUE:512 OLLAMA_MODELS:C:\Users\Joro\.ollama\models OLLAMA_MULTIUSER_CACHE:false OLLAMA_NEW_ENGINE:false OLLAMA_NOHISTORY:false OLLAMA_NOPRUNE:false OLLAMA_NUM_PARALLEL:1 OLLAMA_ORIGINS:[http://localhost https://localhost http://localhost:* https://localhost:* http://127.0.0.1 https://127.0.0.1 http://127.0.0.1:* https://127.0.0.1:* http://0.0.0.0 https://0.0.0.0 http://0.0.0.0:* https://0.0.0.0:* app://* file://* tauri://* vscode-webview://* vscode-file://*] OLLAMA_REMOTES:[ollama.com] OLLAMA_SCHED_SPREAD:false ROCR_VISIBLE_DEVICES:]"
time=2025-10-31T08:08:02.110+01:00 level=INFO source=images.go:522 msg="total blobs: 12"
time=2025-10-31T08:08:02.113+01:00 level=INFO source=images.go:529 msg="total unused blobs removed: 0"
time=2025-10-31T08:08:02.119+01:00 level=INFO source=routes.go:1577 msg="Listening on 127.0.0.1:11434 (version 0.12.7)"
time=2025-10-31T08:08:02.120+01:00 level=DEBUG source=sched.go:120 msg="starting llm scheduler"
time=2025-10-31T08:08:02.125+01:00 level=INFO source=runner.go:76 msg="discovering available GPUs..."
time=2025-10-31T08:08:02.126+01:00 level=TRACE source=runner.go:471 msg="starting runner for device discovery" libDirs="[C:\Users\Joro\AppData\Local\Programs\Ollama\lib\ollama C:\Users\Joro\AppData\Local\Programs\Ollama\lib\ollama\cuda_v12]" extraEnvs=map[]
time=2025-10-31T08:08:02.148+01:00 level=INFO source=server.go:385 msg="starting runner" cmd="C:\Users\Joro\AppData\Local\Programs\Ollama\ollama.exe runner --ollama-engine --port 53150"
time=2025-10-31T08:08:02.149+01:00 level=DEBUG source=server.go:386 msg=subprocess OLLAMA_CONTEXT_LENGTH=4096 PATH="C:\Users\Joro\AppData\Local\Programs\Ollama\lib\ollama;C:\Users\Joro\AppData\Local\Programs\Ollama\lib\ollama\cuda_v12;C:\WINDOWS\system32;C:\WINDOWS;C:\WINDOWS\System32\Wbem;C:\WINDOWS\System32\WindowsPowerShell\v1.0\;C:\WINDOWS\System32\OpenSSH\;C:\Program Files (x86)\NVIDIA Corporation\PhysX\Common;C:\Program Files\PowerShell\7\;C:\Users\Joro\AppData\Local\Microsoft\WindowsApps;;C:\Users\Joro\AppData\Local\Programs\Ollama" OLLAMA_DEBUG=2 OLLAMA_MODELS=C:\Users\Joro.ollama\models OLLAMA_LIBRARY_PATH=C:\Users\Joro\AppData\Local\Programs\Ollama\lib\ollama;C:\Users\Joro\AppData\Local\Programs\Ollama\lib\ollama\cuda_v12
time=2025-10-31T08:08:02.226+01:00 level=INFO source=runner.go:1337 msg="starting ollama engine"
time=2025-10-31T08:08:02.231+01:00 level=INFO source=runner.go:1372 msg="Server listening on 127.0.0.1:53150"
time=2025-10-31T08:08:02.244+01:00 level=DEBUG source=gguf.go:590 msg=general.architecture type=string
time=2025-10-31T08:08:02.245+01:00 level=DEBUG source=gguf.go:590 msg=tokenizer.ggml.model type=string
time=2025-10-31T08:08:02.246+01:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=general.alignment default=32
time=2025-10-31T08:08:02.259+01:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=general.alignment default=32
time=2025-10-31T08:08:02.259+01:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=general.file_type default=0
time=2025-10-31T08:08:02.259+01:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=general.name default=""
time=2025-10-31T08:08:02.259+01:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=general.description default=""
time=2025-10-31T08:08:02.260+01:00 level=INFO source=ggml.go:135 msg="" architecture=llama file_type=unknown name="" description="" num_tensors=0 num_key_values=3
time=2025-10-31T08:08:02.260+01:00 level=DEBUG source=ggml.go:94 msg="ggml backend load all from path" path=C:\Users\Joro\AppData\Local\Programs\Ollama\lib\ollama
load_backend: loaded CPU backend from C:\Users\Joro\AppData\Local\Programs\Ollama\lib\ollama\ggml-cpu-haswell.dll
time=2025-10-31T08:08:02.955+01:00 level=DEBUG source=ggml.go:94 msg="ggml backend load all from path" path=C:\Users\Joro\AppData\Local\Programs\Ollama\lib\ollama\cuda_v12
time=2025-10-31T08:08:32.127+01:00 level=INFO source=runner.go:495 msg="failure during GPU discovery" OLLAMA_LIBRARY_PATH="[C:\Users\Joro\AppData\Local\Programs\Ollama\lib\ollama C:\Users\Joro\AppData\Local\Programs\Ollama\lib\ollama\cuda_v12]" extra_envs=map[] error="failed to finish discovery before timeout"
time=2025-10-31T08:08:32.127+01:00 level=TRACE source=runner.go:498 msg="runner enumerated devices" OLLAMA_LIBRARY_PATH="[C:\Users\Joro\AppData\Local\Programs\Ollama\lib\ollama C:\Users\Joro\AppData\Local\Programs\Ollama\lib\ollama\cuda_v12]" devices=[]
time=2025-10-31T08:08:32.127+01:00 level=DEBUG source=runner.go:468 msg="bootstrap discovery took" duration=30.0015049s OLLAMA_LIBRARY_PATH="[C:\Users\Joro\AppData\Local\Programs\Ollama\lib\ollama C:\Users\Joro\AppData\Local\Programs\Ollama\lib\ollama\cuda_v12]" extra_envs=map[]
time=2025-10-31T08:08:32.127+01:00 level=TRACE source=runner.go:471 msg="starting runner for device discovery" libDirs="[C:\Users\Joro\AppData\Local\Programs\Ollama\lib\ollama C:\Users\Joro\AppData\Local\Programs\Ollama\lib\ollama\cuda_v13]" extraEnvs=map[]
time=2025-10-31T08:08:32.133+01:00 level=INFO source=server.go:385 msg="starting runner" cmd="C:\Users\Joro\AppData\Local\Programs\Ollama\ollama.exe runner --ollama-engine --port 62942"
time=2025-10-31T08:08:32.133+01:00 level=DEBUG source=server.go:386 msg=subprocess OLLAMA_CONTEXT_LENGTH=4096 PATH="C:\Users\Joro\AppData\Local\Programs\Ollama\lib\ollama;C:\Users\Joro\AppData\Local\Programs\Ollama\lib\ollama\cuda_v13;C:\WINDOWS\system32;C:\WINDOWS;C:\WINDOWS\System32\Wbem;C:\WINDOWS\System32\WindowsPowerShell\v1.0\;C:\WINDOWS\System32\OpenSSH\;C:\Program Files (x86)\NVIDIA Corporation\PhysX\Common;C:\Program Files\PowerShell\7\;C:\Users\Joro\AppData\Local\Microsoft\WindowsApps;;C:\Users\Joro\AppData\Local\Programs\Ollama" OLLAMA_DEBUG=2 OLLAMA_MODELS=C:\Users\Joro.ollama\models OLLAMA_LIBRARY_PATH=C:\Users\Joro\AppData\Local\Programs\Ollama\lib\ollama;C:\Users\Joro\AppData\Local\Programs\Ollama\lib\ollama\cuda_v13
time=2025-10-31T08:08:32.194+01:00 level=INFO source=runner.go:1337 msg="starting ollama engine"
time=2025-10-31T08:08:32.196+01:00 level=INFO source=runner.go:1372 msg="Server listening on 127.0.0.1:62942"
time=2025-10-31T08:08:32.201+01:00 level=DEBUG source=gguf.go:590 msg=general.architecture type=string
time=2025-10-31T08:08:32.201+01:00 level=DEBUG source=gguf.go:590 msg=tokenizer.ggml.model type=string
time=2025-10-31T08:08:32.202+01:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=general.alignment default=32
time=2025-10-31T08:08:32.203+01:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=general.alignment default=32
time=2025-10-31T08:08:32.203+01:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=general.file_type default=0
time=2025-10-31T08:08:32.203+01:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=general.name default=""
time=2025-10-31T08:08:32.203+01:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=general.description default=""
time=2025-10-31T08:08:32.203+01:00 level=INFO source=ggml.go:135 msg="" architecture=llama file_type=unknown name="" description="" num_tensors=0 num_key_values=3
time=2025-10-31T08:08:32.203+01:00 level=DEBUG source=ggml.go:94 msg="ggml backend load all from path" path=C:\Users\Joro\AppData\Local\Programs\Ollama\lib\ollama
load_backend: loaded CPU backend from C:\Users\Joro\AppData\Local\Programs\Ollama\lib\ollama\ggml-cpu-haswell.dll
time=2025-10-31T08:08:32.232+01:00 level=DEBUG source=ggml.go:94 msg="ggml backend load all from path" path=C:\Users\Joro\AppData\Local\Programs\Ollama\lib\ollama\cuda_v13
time=2025-10-31T08:09:04.206+01:00 level=INFO source=runner.go:495 msg="failure during GPU discovery" OLLAMA_LIBRARY_PATH="[C:\Users\Joro\AppData\Local\Programs\Ollama\lib\ollama C:\Users\Joro\AppData\Local\Programs\Ollama\lib\ollama\cuda_v13]" extra_envs=map[] error="failed to finish discovery before timeout"
time=2025-10-31T08:09:04.206+01:00 level=TRACE source=runner.go:498 msg="runner enumerated devices" OLLAMA_LIBRARY_PATH="[C:\Users\Joro\AppData\Local\Programs\Ollama\lib\ollama C:\Users\Joro\AppData\Local\Programs\Ollama\lib\ollama\cuda_v13]" devices=[]
time=2025-10-31T08:09:04.206+01:00 level=DEBUG source=runner.go:468 msg="bootstrap discovery took" duration=32.0789732s OLLAMA_LIBRARY_PATH="[C:\Users\Joro\AppData\Local\Programs\Ollama\lib\ollama C:\Users\Joro\AppData\Local\Programs\Ollama\lib\ollama\cuda_v13]" extra_envs=map[]
time=2025-10-31T08:09:04.206+01:00 level=TRACE source=runner.go:471 msg="starting runner for device discovery" libDirs="[C:\Users\Joro\AppData\Local\Programs\Ollama\lib\ollama C:\Users\Joro\AppData\Local\Programs\Ollama\lib\ollama\rocm]" extraEnvs=map[]
time=2025-10-31T08:09:04.243+01:00 level=INFO source=server.go:385 msg="starting runner" cmd="C:\Users\Joro\AppData\Local\Programs\Ollama\ollama.exe runner --ollama-engine --port 62949"
time=2025-10-31T08:09:04.243+01:00 level=DEBUG source=server.go:386 msg=subprocess OLLAMA_CONTEXT_LENGTH=4096 PATH="C:\Users\Joro\AppData\Local\Programs\Ollama\lib\ollama;C:\Users\Joro\AppData\Local\Programs\Ollama\lib\ollama\rocm;C:\WINDOWS\system32;C:\WINDOWS;C:\WINDOWS\System32\Wbem;C:\WINDOWS\System32\WindowsPowerShell\v1.0\;C:\WINDOWS\System32\OpenSSH\;C:\Program Files (x86)\NVIDIA Corporation\PhysX\Common;C:\Program Files\PowerShell\7\;C:\Users\Joro\AppData\Local\Microsoft\WindowsApps;;C:\Users\Joro\AppData\Local\Programs\Ollama" OLLAMA_DEBUG=2 OLLAMA_MODELS=C:\Users\Joro.ollama\models OLLAMA_LIBRARY_PATH=C:\Users\Joro\AppData\Local\Programs\Ollama\lib\ollama;C:\Users\Joro\AppData\Local\Programs\Ollama\lib\ollama\rocm
time=2025-10-31T08:09:21.729+01:00 level=INFO source=runner.go:1337 msg="starting ollama engine"
time=2025-10-31T08:09:21.732+01:00 level=INFO source=runner.go:1372 msg="Server listening on 127.0.0.1:62949"
time=2025-10-31T08:09:21.817+01:00 level=DEBUG source=gguf.go:590 msg=general.architecture type=string
time=2025-10-31T08:09:21.817+01:00 level=DEBUG source=gguf.go:590 msg=tokenizer.ggml.model type=string
time=2025-10-31T08:09:21.817+01:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=general.alignment default=32
time=2025-10-31T08:09:21.819+01:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=general.alignment default=32
time=2025-10-31T08:09:21.819+01:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=general.file_type default=0
time=2025-10-31T08:09:21.819+01:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=general.name default=""
time=2025-10-31T08:09:21.819+01:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=general.description default=""
time=2025-10-31T08:09:21.819+01:00 level=INFO source=ggml.go:135 msg="" architecture=llama file_type=unknown name="" description="" num_tensors=0 num_key_values=3
time=2025-10-31T08:09:21.819+01:00 level=DEBUG source=ggml.go:94 msg="ggml backend load all from path" path=C:\Users\Joro\AppData\Local\Programs\Ollama\lib\ollama
load_backend: loaded CPU backend from C:\Users\Joro\AppData\Local\Programs\Ollama\lib\ollama\ggml-cpu-haswell.dll
time=2025-10-31T08:09:21.852+01:00 level=DEBUG source=ggml.go:94 msg="ggml backend load all from path" path=C:\Users\Joro\AppData\Local\Programs\Ollama\lib\ollama\rocm
time=2025-10-31T08:10:04.207+01:00 level=INFO source=runner.go:495 msg="failure during GPU discovery" OLLAMA_LIBRARY_PATH="[C:\Users\Joro\AppData\Local\Programs\Ollama\lib\ollama C:\Users\Joro\AppData\Local\Programs\Ollama\lib\ollama\rocm]" extra_envs=map[] error="failed to finish discovery before timeout"
time=2025-10-31T08:10:04.207+01:00 level=TRACE source=runner.go:498 msg="runner enumerated devices" OLLAMA_LIBRARY_PATH="[C:\Users\Joro\AppData\Local\Programs\Ollama\lib\ollama C:\Users\Joro\AppData\Local\Programs\Ollama\lib\ollama\rocm]" devices=[]
time=2025-10-31T08:10:04.207+01:00 level=DEBUG source=runner.go:468 msg="bootstrap discovery took" duration=1m0.0009716s OLLAMA_LIBRARY_PATH="[C:\Users\Joro\AppData\Local\Programs\Ollama\lib\ollama C:\Users\Joro\AppData\Local\Programs\Ollama\lib\ollama\rocm]" extra_envs=map[]
time=2025-10-31T08:10:04.207+01:00 level=DEBUG source=runner.go:120 msg="evluating which if any devices to filter out" initial_count=0
time=2025-10-31T08:10:04.208+01:00 level=TRACE source=runner.go:179 msg="supported GPU library combinations before filtering" supported=map[]
time=2025-10-31T08:10:04.208+01:00 level=DEBUG source=runner.go:41 msg="GPU bootstrap discovery took" duration=2m2.0872522s
time=2025-10-31T08:10:04.209+01:00 level=INFO source=types.go:60 msg="inference compute" id=cpu library=cpu compute="" name=cpu description=cpu libdirs=ollama driver="" pci_id="" type="" total="7.9 GiB" available="802.4 MiB"
time=2025-10-31T08:10:04.209+01:00 level=INFO source=routes.go:1618 msg="entering low vram mode" "total vram"="0 B" threshold="20.0 GiB"
[GIN] 2025/10/31 - 08:12:58 | 200 | 8.1601ms | 127.0.0.1 | GET "/api/version"
[GIN] 2025/10/31 - 08:13:03 | 200 | 0s | 127.0.0.1 | HEAD "/"
[GIN] 2025/10/31 - 08:13:03 | 200 | 38.5777ms | 127.0.0.1 | GET "/api/tags"
[GIN] 2025/10/31 - 08:13:24 | 200 | 0s | 127.0.0.1 | HEAD "/"
time=2025-10-31T08:13:24.904+01:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=general.alignment default=32
[GIN] 2025/10/31 - 08:13:24 | 200 | 173.1176ms | 127.0.0.1 | POST "/api/show"
time=2025-10-31T08:13:25.046+01:00 level=DEBUG source=runner.go:267 msg="refreshing free memory"
time=2025-10-31T08:13:25.046+01:00 level=DEBUG source=runner.go:41 msg="overall device VRAM discovery took" duration=0s
@dhiltgen commented on GitHub (Oct 31, 2025):
I made a small fix to some logging logic for Windows in 0.12.8 which likely won't fix this, but may help us get more details on the failure. Anyone still seeing the hang, please install 0.12.8, set `$env:OLLAMA_DEBUG="2"`, and share the server log from startup through a request that hangs so we can try to narrow down what's going wrong.
@katmandoo212 commented on GitHub (Nov 1, 2025):
I ran 0.12.8 with OLLAMA_DEBUG=2 as suggested. Trying to run a local model hangs, but Cloud models work. I have attached my server.log and app.log.
@katmandoo212 commented on GitHub (Nov 1, 2025):
Just to let everyone know, I tried 0.12.9 on Windows 10 with no GPU, and it still hangs (the spinner keeps spinning) when loading local models, but cloud models do work.
@dhiltgen commented on GitHub (Nov 4, 2025):
It sounds like there's a deadlock someplace, but I'm not sure where the system is getting hung up. Let's try to isolate things a little more. @katmandoo212, can you quit the GUI app by exiting the tray application, then run the server and CLI in a terminal.
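The command snippet itself did not survive here; a plausible reconstruction, assuming PowerShell and a foreground server with debug logging enabled, is:

```powershell
# Terminal 1: run the server in the foreground with verbose logging
$env:OLLAMA_DEBUG="2"
ollama serve
```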
then in another terminal
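This snippet is also missing; presumably a plain model run, where `qwen3:0.6b` stands in for whichever local model hangs:

```powershell
# Terminal 2: trigger a local model load so the hang can be reproduced
ollama run qwen3:0.6b "hello"
```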
Also, when it gets into this stuck state, do you see the `ollama serve` process chewing up a CPU core in Task Manager, or is the system completely idle?
Comparing logs with other systems, it seems like it may be gathering information about the CPUs in the system. Is there anything unusual about your CPU setup?
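For what it's worth, one way to sample this outside Task Manager (assuming the relevant processes match `ollama*`):

```powershell
# Poll cumulative CPU seconds for ollama processes; if the numbers keep
# climbing while loading is stuck, the server is spinning rather than idle
1..5 | ForEach-Object { Get-Process ollama* | Select-Object Name, Id, CPU; Start-Sleep -Seconds 2 }
```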
@katmandoo212 commented on GitHub (Nov 5, 2025):
Here is my server log from the latest run using your instructions.
I hope that helps.