Mirror of https://github.com/ollama/ollama.git (synced 2026-05-05)
Closed · opened 2026-04-12 21:02:15 -05:00 by GiteaMirror · 24 comments
Originally created by @samialisayed on GitHub (Oct 15, 2025).
Original GitHub issue: https://github.com/ollama/ollama/issues/12640
Originally assigned to: @dhiltgen on GitHub.
What is the issue?
Ollama hangs when I send a message and never returns a response. This happens with all models. Any idea how to solve this? Ollama was working fine a month ago; I do not know what happened!
Relevant log output
OS: Windows
GPU: Tesla T4
CPU: No response
Ollama version: v0.12.5
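A quick way to tell whether the server itself is still responsive (as opposed to a stuck model load) is a direct API call; for example, from PowerShell against the default port, substituting any installed model for llama3:
# Minimal generate request; if this also hangs, the problem is server-side, not in the client or GUI
Invoke-RestMethod -Uri "http://localhost:11434/api/generate" -Method Post -ContentType "application/json" -Body '{"model":"llama3","prompt":"hi","stream":false}'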
@jmorganca commented on GitHub (Oct 15, 2025):
@samialisayed may I ask which model you are running? Sorry about that!
@samialisayed commented on GitHub (Oct 15, 2025):
@jmorganca
I tried many models such as llama3, tinyllama, phi
@kappa8219 commented on GitHub (Oct 17, 2025):
See #12660; fixed in 0.12.6.
@samialisayed commented on GitHub (Oct 17, 2025):
@kappa8219 It is not fixed; it is still hanging.
@kappa8219 commented on GitHub (Oct 17, 2025):
Strange, I have the same GPU; maybe the node is different.
@samialisayed commented on GitHub (Oct 17, 2025):
@kappa8219 do you recommend any solution?
@kappa8219 commented on GitHub (Oct 17, 2025):
No, none for Windows. On Linux I would try to get the system logs.
@katmandoo212 commented on GitHub (Oct 17, 2025):
Same issue. Tried several models including gpt-oss:20b downloaded to my PC today.
ollama --version
ollama version is 0.12.6
ollama run qwen3:1.7b "What is the equation to calculate the area of a square?"
ollama serve
time=2025-10-17T17:39:29.977-04:00 level=INFO source=routes.go:1511 msg="server config" env="map[CUDA_VISIBLE_DEVICES: GGML_VK_VISIBLE_DEVICES: GPU_DEVICE_ORDINAL: HIP_VISIBLE_DEVICES: HSA_OVERRIDE_GFX_VERSION: HTTPS_PROXY: HTTP_PROXY: NO_PROXY: OLLAMA_CONTEXT_LENGTH:4096 OLLAMA_DEBUG:INFO OLLAMA_FLASH_ATTENTION:false OLLAMA_GPU_OVERHEAD:0 OLLAMA_HOST:http://127.0.0.1:11434 OLLAMA_INTEL_GPU:false OLLAMA_KEEP_ALIVE:5m0s OLLAMA_KV_CACHE_TYPE: OLLAMA_LLM_LIBRARY: OLLAMA_LOAD_TIMEOUT:5m0s OLLAMA_MAX_LOADED_MODELS:0 OLLAMA_MAX_QUEUE:512 OLLAMA_MODELS:G:\OllamaFiles\models OLLAMA_MULTIUSER_CACHE:false OLLAMA_NEW_ENGINE:false OLLAMA_NOHISTORY:false OLLAMA_NOPRUNE:false OLLAMA_NUM_PARALLEL:1 OLLAMA_ORIGINS:[http://localhost https://localhost http://localhost:* https://localhost:* http://127.0.0.1 https://127.0.0.1 http://127.0.0.1:* https://127.0.0.1:* http://0.0.0.0 https://0.0.0.0 http://0.0.0.0:* https://0.0.0.0:* app://* file://* tauri://* vscode-webview://* vscode-file://*] OLLAMA_REMOTES:[ollama.com] OLLAMA_SCHED_SPREAD:false ROCR_VISIBLE_DEVICES:]"
time=2025-10-17T17:39:30.041-04:00 level=INFO source=images.go:522 msg="total blobs: 330"
time=2025-10-17T17:39:30.075-04:00 level=INFO source=images.go:529 msg="total unused blobs removed: 0"
time=2025-10-17T17:39:30.087-04:00 level=INFO source=routes.go:1564 msg="Listening on 127.0.0.1:11434 (version 0.12.6)"
time=2025-10-17T17:39:30.090-04:00 level=INFO source=runner.go:80 msg="discovering available GPUs..."
time=2025-10-17T17:39:30.878-04:00 level=INFO source=types.go:129 msg="inference compute" id=cpu library=cpu compute="" name=cpu description=cpu libdirs=ollama driver="" pci_id="" type="" total="11.9 GiB" available="8.1 GiB"
time=2025-10-17T17:39:30.878-04:00 level=INFO source=routes.go:1605 msg="entering low vram mode" "total vram"="0 B" threshold="20.0 GiB"
[GIN] 2025/10/17 - 17:39:45 | 200 | 603.2µs | 127.0.0.1 | HEAD "/"
[GIN] 2025/10/17 - 17:39:45 | 200 | 221.7137ms | 127.0.0.1 | POST "/api/show"
@mkrostm commented on GitHub (Oct 17, 2025):
Same issue with all models: deepseek-r1:1.5b, llama3.2, qwen2.5.
ollama version is 0.12.6
Windows 10
@Panican-Whyasker commented on GitHub (Oct 19, 2025):
Same issue here. Ollama 0.12.6 on Windows 11; GPU nVidia GeForce 840M (CUDA compute capability 5.0, the minimum required by Ollama) with 2 GB of VRAM; CPU Intel Core i7-5600, 6 GB system RAM. I have therefore tried only rather small LLMs like gemma3:270m and qwen3:1.7b. Starting the LLMs from either PowerShell or Ollama's GUI gives the same result: the model start takes forever. At the same time, one ollama.exe process keeps running at 25% CPU (one of four logical cores at 100%). I looked at the process's threads with SysInternals Process Explorer; see the attached screenshot.
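For reference, the pegged process can also be spot-checked from PowerShell without Process Explorer (the CPU column is cumulative processor seconds, so running it twice shows it climbing):
# Lists every ollama process with its accumulated CPU time in seconds
Get-Process ollama | Select-Object Id, ProcessName, CPU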
Here is my server log:
time=2025-10-19T12:39:08.441+02:00 level=INFO source=routes.go:1511 msg="server config" env="map[CUDA_VISIBLE_DEVICES: GGML_VK_VISIBLE_DEVICES: GPU_DEVICE_ORDINAL: HIP_VISIBLE_DEVICES: HSA_OVERRIDE_GFX_VERSION: HTTPS_PROXY: HTTP_PROXY: NO_PROXY: OLLAMA_CONTEXT_LENGTH:4096 OLLAMA_DEBUG:INFO OLLAMA_FLASH_ATTENTION:false OLLAMA_GPU_OVERHEAD:0 OLLAMA_HOST:http://127.0.0.1:11434 OLLAMA_INTEL_GPU:false OLLAMA_KEEP_ALIVE:5m0s OLLAMA_KV_CACHE_TYPE: OLLAMA_LLM_LIBRARY: OLLAMA_LOAD_TIMEOUT:5m0s OLLAMA_MAX_LOADED_MODELS:0 OLLAMA_MAX_QUEUE:512 OLLAMA_MODELS:C:\Users\Joro\.ollama\models OLLAMA_MULTIUSER_CACHE:false OLLAMA_NEW_ENGINE:false OLLAMA_NOHISTORY:false OLLAMA_NOPRUNE:false OLLAMA_NUM_PARALLEL:1 OLLAMA_ORIGINS:[http://localhost https://localhost http://localhost:* https://localhost:* http://127.0.0.1 https://127.0.0.1 http://127.0.0.1:* https://127.0.0.1:* http://0.0.0.0 https://0.0.0.0 http://0.0.0.0:* https://0.0.0.0:* app://* file://* tauri://* vscode-webview://* vscode-file://*] OLLAMA_REMOTES:[ollama.com] OLLAMA_SCHED_SPREAD:false ROCR_VISIBLE_DEVICES:]"
time=2025-10-19T12:39:08.447+02:00 level=INFO source=images.go:522 msg="total blobs: 12"
time=2025-10-19T12:39:08.447+02:00 level=INFO source=images.go:529 msg="total unused blobs removed: 0"
time=2025-10-19T12:39:08.450+02:00 level=INFO source=routes.go:1564 msg="Listening on 127.0.0.1:11434 (version 0.12.6)"
time=2025-10-19T12:39:08.453+02:00 level=INFO source=runner.go:80 msg="discovering available GPUs..."
time=2025-10-19T12:39:10.284+02:00 level=INFO source=types.go:112 msg="inference compute" id=GPU-838173db-880d-6d37-e6ad-d4b277cdde5a library=CUDA compute=5.0 name=CUDA0 description="NVIDIA GeForce 840M" libdirs=ollama,cuda_v12 driver=13.0 pci_id=03:00.0 type=discrete total="2.0 GiB" available="2.0 GiB"
time=2025-10-19T12:39:10.285+02:00 level=INFO source=routes.go:1605 msg="entering low vram mode" "total vram"="2.0 GiB" threshold="20.0 GiB"
[GIN] 2025/10/19 - 12:39:10 | 200 | 522.6µs | 127.0.0.1 | HEAD "/"
[GIN] 2025/10/19 - 12:39:10 | 200 | 3.7394ms | 127.0.0.1 | GET "/api/tags"
[GIN] 2025/10/19 - 12:43:09 | 200 | 0s | 127.0.0.1 | HEAD "/"
[GIN] 2025/10/19 - 12:43:09 | 200 | 117.6405ms | 127.0.0.1 | POST "/api/show"
And that's it. Ollama.exe (one of two copies in memory) keeps running at 25% CPU (100% of one logical core).
When trying to run the same model from within Ollama's GUI, the server log gets a few extra lines:
time=2025-10-19T12:54:34.628+02:00 level=INFO source=routes.go:1511 msg="server config" env="map[CUDA_VISIBLE_DEVICES: GGML_VK_VISIBLE_DEVICES: GPU_DEVICE_ORDINAL: HIP_VISIBLE_DEVICES: HSA_OVERRIDE_GFX_VERSION: HTTPS_PROXY: HTTP_PROXY: NO_PROXY: OLLAMA_CONTEXT_LENGTH:4096 OLLAMA_DEBUG:INFO OLLAMA_FLASH_ATTENTION:false OLLAMA_GPU_OVERHEAD:0 OLLAMA_HOST:http://127.0.0.1:11434 OLLAMA_INTEL_GPU:false OLLAMA_KEEP_ALIVE:5m0s OLLAMA_KV_CACHE_TYPE: OLLAMA_LLM_LIBRARY: OLLAMA_LOAD_TIMEOUT:5m0s OLLAMA_MAX_LOADED_MODELS:0 OLLAMA_MAX_QUEUE:512 OLLAMA_MODELS:C:\Users\Joro\.ollama\models OLLAMA_MULTIUSER_CACHE:false OLLAMA_NEW_ENGINE:false OLLAMA_NOHISTORY:false OLLAMA_NOPRUNE:false OLLAMA_NUM_PARALLEL:1 OLLAMA_ORIGINS:[http://localhost https://localhost http://localhost:* https://localhost:* http://127.0.0.1 https://127.0.0.1 http://127.0.0.1:* https://127.0.0.1:* http://0.0.0.0 https://0.0.0.0 http://0.0.0.0:* https://0.0.0.0:* app://* file://* tauri://* vscode-webview://* vscode-file://*] OLLAMA_REMOTES:[ollama.com] OLLAMA_SCHED_SPREAD:false ROCR_VISIBLE_DEVICES:]"
time=2025-10-19T12:54:34.643+02:00 level=INFO source=images.go:522 msg="total blobs: 12"
time=2025-10-19T12:54:34.657+02:00 level=INFO source=images.go:529 msg="total unused blobs removed: 0"
time=2025-10-19T12:54:34.673+02:00 level=INFO source=routes.go:1564 msg="Listening on 127.0.0.1:11434 (version 0.12.6)"
time=2025-10-19T12:54:34.675+02:00 level=INFO source=runner.go:80 msg="discovering available GPUs..."
time=2025-10-19T12:54:37.090+02:00 level=INFO source=types.go:112 msg="inference compute" id=GPU-838173db-880d-6d37-e6ad-d4b277cdde5a library=CUDA compute=5.0 name=CUDA0 description="NVIDIA GeForce 840M" libdirs=ollama,cuda_v12 driver=13.0 pci_id=03:00.0 type=discrete total="2.0 GiB" available="2.0 GiB"
time=2025-10-19T12:54:37.091+02:00 level=INFO source=routes.go:1605 msg="entering low vram mode" "total vram"="2.0 GiB" threshold="20.0 GiB"
[GIN] 2025/10/19 - 12:54:37 | 200 | 525.5µs | 127.0.0.1 | GET "/api/version"
[GIN] 2025/10/19 - 12:54:37 | 200 | 525.5µs | 127.0.0.1 | GET "/api/version"
[GIN] 2025/10/19 - 12:54:37 | 200 | 5.7471ms | 127.0.0.1 | GET "/api/tags"
[GIN] 2025/10/19 - 12:54:37 | 200 | 160.8987ms | 127.0.0.1 | POST "/api/show"
[GIN] 2025/10/19 - 12:54:44 | 200 | 0s | 127.0.0.1 | GET "/api/version"
[GIN] 2025/10/19 - 12:54:44 | 200 | 6.6484ms | 127.0.0.1 | GET "/api/tags"
[GIN] 2025/10/19 - 12:54:56 | 200 | 124.4086ms | 127.0.0.1 | POST "/api/show"
[GIN] 2025/10/19 - 12:54:56 | 200 | 100.1837ms | 127.0.0.1 | POST "/api/show"
And here's my app.log:
time=2025-10-19T12:54:32.717+02:00 level=INFO source=app_windows.go:272 msg="starting Ollama" app=C:\Users\Joro\AppData\Local\Programs\Ollama version=0.12.6 OS=Windows/10.0.26100
time=2025-10-19T12:54:32.721+02:00 level=INFO source=app.go:232 msg="initialized tools registry" tool_count=0
time=2025-10-19T12:54:32.735+02:00 level=INFO source=app.go:247 msg="starting ollama server"
time=2025-10-19T12:54:33.173+02:00 level=INFO source=app.go:279 msg="starting ui server" port=56109
time=2025-10-19T12:54:35.437+02:00 level=INFO source=ui.go:171 msg=site.serveHTTP http.method=GET http.path=/api/v1/settings http.pattern="GET /api/v1/settings" http.status=200 http.d=997.1µs request_id=1760871275436712400 version=0.12.6
time=2025-10-19T12:54:35.471+02:00 level=INFO source=ui.go:171 msg=site.serveHTTP http.method=GET http.path=/api/v1/chats http.pattern="GET /api/v1/chats" http.status=200 http.d=1.0016ms request_id=1760871275470420900 version=0.12.6
time=2025-10-19T12:54:35.490+02:00 level=INFO source=ui.go:171 msg=site.serveHTTP http.method=GET http.path=/api/v1/me http.pattern="GET /api/v1/me" http.status=200 http.d=121.4562ms request_id=1760871275368774800 version=0.12.6
time=2025-10-19T12:54:36.039+02:00 level=ERROR source=ui.go:1617 msg="failed to get inference compute" error="timeout scanning server log for inference compute details"
time=2025-10-19T12:54:36.039+02:00 level=ERROR source=ui.go:171 msg=site.serveHTTP error="failed to get inference compute: timeout scanning server log for inference compute details" http.method=GET http.path=/api/v1/inference-compute http.pattern="GET /api/v1/inference-compute" http.status=500 http.d=592.3808ms request_id=1760871275447595400 version=0.12.6
time=2025-10-19T12:54:36.174+02:00 level=INFO source=updater.go:252 msg="beginning update checker" interval=1h0m0s
time=2025-10-19T12:54:37.091+02:00 level=INFO source=ui.go:171 msg=site.serveHTTP http.method=GET http.path=/api/v1/health http.pattern="GET /api/v1/health" http.status=200 http.d=1.6225602s request_id=1760871275469340400 version=0.12.6
time=2025-10-19T12:54:37.098+02:00 level=INFO source=ui.go:171 msg=site.serveHTTP http.method=GET http.path=/api/v1/models http.pattern="GET /api/v1/models" http.status=200 http.d=1.6539439s request_id=1760871275444706100 version=0.12.6
time=2025-10-19T12:54:37.187+02:00 level=INFO source=server.go:343 msg=Matched "inference compute"="{Library:CUDA Variant: Compute:5.0 Driver:13.0 Name:CUDA0 VRAM:2.0 GiB}"
time=2025-10-19T12:54:37.187+02:00 level=INFO source=ui.go:171 msg=site.serveHTTP http.method=GET http.path=/api/v1/inference-compute http.pattern="GET /api/v1/inference-compute" http.status=200 http.d=128.9369ms request_id=1760871277058569100 version=0.12.6
time=2025-10-19T12:54:37.253+02:00 level=INFO source=ui.go:171 msg=site.serveHTTP http.method=GET http.path=/api/v1/model/gemma3:270m/capabilities http.pattern="GET /api/v1/model/{model}/capabilities" http.status=200 http.d=1.7391883s request_id=1760871275514116200 version=0.12.6
time=2025-10-19T12:54:37.293+02:00 level=INFO source=ui.go:171 msg=site.serveHTTP http.method=POST http.path=/api/v1/model/upstream http.pattern="POST /api/v1/model/upstream" http.status=200 http.d=161.3473ms request_id=1760871277131674100 version=0.12.6
time=2025-10-19T12:54:40.146+02:00 level=INFO source=ui.go:171 msg=site.serveHTTP http.method=GET http.path=/api/v1/chat/0199fbf4-fbea-7735-85af-904746282010 http.pattern="GET /api/v1/chat/{id}" http.status=200 http.d=15.5374ms request_id=1760871280131187300 version=0.12.6
time=2025-10-19T12:54:42.095+02:00 level=INFO source=ui.go:171 msg=site.serveHTTP http.method=GET http.path=/api/v1/chat/0199fbf4-fbea-7735-85af-904746282010 http.pattern="GET /api/v1/chat/{id}" http.status=200 http.d=507.5µs request_id=1760871282095350400 version=0.12.6
time=2025-10-19T12:54:44.635+02:00 level=INFO source=ui.go:171 msg=site.serveHTTP http.method=GET http.path=/api/v1/settings http.pattern="GET /api/v1/settings" http.status=200 http.d=950.6µs request_id=1760871284634242300 version=0.12.6
time=2025-10-19T12:54:44.639+02:00 level=INFO source=ui.go:171 msg=site.serveHTTP http.method=GET http.path=/api/v1/chat/0199fbf4-fbea-7735-85af-904746282010 http.pattern="GET /api/v1/chat/{id}" http.status=200 http.d=1.9311ms request_id=1760871284637210500 version=0.12.6
time=2025-10-19T12:54:44.646+02:00 level=INFO source=ui.go:171 msg=site.serveHTTP http.method=GET http.path=/api/v1/models http.pattern="GET /api/v1/models" http.status=200 http.d=10.3327ms request_id=1760871284636351800 version=0.12.6
time=2025-10-19T12:57:50.812+02:00 level=ERROR source=ui.go:1178 msg="chat stream error" error="Post "http://127.0.0.1:11434/api/chat": context canceled"
time=2025-10-19T12:57:50.814+02:00 level=INFO source=ui.go:171 msg=site.serveHTTP http.method=POST http.path=/api/v1/chat/0199fbf4-fbea-7735-85af-904746282010 http.pattern="POST /api/v1/chat/{id}" http.status=200 http.d=2m54.1014868s request_id=1760871296712971700 version=0.12.6
time=2025-10-19T12:57:50.878+02:00 level=INFO source=app.go:323 msg="shutting down desktop server"
time=2025-10-19T12:57:50.881+02:00 level=INFO source=app.go:328 msg="shutting down ollama server"
time=2025-10-19T12:57:55.888+02:00 level=WARN source=server.go:132 msg="timeout waiting for graceful shutdown; killing" pid=3832
@Panican-Whyasker commented on GitHub (Oct 19, 2025):
Regarding my earlier comment: That was with a new Ollama installation, on a freshly installed (and updated) Windows 11.
My main laptop with Ollama 0.12.6 (also Windows 11; Intel Core i9-9880H; nVidia Quadro RTX3000 w. 6 GB of VRAM; 128 GB of system RAM) runs the above LLMs normally. There, Ollama was updated to v0.12.6 shortly after that update became available.
@Panican-Whyasker commented on GitHub (Oct 19, 2025):
Update: The issue is not present in Ollama v0.12.3; however, updating to 0.12.6 breaks it.
@samialisayed you may want to download and install version 0.12.3 (it will replace v0.12.6; just don't apply the update back to 0.12.6, which Ollama downloads and offers very soon after installation).
(https://github.com/ollama/ollama/releases)
(https://github.com/ollama/ollama/releases/download/v0.12.3/OllamaSetup.exe)
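For example, a minimal PowerShell sketch using the release URL above (downloading the installer by hand from the releases page works just as well):
# Fetch the v0.12.3 installer and launch it; it replaces the currently installed version
Invoke-WebRequest -Uri "https://github.com/ollama/ollama/releases/download/v0.12.3/OllamaSetup.exe" -OutFile "$env:TEMP\OllamaSetup-0.12.3.exe"
Start-Process "$env:TEMP\OllamaSetup-0.12.3.exe"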
@Panican-Whyasker commented on GitHub (Oct 20, 2025):
@samialisayed: as @rick-github explained in https://github.com/ollama/ollama/issues/12699, Ollama 0.12.3 is not affected; the issue likely starts with 0.12.4.
@Panican-Whyasker commented on GitHub (Oct 20, 2025):
Interestingly, there is no problem with Ollama 0.12.6 on Windows Server 2016 Datacenter (GPU-less).
@rick-github commented on GitHub (Oct 20, 2025):
Thanks for the data point. Based on the initial reports I thought it was due to the lack of GPU, and not widely reported because most users have a GPU. But it seems like there is something else at play.
@samialisayed commented on GitHub (Oct 22, 2025):
@Panican-Whyasker I have installed the lower version and it works. Thank you!
@dhiltgen commented on GitHub (Oct 29, 2025):
Please give 0.12.7 a try and let us know if the issues are resolved. If not, please share an updated log with OLLAMA_DEBUG=2 set so we can take a look.
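On Windows a quick way to do that is to quit the tray app (so port 11434 is free) and run a foreground server from PowerShell, for example:
# Debug/trace logging for this session only; logs print to this console
$env:OLLAMA_DEBUG = "2"
ollama serve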
@katmandoo212 commented on GitHub (Oct 30, 2025):
I gave 0.12.7 a try with OLLAMA_DEBUG=2. I could not get local models to load, but cloud models do work (gpt-oss:20b-cloud, for example). This is my server.log, followed by my app.log. I hope this helps.
time=2025-10-29T21:44:11.691-04:00 level=INFO source=routes.go:1524 msg="server config" env="map[CUDA_VISIBLE_DEVICES: GGML_VK_VISIBLE_DEVICES: GPU_DEVICE_ORDINAL: HIP_VISIBLE_DEVICES: HSA_OVERRIDE_GFX_VERSION: HTTPS_PROXY: HTTP_PROXY: NO_PROXY: OLLAMA_CONTEXT_LENGTH:16384 OLLAMA_DEBUG:DEBUG-4 OLLAMA_FLASH_ATTENTION:false OLLAMA_GPU_OVERHEAD:0 OLLAMA_HOST:http://127.0.0.1:11434 OLLAMA_INTEL_GPU:false OLLAMA_KEEP_ALIVE:5m0s OLLAMA_KV_CACHE_TYPE: OLLAMA_LLM_LIBRARY: OLLAMA_LOAD_TIMEOUT:5m0s OLLAMA_MAX_LOADED_MODELS:0 OLLAMA_MAX_QUEUE:512 OLLAMA_MODELS:G:\OllamaFiles\models OLLAMA_MULTIUSER_CACHE:false OLLAMA_NEW_ENGINE:false OLLAMA_NOHISTORY:false OLLAMA_NOPRUNE:false OLLAMA_NUM_PARALLEL:1 OLLAMA_ORIGINS:[http://localhost https://localhost http://localhost:* https://localhost:* http://127.0.0.1 https://127.0.0.1 http://127.0.0.1:* https://127.0.0.1:* http://0.0.0.0 https://0.0.0.0 http://0.0.0.0:* https://0.0.0.0:* app://* file://* tauri://* vscode-webview://* vscode-file://*] OLLAMA_REMOTES:[ollama.com] OLLAMA_SCHED_SPREAD:false ROCR_VISIBLE_DEVICES:]"
time=2025-10-29T21:44:11.745-04:00 level=INFO source=images.go:522 msg="total blobs: 349"
time=2025-10-29T21:44:11.776-04:00 level=INFO source=images.go:529 msg="total unused blobs removed: 0"
time=2025-10-29T21:44:11.786-04:00 level=INFO source=routes.go:1577 msg="Listening on 127.0.0.1:11434 (version 0.12.7)"
time=2025-10-29T21:44:11.786-04:00 level=DEBUG source=sched.go:120 msg="starting llm scheduler"
time=2025-10-29T21:44:11.788-04:00 level=INFO source=runner.go:76 msg="discovering available GPUs..."
time=2025-10-29T21:44:11.788-04:00 level=TRACE source=runner.go:471 msg="starting runner for device discovery" libDirs="[C:\Users\User\AppData\Local\Programs\Ollama\lib\ollama C:\Users\User\AppData\Local\Programs\Ollama\lib\ollama\cuda_v12]" extraEnvs=map[]
time=2025-10-29T21:44:11.801-04:00 level=INFO source=server.go:385 msg="starting runner" cmd="C:\Users\User\AppData\Local\Programs\Ollama\ollama.exe runner --ollama-engine --port 60946"
time=2025-10-29T21:44:11.802-04:00 level=DEBUG source=server.go:386 msg=subprocess OLLAMA_MODELS=G:\OllamaFiles\models OLLAMA_DEBUG=2 OLLAMA_CONTEXT_LENGTH=16384 OLLAMA_AUTO_UPDATE=false OLLAMA_RAGTEMP=C:\OllamaRAGTemp PATH="C:\Users\User\AppData\Local\Programs\Ollama\lib\ollama;C:\Users\User\AppData\Local\Programs\Ollama\lib\ollama\cuda_v12;C:\Program Files\PowerShell\7;C:\Program Files (x86)\oh-my-posh\bin\;C:\Users\User\Downloads\vcmake\vcpkg\installed\x64_windows;C:\Program Files\Common Files\Oracle\Java\javapath;C:\ActiveTcl\bin;C:\Program Files\Microsoft MPI\Bin\;C:\Users\User\AppData\Local\Programs\Python\Python314\Scripts;C:\Users\User\AppData\Local\Programs\Python\Python314;C:\Users\User\AppData\Local\Programs\Python\Python314\tct\tcl8.6;C:\Users\User\AppData\Local\Programs\Python\Python314\tcl;C:\Users\User\AppData\Local\Programs\Python\Python313\Scripts;C:\Users\User\AppData\Local\Programs\Python\Python313;C:\Users\User\AppData\Local\Programs\Python\Python313\tcl\tcl8.6;C:\Users\User\AppData\Local\Programs\Python\Python313\tcl;C:\Program Files\OpenSSL\bin;C:\Users\User\AppData\Roaming\ActiveState\bin;C:\Windows\system32;C:\Windows;C:\Windows\System32\Wbem;C:\Windows\System32\WindowsPowerShell\v1.0\;C:\Program Files\Graphviz\bin;C:\Windows\System32\OpenSSH\;C:\Program Files\Microsoft SQL Server\130\Tools\Binn\;C:\Program Files\Microsoft SQL Server\Client SDK\ODBC\170\Tools\Binn\;C:\Program Files\dotnet\;C:\Program Files\Microsoft SQL Server\150\Tools\Binn\;C:\Program Files (x86)\Microsoft SQL Server\150\DTS\Binn\;C:\Program Files\Microsoft SQL Server\150\DTS\Binn\;C:\Program Files (x86)\Microsoft SQL Server\150\Tools\Binn\;C:\Program Files\Azure Data Studio\bin;C:\WINDOWS\system32;C:\WINDOWS;C:\WINDOWS\System32\Wbem;C:\WINDOWS\System32\WindowsPowerShell\v1.0\;C:\WINDOWS\System32\OpenSSH\;C:\ProgramData\chocolatey\bin;C:\Program Files\Java\jdk-21\bin;C:\Program Files\NASM;C:\Program Files\Microsoft VS Code\bin;C:\Program Files\gs\gs10.03.0\bin;C:\Program Files (x86)\Microsoft SQL Server\160\DTS\Binn\;C:\Program Files\PuTTY\;C:\Program Files\RedHat\Podman\;C:\TDM-GCC-64\bin;D:\home\blt\github\vcpkg;C:\Program Files\CMake\bin;C:\Program Files\nodejs\;C:\Program Files\Go\bin;C:\Program Files\Pandoc\;C:\Program Files\Docker\Docker\resources\bin;C:\Program Files\PowerShell\7\;C:\Program Files (x86)\Windows Kits\10\Windows Performance Toolkit\;C:\Program Files\Git\cmd;C:\Users\User\AppData\Local\Programs\oh-my-posh\bin\;C:\Users\User\.local\bin;C:\Users\User\AppData\Local\Programs\Ollama;C:\Users\User\Downloads\vcmake\vcpkg\installed\x64_windows;C:\Users\User\.cargo\bin;C:\Program Files\OpenSSL\bin;C:\Users\User\AppData\Local\Microsoft\WindowsApps;C:\Program Files\Azure Data Studio\bin;C:\Program Files\PostgreSQL\15\bin;C:\Users\User\AppData\Local\GitHubDesktop\bin;C:\Users\User\Downloads\ffmpeg-master-latest-win64-gpl\ffmpeg-master-latest-win64-gpl\bin;C:\Program Files\Graphviz\bin;c:\Program Files\zig;c:\users\user\.local\bin;C:\Program Files (x86)\Intel\oneAPI;C:\Users\User\go\bin;C:\Users\User\.lmstudio\bin;C:\Users\User\.dotnet\tools;C:\Users\User\AppData\Local\Programs\Windsurf\bin;C:\Users\User\AppData\Local\reflex\bun\bin;C:\Users\User\AppData\Local\Programs\MiKTeX\miktex\bin\x64\;C:\Users\User\AppData\Roaming\npm;C:\Users\User\go\bin;C:\Users\User\AppData\Local\PowerToys\" OLLAMA_LIBRARY_PATH=C:\Users\User\AppData\Local\Programs\Ollama\lib\ollama;C:\Users\User\AppData\Local\Programs\Ollama\lib\ollama\cuda_v12
time=2025-10-29T21:44:11.867-04:00 level=INFO source=runner.go:1337 msg="starting ollama engine"
time=2025-10-29T21:44:11.868-04:00 level=INFO source=runner.go:1372 msg="Server listening on 127.0.0.1:60946"
time=2025-10-29T21:44:11.879-04:00 level=DEBUG source=gguf.go:590 msg=general.architecture type=string
time=2025-10-29T21:44:11.879-04:00 level=DEBUG source=gguf.go:590 msg=tokenizer.ggml.model type=string
time=2025-10-29T21:44:11.879-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=general.alignment default=32
time=2025-10-29T21:44:11.880-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=general.alignment default=32
time=2025-10-29T21:44:11.881-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=general.file_type default=0
time=2025-10-29T21:44:11.881-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=general.name default=""
time=2025-10-29T21:44:11.881-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=general.description default=""
time=2025-10-29T21:44:11.881-04:00 level=INFO source=ggml.go:135 msg="" architecture=llama file_type=unknown name="" description="" num_tensors=0 num_key_values=3
time=2025-10-29T21:44:11.881-04:00 level=DEBUG source=ggml.go:94 msg="ggml backend load all from path" path=C:\Users\User\AppData\Local\Programs\Ollama\lib\ollama
load_backend: loaded CPU backend from C:\Users\User\AppData\Local\Programs\Ollama\lib\ollama\ggml-cpu-haswell.dll
time=2025-10-29T21:44:11.913-04:00 level=DEBUG source=ggml.go:94 msg="ggml backend load all from path" path=C:\Users\User\AppData\Local\Programs\Ollama\lib\ollama\cuda_v12
dl_load_library unable to load library C:\Users\User\AppData\Local\Programs\Ollama\lib\ollama\cuda_v12\ggml-cuda.dll: The specified module could not be found.
time=2025-10-29T21:44:12.094-04:00 level=INFO source=ggml.go:104 msg=system CPU.0.SSE3=1 CPU.0.SSSE3=1 CPU.0.AVX=1 CPU.0.AVX2=1 CPU.0.F16C=1 CPU.0.FMA=1 CPU.0.BMI2=1 CPU.0.LLAMAFILE=1 CPU.1.LLAMAFILE=1 compiler=cgo(clang)
time=2025-10-29T21:44:12.094-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=llama.block_count default=0
time=2025-10-29T21:44:12.094-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=llama.pooling_type default=0
time=2025-10-29T21:44:12.094-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=llama.expert_count default=0
time=2025-10-29T21:44:12.094-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=tokenizer.ggml.tokens default="&{size:0 values:[]}"
time=2025-10-29T21:44:12.094-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=tokenizer.ggml.scores default="&{size:0 values:[]}"
time=2025-10-29T21:44:12.094-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=tokenizer.ggml.token_type default="&{size:0 values:[]}"
time=2025-10-29T21:44:12.094-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=tokenizer.ggml.merges default="&{size:0 values:[]}"
time=2025-10-29T21:44:12.094-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=tokenizer.ggml.add_bos_token default=true
time=2025-10-29T21:44:12.094-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=tokenizer.ggml.bos_token_id default=0
time=2025-10-29T21:44:12.094-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=tokenizer.ggml.add_eos_token default=false
time=2025-10-29T21:44:12.094-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=tokenizer.ggml.eos_token_id default=0
time=2025-10-29T21:44:12.094-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=tokenizer.ggml.eos_token_ids default="&{size:0 values:[]}"
time=2025-10-29T21:44:12.094-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=tokenizer.ggml.pre default=""
time=2025-10-29T21:44:12.094-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=llama.block_count default=0
time=2025-10-29T21:44:12.094-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=llama.embedding_length default=0
time=2025-10-29T21:44:12.094-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=llama.attention.head_count default=0
time=2025-10-29T21:44:12.094-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=llama.attention.head_count_kv default=0
time=2025-10-29T21:44:12.094-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=llama.attention.key_length default=0
time=2025-10-29T21:44:12.094-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=llama.rope.dimension_count default=0
time=2025-10-29T21:44:12.094-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=llama.attention.layer_norm_rms_epsilon default=0
time=2025-10-29T21:44:12.094-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=llama.rope.freq_base default=100000
time=2025-10-29T21:44:12.094-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=llama.rope.scaling.factor default=1
time=2025-10-29T21:44:12.094-04:00 level=DEBUG source=runner.go:1312 msg="dummy model load took" duration=216.4957ms
time=2025-10-29T21:44:12.094-04:00 level=DEBUG source=runner.go:1317 msg="gathering device infos took" duration=0s
time=2025-10-29T21:44:12.096-04:00 level=TRACE source=runner.go:498 msg="runner enumerated devices" OLLAMA_LIBRARY_PATH="[C:\Users\User\AppData\Local\Programs\Ollama\lib\ollama C:\Users\User\AppData\Local\Programs\Ollama\lib\ollama\cuda_v12]" devices=[]
time=2025-10-29T21:44:12.096-04:00 level=DEBUG source=runner.go:468 msg="bootstrap discovery took" duration=308.1172ms OLLAMA_LIBRARY_PATH="[C:\Users\User\AppData\Local\Programs\Ollama\lib\ollama C:\Users\User\AppData\Local\Programs\Ollama\lib\ollama\cuda_v12]" extra_envs=map[]
time=2025-10-29T21:44:12.096-04:00 level=TRACE source=runner.go:471 msg="starting runner for device discovery" libDirs="[C:\Users\User\AppData\Local\Programs\Ollama\lib\ollama C:\Users\User\AppData\Local\Programs\Ollama\lib\ollama\cuda_v13]" extraEnvs=map[]
time=2025-10-29T21:44:12.099-04:00 level=INFO source=server.go:385 msg="starting runner" cmd="C:\Users\User\AppData\Local\Programs\Ollama\ollama.exe runner --ollama-engine --port 60955"
time=2025-10-29T21:44:12.099-04:00 level=DEBUG source=server.go:386 msg=subprocess OLLAMA_MODELS=G:\OllamaFiles\models OLLAMA_DEBUG=2 OLLAMA_CONTEXT_LENGTH=16384 OLLAMA_AUTO_UPDATE=false OLLAMA_RAGTEMP=C:\OllamaRAGTemp PATH="C:\Users\User\AppData\Local\Programs\Ollama\lib\ollama;C:\Users\User\AppData\Local\Programs\Ollama\lib\ollama\cuda_v13;C:\Program Files\PowerShell\7;C:\Program Files (x86)\oh-my-posh\bin\;C:\Users\User\Downloads\vcmake\vcpkg\installed\x64_windows;C:\Program Files\Common Files\Oracle\Java\javapath;C:\ActiveTcl\bin;C:\Program Files\Microsoft MPI\Bin\;C:\Users\User\AppData\Local\Programs\Python\Python314\Scripts;C:\Users\User\AppData\Local\Programs\Python\Python314;C:\Users\User\AppData\Local\Programs\Python\Python314\tct\tcl8.6;C:\Users\User\AppData\Local\Programs\Python\Python314\tcl;C:\Users\User\AppData\Local\Programs\Python\Python313\Scripts;C:\Users\User\AppData\Local\Programs\Python\Python313;C:\Users\User\AppData\Local\Programs\Python\Python313\tcl\tcl8.6;C:\Users\User\AppData\Local\Programs\Python\Python313\tcl;C:\Program Files\OpenSSL\bin;C:\Users\User\AppData\Roaming\ActiveState\bin;C:\Windows\system32;C:\Windows;C:\Windows\System32\Wbem;C:\Windows\System32\WindowsPowerShell\v1.0\;C:\Program Files\Graphviz\bin;C:\Windows\System32\OpenSSH\;C:\Program Files\Microsoft SQL Server\130\Tools\Binn\;C:\Program Files\Microsoft SQL Server\Client SDK\ODBC\170\Tools\Binn\;C:\Program Files\dotnet\;C:\Program Files\Microsoft SQL Server\150\Tools\Binn\;C:\Program Files (x86)\Microsoft SQL Server\150\DTS\Binn\;C:\Program Files\Microsoft SQL Server\150\DTS\Binn\;C:\Program Files (x86)\Microsoft SQL Server\150\Tools\Binn\;C:\Program Files\Azure Data Studio\bin;C:\WINDOWS\system32;C:\WINDOWS;C:\WINDOWS\System32\Wbem;C:\WINDOWS\System32\WindowsPowerShell\v1.0\;C:\WINDOWS\System32\OpenSSH\;C:\ProgramData\chocolatey\bin;C:\Program Files\Java\jdk-21\bin;C:\Program Files\NASM;C:\Program Files\Microsoft VS Code\bin;C:\Program Files\gs\gs10.03.0\bin;C:\Program Files (x86)\Microsoft SQL Server\160\DTS\Binn\;C:\Program Files\PuTTY\;C:\Program Files\RedHat\Podman\;C:\TDM-GCC-64\bin;D:\home\blt\github\vcpkg;C:\Program Files\CMake\bin;C:\Program Files\nodejs\;C:\Program Files\Go\bin;C:\Program Files\Pandoc\;C:\Program Files\Docker\Docker\resources\bin;C:\Program Files\PowerShell\7\;C:\Program Files (x86)\Windows Kits\10\Windows Performance Toolkit\;C:\Program Files\Git\cmd;C:\Users\User\AppData\Local\Programs\oh-my-posh\bin\;C:\Users\User\.local\bin;C:\Users\User\AppData\Local\Programs\Ollama;C:\Users\User\Downloads\vcmake\vcpkg\installed\x64_windows;C:\Users\User\.cargo\bin;C:\Program Files\OpenSSL\bin;C:\Users\User\AppData\Local\Microsoft\WindowsApps;C:\Program Files\Azure Data Studio\bin;C:\Program Files\PostgreSQL\15\bin;C:\Users\User\AppData\Local\GitHubDesktop\bin;C:\Users\User\Downloads\ffmpeg-master-latest-win64-gpl\ffmpeg-master-latest-win64-gpl\bin;C:\Program Files\Graphviz\bin;c:\Program Files\zig;c:\users\user\.local\bin;C:\Program Files (x86)\Intel\oneAPI;C:\Users\User\go\bin;C:\Users\User\.lmstudio\bin;C:\Users\User\.dotnet\tools;C:\Users\User\AppData\Local\Programs\Windsurf\bin;C:\Users\User\AppData\Local\reflex\bun\bin;C:\Users\User\AppData\Local\Programs\MiKTeX\miktex\bin\x64\;C:\Users\User\AppData\Roaming\npm;C:\Users\User\go\bin;C:\Users\User\AppData\Local\PowerToys\" OLLAMA_LIBRARY_PATH=C:\Users\User\AppData\Local\Programs\Ollama\lib\ollama;C:\Users\User\AppData\Local\Programs\Ollama\lib\ollama\cuda_v13
time=2025-10-29T21:44:12.161-04:00 level=INFO source=runner.go:1337 msg="starting ollama engine"
time=2025-10-29T21:44:12.162-04:00 level=INFO source=runner.go:1372 msg="Server listening on 127.0.0.1:60955"
time=2025-10-29T21:44:12.166-04:00 level=DEBUG source=gguf.go:590 msg=general.architecture type=string
time=2025-10-29T21:44:12.166-04:00 level=DEBUG source=gguf.go:590 msg=tokenizer.ggml.model type=string
time=2025-10-29T21:44:12.167-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=general.alignment default=32
time=2025-10-29T21:44:12.168-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=general.alignment default=32
time=2025-10-29T21:44:12.168-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=general.file_type default=0
time=2025-10-29T21:44:12.168-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=general.name default=""
time=2025-10-29T21:44:12.168-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=general.description default=""
time=2025-10-29T21:44:12.169-04:00 level=INFO source=ggml.go:135 msg="" architecture=llama file_type=unknown name="" description="" num_tensors=0 num_key_values=3
time=2025-10-29T21:44:12.169-04:00 level=DEBUG source=ggml.go:94 msg="ggml backend load all from path" path=C:\Users\User\AppData\Local\Programs\Ollama\lib\ollama
load_backend: loaded CPU backend from C:\Users\User\AppData\Local\Programs\Ollama\lib\ollama\ggml-cpu-haswell.dll
time=2025-10-29T21:44:12.197-04:00 level=DEBUG source=ggml.go:94 msg="ggml backend load all from path" path=C:\Users\User\AppData\Local\Programs\Ollama\lib\ollama\cuda_v13
ggml_cuda_init: failed to initialize CUDA: (null)
load_backend: loaded CUDA backend from C:\Users\User\AppData\Local\Programs\Ollama\lib\ollama\cuda_v13\ggml-cuda.dll
time=2025-10-29T21:44:12.283-04:00 level=INFO source=ggml.go:104 msg=system CPU.0.SSE3=1 CPU.0.SSSE3=1 CPU.0.AVX=1 CPU.0.AVX2=1 CPU.0.F16C=1 CPU.0.FMA=1 CPU.0.BMI2=1 CPU.0.LLAMAFILE=1 CPU.1.LLAMAFILE=1 compiler=cgo(clang)
time=2025-10-29T21:44:12.284-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=llama.block_count default=0
time=2025-10-29T21:44:12.284-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=llama.pooling_type default=0
time=2025-10-29T21:44:12.284-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=llama.expert_count default=0
time=2025-10-29T21:44:12.284-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=tokenizer.ggml.tokens default="&{size:0 values:[]}"
time=2025-10-29T21:44:12.284-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=tokenizer.ggml.scores default="&{size:0 values:[]}"
time=2025-10-29T21:44:12.284-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=tokenizer.ggml.token_type default="&{size:0 values:[]}"
time=2025-10-29T21:44:12.284-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=tokenizer.ggml.merges default="&{size:0 values:[]}"
time=2025-10-29T21:44:12.284-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=tokenizer.ggml.add_bos_token default=true
time=2025-10-29T21:44:12.284-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=tokenizer.ggml.bos_token_id default=0
time=2025-10-29T21:44:12.284-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=tokenizer.ggml.add_eos_token default=false
time=2025-10-29T21:44:12.284-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=tokenizer.ggml.eos_token_id default=0
time=2025-10-29T21:44:12.284-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=tokenizer.ggml.eos_token_ids default="&{size:0 values:[]}"
time=2025-10-29T21:44:12.284-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=tokenizer.ggml.pre default=""
time=2025-10-29T21:44:12.285-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=llama.block_count default=0
time=2025-10-29T21:44:12.285-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=llama.embedding_length default=0
time=2025-10-29T21:44:12.285-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=llama.attention.head_count default=0
time=2025-10-29T21:44:12.285-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=llama.attention.head_count_kv default=0
time=2025-10-29T21:44:12.285-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=llama.attention.key_length default=0
time=2025-10-29T21:44:12.285-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=llama.rope.dimension_count default=0
time=2025-10-29T21:44:12.285-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=llama.attention.layer_norm_rms_epsilon default=0
time=2025-10-29T21:44:12.285-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=llama.rope.freq_base default=100000
time=2025-10-29T21:44:12.285-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=llama.rope.scaling.factor default=1
time=2025-10-29T21:44:12.285-04:00 level=DEBUG source=runner.go:1312 msg="dummy model load took" duration=119.2606ms
time=2025-10-29T21:44:12.285-04:00 level=DEBUG source=runner.go:1317 msg="gathering device infos took" duration=0s
time=2025-10-29T21:44:12.285-04:00 level=TRACE source=runner.go:498 msg="runner enumerated devices" OLLAMA_LIBRARY_PATH="[C:\Users\User\AppData\Local\Programs\Ollama\lib\ollama C:\Users\User\AppData\Local\Programs\Ollama\lib\ollama\cuda_v13]" devices=[]
time=2025-10-29T21:44:12.286-04:00 level=DEBUG source=runner.go:468 msg="bootstrap discovery took" duration=189.4849ms OLLAMA_LIBRARY_PATH="[C:\Users\User\AppData\Local\Programs\Ollama\lib\ollama C:\Users\User\AppData\Local\Programs\Ollama\lib\ollama\cuda_v13]" extra_envs=map[]
time=2025-10-29T21:44:12.286-04:00 level=TRACE source=runner.go:471 msg="starting runner for device discovery" libDirs="[C:\Users\User\AppData\Local\Programs\Ollama\lib\ollama C:\Users\User\AppData\Local\Programs\Ollama\lib\ollama\rocm]" extraEnvs=map[]
time=2025-10-29T21:44:12.288-04:00 level=INFO source=server.go:385 msg="starting runner" cmd="C:\Users\User\AppData\Local\Programs\Ollama\ollama.exe runner --ollama-engine --port 60962"
time=2025-10-29T21:44:12.288-04:00 level=DEBUG source=server.go:386 msg=subprocess OLLAMA_MODELS=G:\OllamaFiles\models OLLAMA_DEBUG=2 OLLAMA_CONTEXT_LENGTH=16384 OLLAMA_AUTO_UPDATE=false OLLAMA_RAGTEMP=C:\OllamaRAGTemp PATH="C:\Users\User\AppData\Local\Programs\Ollama\lib\ollama;C:\Users\User\AppData\Local\Programs\Ollama\lib\ollama\rocm;C:\Program Files\PowerShell\7;C:\Program Files (x86)\oh-my-posh\bin\;C:\Users\User\Downloads\vcmake\vcpkg\installed\x64_windows;C:\Program Files\Common Files\Oracle\Java\javapath;C:\ActiveTcl\bin;C:\Program Files\Microsoft MPI\Bin\;C:\Users\User\AppData\Local\Programs\Python\Python314\Scripts;C:\Users\User\AppData\Local\Programs\Python\Python314;C:\Users\User\AppData\Local\Programs\Python\Python314\tct\tcl8.6;C:\Users\User\AppData\Local\Programs\Python\Python314\tcl;C:\Users\User\AppData\Local\Programs\Python\Python313\Scripts;C:\Users\User\AppData\Local\Programs\Python\Python313;C:\Users\User\AppData\Local\Programs\Python\Python313\tcl\tcl8.6;C:\Users\User\AppData\Local\Programs\Python\Python313\tcl;C:\Program Files\OpenSSL\bin;C:\Users\User\AppData\Roaming\ActiveState\bin;C:\Windows\system32;C:\Windows;C:\Windows\System32\Wbem;C:\Windows\System32\WindowsPowerShell\v1.0\;C:\Program Files\Graphviz\bin;C:\Windows\System32\OpenSSH\;C:\Program Files\Microsoft SQL Server\130\Tools\Binn\;C:\Program Files\Microsoft SQL Server\Client SDK\ODBC\170\Tools\Binn\;C:\Program Files\dotnet\;C:\Program Files\Microsoft SQL Server\150\Tools\Binn\;C:\Program Files (x86)\Microsoft SQL Server\150\DTS\Binn\;C:\Program Files\Microsoft SQL Server\150\DTS\Binn\;C:\Program Files (x86)\Microsoft SQL Server\150\Tools\Binn\;C:\Program Files\Azure Data Studio\bin;C:\WINDOWS\system32;C:\WINDOWS;C:\WINDOWS\System32\Wbem;C:\WINDOWS\System32\WindowsPowerShell\v1.0\;C:\WINDOWS\System32\OpenSSH\;C:\ProgramData\chocolatey\bin;C:\Program Files\Java\jdk-21\bin;C:\Program Files\NASM;C:\Program Files\Microsoft VS Code\bin;C:\Program Files\gs\gs10.03.0\bin;C:\Program Files (x86)\Microsoft SQL Server\160\DTS\Binn\;C:\Program Files\PuTTY\;C:\Program Files\RedHat\Podman\;C:\TDM-GCC-64\bin;D:\home\blt\github\vcpkg;C:\Program Files\CMake\bin;C:\Program Files\nodejs\;C:\Program Files\Go\bin;C:\Program Files\Pandoc\;C:\Program Files\Docker\Docker\resources\bin;C:\Program Files\PowerShell\7\;C:\Program Files (x86)\Windows Kits\10\Windows Performance Toolkit\;C:\Program Files\Git\cmd;C:\Users\User\AppData\Local\Programs\oh-my-posh\bin\;C:\Users\User\.local\bin;C:\Users\User\AppData\Local\Programs\Ollama;C:\Users\User\Downloads\vcmake\vcpkg\installed\x64_windows;C:\Users\User\.cargo\bin;C:\Program Files\OpenSSL\bin;C:\Users\User\AppData\Local\Microsoft\WindowsApps;C:\Program Files\Azure Data Studio\bin;C:\Program Files\PostgreSQL\15\bin;C:\Users\User\AppData\Local\GitHubDesktop\bin;C:\Users\User\Downloads\ffmpeg-master-latest-win64-gpl\ffmpeg-master-latest-win64-gpl\bin;C:\Program Files\Graphviz\bin;c:\Program Files\zig;c:\users\user\.local\bin;C:\Program Files (x86)\Intel\oneAPI;C:\Users\User\go\bin;C:\Users\User\.lmstudio\bin;C:\Users\User\.dotnet\tools;C:\Users\User\AppData\Local\Programs\Windsurf\bin;C:\Users\User\AppData\Local\reflex\bun\bin;C:\Users\User\AppData\Local\Programs\MiKTeX\miktex\bin\x64\;C:\Users\User\AppData\Roaming\npm;C:\Users\User\go\bin;C:\Users\User\AppData\Local\PowerToys\" OLLAMA_LIBRARY_PATH=C:\Users\User\AppData\Local\Programs\Ollama\lib\ollama;C:\Users\User\AppData\Local\Programs\Ollama\lib\ollama\rocm
time=2025-10-29T21:44:12.364-04:00 level=INFO source=runner.go:1337 msg="starting ollama engine"
time=2025-10-29T21:44:12.366-04:00 level=INFO source=runner.go:1372 msg="Server listening on 127.0.0.1:60962"
time=2025-10-29T21:44:12.375-04:00 level=DEBUG source=gguf.go:590 msg=general.architecture type=string
time=2025-10-29T21:44:12.375-04:00 level=DEBUG source=gguf.go:590 msg=tokenizer.ggml.model type=string
time=2025-10-29T21:44:12.375-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=general.alignment default=32
time=2025-10-29T21:44:12.377-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=general.alignment default=32
time=2025-10-29T21:44:12.377-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=general.file_type default=0
time=2025-10-29T21:44:12.377-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=general.name default=""
time=2025-10-29T21:44:12.377-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=general.description default=""
time=2025-10-29T21:44:12.377-04:00 level=INFO source=ggml.go:135 msg="" architecture=llama file_type=unknown name="" description="" num_tensors=0 num_key_values=3
time=2025-10-29T21:44:12.377-04:00 level=DEBUG source=ggml.go:94 msg="ggml backend load all from path" path=C:\Users\User\AppData\Local\Programs\Ollama\lib\ollama
load_backend: loaded CPU backend from C:\Users\User\AppData\Local\Programs\Ollama\lib\ollama\ggml-cpu-haswell.dll
time=2025-10-29T21:44:12.406-04:00 level=DEBUG source=ggml.go:94 msg="ggml backend load all from path" path=C:\Users\User\AppData\Local\Programs\Ollama\lib\ollama\rocm
ggml_cuda_init: failed to initialize ROCm: no ROCm-capable device is detected
load_backend: loaded ROCm backend from C:\Users\User\AppData\Local\Programs\Ollama\lib\ollama\rocm\ggml-hip.dll
time=2025-10-29T21:44:12.446-04:00 level=INFO source=ggml.go:104 msg=system CPU.0.SSE3=1 CPU.0.SSSE3=1 CPU.0.AVX=1 CPU.0.AVX2=1 CPU.0.F16C=1 CPU.0.FMA=1 CPU.0.BMI2=1 CPU.0.LLAMAFILE=1 CPU.1.LLAMAFILE=1 compiler=cgo(clang)
time=2025-10-29T21:44:12.446-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=llama.block_count default=0
time=2025-10-29T21:44:12.446-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=llama.pooling_type default=0
time=2025-10-29T21:44:12.446-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=llama.expert_count default=0
time=2025-10-29T21:44:12.446-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=tokenizer.ggml.tokens default="&{size:0 values:[]}"
time=2025-10-29T21:44:12.447-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=tokenizer.ggml.scores default="&{size:0 values:[]}"
time=2025-10-29T21:44:12.447-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=tokenizer.ggml.token_type default="&{size:0 values:[]}"
time=2025-10-29T21:44:12.447-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=tokenizer.ggml.merges default="&{size:0 values:[]}"
time=2025-10-29T21:44:12.447-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=tokenizer.ggml.add_bos_token default=true
time=2025-10-29T21:44:12.447-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=tokenizer.ggml.bos_token_id default=0
time=2025-10-29T21:44:12.447-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=tokenizer.ggml.add_eos_token default=false
time=2025-10-29T21:44:12.447-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=tokenizer.ggml.eos_token_id default=0
time=2025-10-29T21:44:12.447-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=tokenizer.ggml.eos_token_ids default="&{size:0 values:[]}"
time=2025-10-29T21:44:12.447-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=tokenizer.ggml.pre default=""
time=2025-10-29T21:44:12.447-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=llama.block_count default=0
time=2025-10-29T21:44:12.447-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=llama.embedding_length default=0
time=2025-10-29T21:44:12.447-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=llama.attention.head_count default=0
time=2025-10-29T21:44:12.447-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=llama.attention.head_count_kv default=0
time=2025-10-29T21:44:12.447-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=llama.attention.key_length default=0
time=2025-10-29T21:44:12.447-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=llama.rope.dimension_count default=0
time=2025-10-29T21:44:12.447-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=llama.attention.layer_norm_rms_epsilon default=0
time=2025-10-29T21:44:12.447-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=llama.rope.freq_base default=100000
time=2025-10-29T21:44:12.447-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=llama.rope.scaling.factor default=1
time=2025-10-29T21:44:12.447-04:00 level=DEBUG source=runner.go:1312 msg="dummy model load took" duration=72.7312ms
time=2025-10-29T21:44:12.447-04:00 level=DEBUG source=runner.go:1317 msg="gathering device infos took" duration=0s
time=2025-10-29T21:44:12.448-04:00 level=TRACE source=runner.go:498 msg="runner enumerated devices" OLLAMA_LIBRARY_PATH="[C:\Users\User\AppData\Local\Programs\Ollama\lib\ollama C:\Users\User\AppData\Local\Programs\Ollama\lib\ollama\rocm]" devices=[]
time=2025-10-29T21:44:12.448-04:00 level=DEBUG source=runner.go:468 msg="bootstrap discovery took" duration=162.4718ms OLLAMA_LIBRARY_PATH="[C:\Users\User\AppData\Local\Programs\Ollama\lib\ollama C:\Users\User\AppData\Local\Programs\Ollama\lib\ollama\rocm]" extra_envs=map[]
time=2025-10-29T21:44:12.448-04:00 level=DEBUG source=runner.go:120 msg="evluating which if any devices to filter out" initial_count=0
time=2025-10-29T21:44:12.448-04:00 level=TRACE source=runner.go:179 msg="supported GPU library combinations before filtering" supported=map[]
time=2025-10-29T21:44:12.448-04:00 level=DEBUG source=runner.go:41 msg="GPU bootstrap discovery took" duration=661.722ms
time=2025-10-29T21:44:12.449-04:00 level=INFO source=types.go:60 msg="inference compute" id=cpu library=cpu compute="" name=cpu description=cpu libdirs=ollama driver="" pci_id="" type="" total="11.9 GiB" available="6.0 GiB"
time=2025-10-29T21:44:12.449-04:00 level=INFO source=routes.go:1618 msg="entering low vram mode" "total vram"="0 B" threshold="20.0 GiB"
[GIN] 2025/10/29 - 21:44:12 | 200 | 541µs | 127.0.0.1 | HEAD "/"
[GIN] 2025/10/29 - 21:44:12 | 200 | 57.0626ms | 127.0.0.1 | GET "/api/tags"
[GIN] 2025/10/29 - 21:44:24 | 200 | 0s | 127.0.0.1 | GET "/api/version"
[GIN] 2025/10/29 - 21:44:24 | 200 | 0s | 127.0.0.1 | GET "/api/version"
[GIN] 2025/10/29 - 21:44:24 | 200 | 185.0917ms | 127.0.0.1 | GET "/api/tags"
time=2025-10-29T21:44:25.105-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=general.alignment default=32
[GIN] 2025/10/29 - 21:44:25 | 200 | 254.2493ms | 127.0.0.1 | POST "/api/show"
[GIN] 2025/10/29 - 21:44:40 | 200 | 0s | 127.0.0.1 | GET "/api/version"
time=2025-10-29T21:44:40.843-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=general.alignment default=32
[GIN] 2025/10/29 - 21:44:40 | 200 | 250.6383ms | 127.0.0.1 | POST "/api/show"
[GIN] 2025/10/29 - 21:44:40 | 200 | 106.9248ms | 127.0.0.1 | GET "/api/tags"
time=2025-10-29T21:44:41.009-04:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=general.alignment default=32
[GIN] 2025/10/29 - 21:44:41 | 200 | 166.299ms | 127.0.0.1 | POST "/api/show"
time=2025-10-29T21:44:41.196-04:00 level=DEBUG source=runner.go:267 msg="refreshing free memory"
time=2025-10-29T21:44:41.196-04:00 level=DEBUG source=runner.go:41 msg="overall device VRAM discovery took" duration=0s
[GIN] 2025/10/29 - 21:45:10 | 200 | 0s | 127.0.0.1 | GET "/api/version"
[GIN] 2025/10/29 - 21:45:11 | 200 | 78.7139ms | 127.0.0.1 | GET "/api/tags"
[GIN] 2025/10/29 - 21:45:41 | 200 | 0s | 127.0.0.1 | GET "/api/version"
[GIN] 2025/10/29 - 21:45:41 | 200 | 52.8417ms | 127.0.0.1 | GET "/api/tags"
[GIN] 2025/10/29 - 21:46:11 | 200 | 0s | 127.0.0.1 | GET "/api/version"
[GIN] 2025/10/29 - 21:46:11 | 200 | 74.5929ms | 127.0.0.1 | GET "/api/tags"
[GIN] 2025/10/29 - 21:46:41 | 200 | 0s | 127.0.0.1 | GET "/api/version"
[GIN] 2025/10/29 - 21:46:41 | 200 | 199.5415ms | 127.0.0.1 | GET "/api/tags"
[GIN] 2025/10/29 - 21:47:11 | 200 | 0s | 127.0.0.1 | GET "/api/version"
[GIN] 2025/10/29 - 21:47:11 | 200 | 70.797ms | 127.0.0.1 | GET "/api/tags"
[GIN] 2025/10/29 - 21:47:14 | 200 | 31.5551ms | 127.0.0.1 | POST "/api/show"
[GIN] 2025/10/29 - 21:47:23 | 200 | 33.352ms | 127.0.0.1 | POST "/api/show"
[GIN] 2025/10/29 - 21:47:23 | 200 | 35.7424ms | 127.0.0.1 | POST "/api/show"
[GIN] 2025/10/29 - 21:47:25 | 200 | 1.3314975s | 127.0.0.1 | POST "/api/chat"
[GIN] 2025/10/29 - 21:47:27 | 200 | 814.5542ms | 127.0.0.1 | POST "/api/chat"
[GIN] 2025/10/29 - 21:47:28 | 200 | 1.1260067s | 127.0.0.1 | POST "/api/chat"
[GIN] 2025/10/29 - 21:47:30 | 200 | 1.6694057s | 127.0.0.1 | POST "/api/chat"
[GIN] 2025/10/29 - 21:47:41 | 200 | 0s | 127.0.0.1 | GET "/api/version"
[GIN] 2025/10/29 - 21:47:41 | 200 | 55.0317ms | 127.0.0.1 | GET "/api/tags"
[GIN] 2025/10/29 - 21:48:11 | 200 | 0s | 127.0.0.1 | GET "/api/version"
[GIN] 2025/10/29 - 21:48:11 | 200 | 52.7654ms | 127.0.0.1 | GET "/api/tags"
[GIN] 2025/10/29 - 21:48:41 | 200 | 0s | 127.0.0.1 | GET "/api/version"
[GIN] 2025/10/29 - 21:48:41 | 200 | 49.1216ms | 127.0.0.1 | GET "/api/tags"
[GIN] 2025/10/29 - 21:49:11 | 200 | 0s | 127.0.0.1 | GET "/api/version"
[GIN] 2025/10/29 - 21:49:11 | 200 | 60.5865ms | 127.0.0.1 | GET "/api/tags"
[GIN] 2025/10/29 - 21:49:41 | 200 | 0s | 127.0.0.1 | GET "/api/version"
[GIN] 2025/10/29 - 21:49:41 | 200 | 54.8271ms | 127.0.0.1 | GET "/api/tags"
[GIN] 2025/10/29 - 21:50:11 | 200 | 0s | 127.0.0.1 | GET "/api/version"
[GIN] 2025/10/29 - 21:50:11 | 200 | 58.8494ms | 127.0.0.1 | GET "/api/tags"
[GIN] 2025/10/29 - 21:50:42 | 200 | 0s | 127.0.0.1 | GET "/api/version"
[GIN] 2025/10/29 - 21:50:42 | 200 | 53.4373ms | 127.0.0.1 | GET "/api/tags"
[GIN] 2025/10/29 - 21:51:12 | 200 | 0s | 127.0.0.1 | GET "/api/version"
[GIN] 2025/10/29 - 21:51:12 | 200 | 54.2343ms | 127.0.0.1 | GET "/api/tags"
[GIN] 2025/10/29 - 21:51:42 | 200 | 0s | 127.0.0.1 | GET "/api/version"
[GIN] 2025/10/29 - 21:51:42 | 200 | 59.3495ms | 127.0.0.1 | GET "/api/tags"
app.log:
time=2025-10-29T21:44:10.544-04:00 level=INFO source=app_windows.go:270 msg="starting Ollama" app=C:\Users\User\AppData\Local\Programs\Ollama version=0.12.7 OS=Windows/10.0.19045
time=2025-10-29T21:44:10.546-04:00 level=INFO source=app.go:231 msg="initialized tools registry" tool_count=0
time=2025-10-29T21:44:10.587-04:00 level=INFO source=app.go:246 msg="starting ollama server"
time=2025-10-29T21:44:10.931-04:00 level=INFO source=app.go:275 msg="starting ui server" port=60942
time=2025-10-29T21:44:10.953-04:00 level=INFO source=app.go:336 msg="deferring pending update for fast startup"
time=2025-10-29T21:44:13.931-04:00 level=INFO source=updater.go:252 msg="beginning update checker" interval=1h0m0s
time=2025-10-29T21:44:24.714-04:00 level=INFO source=ui.go:168 msg=site.serveHTTP http.method=GET http.path=/api/v1/settings http.pattern="GET /api/v1/settings" http.status=200 http.d=0s request_id=1761788664714383500 version=0.12.7
time=2025-10-29T21:44:24.719-04:00 level=INFO source=ui.go:168 msg=site.serveHTTP http.method=GET http.path=/api/v1/me http.pattern="GET /api/v1/me" http.status=200 http.d=84.5648ms request_id=1761788664634994100 version=0.12.7
time=2025-10-29T21:44:24.733-04:00 level=INFO source=ui.go:168 msg=site.serveHTTP http.method=GET http.path=/api/v1/health http.pattern="GET /api/v1/health" http.status=200 http.d=11.0824ms request_id=1761788664722425600 version=0.12.7
time=2025-10-29T21:44:24.749-04:00 level=INFO source=ui.go:168 msg=site.serveHTTP http.method=GET http.path=/api/v1/chats http.pattern="GET /api/v1/chats" http.status=200 http.d=24.2975ms request_id=1761788664725422700 version=0.12.7
time=2025-10-29T21:44:24.762-04:00 level=INFO source=server.go:343 msg=Matched "inference compute"="{Library:cpu Variant: Compute: Driver: Name:cpu VRAM:11.9 GiB}"
time=2025-10-29T21:44:24.762-04:00 level=INFO source=ui.go:168 msg=site.serveHTTP http.method=GET http.path=/api/v1/inference-compute http.pattern="GET /api/v1/inference-compute" http.status=200 http.d=43.6747ms request_id=1761788664718542400 version=0.12.7
time=2025-10-29T21:44:24.915-04:00 level=INFO source=ui.go:168 msg=site.serveHTTP http.method=GET http.path=/api/v1/models http.pattern="GET /api/v1/models" http.status=200 http.d=199.1878ms request_id=1761788664716368200 version=0.12.7
time=2025-10-29T21:44:24.984-04:00 level=INFO source=ui.go:168 msg=site.serveHTTP http.method=GET http.path=/api/v1/chats http.pattern="GET /api/v1/chats" http.status=200 http.d=25.1705ms request_id=1761788664959127700 version=0.12.7
time=2025-10-29T21:44:25.117-04:00 level=INFO source=ui.go:168 msg=site.serveHTTP http.method=GET http.path=/api/v1/model/qwen3:4b/capabilities http.pattern="GET /api/v1/model/{model}/capabilities" http.status=200 http.d=264.7715ms request_id=1761788664853051300 version=0.12.7
time=2025-10-29T21:44:25.399-04:00 level=INFO source=ui.go:168 msg=site.serveHTTP http.method=POST http.path=/api/v1/model/upstream http.pattern="POST /api/v1/model/upstream" http.status=200 http.d=162.7553ms request_id=1761788665236699300 version=0.12.7
time=2025-10-29T21:44:40.658-04:00 level=INFO source=ui.go:168 msg=site.serveHTTP http.method=GET http.path=/api/v1/chats http.pattern="GET /api/v1/chats" http.status=200 http.d=23.821ms request_id=1761788680635163500 version=0.12.7
time=2025-10-29T21:44:40.767-04:00 level=INFO source=ui.go:168 msg=site.serveHTTP http.method=GET http.path=/api/v1/settings http.pattern="GET /api/v1/settings" http.status=200 http.d=1.3556ms request_id=1761788680766282800 version=0.12.7
time=2025-10-29T21:44:40.878-04:00 level=INFO source=ui.go:168 msg=site.serveHTTP http.method=GET http.path=/api/v1/models http.pattern="GET /api/v1/models" http.status=200 http.d=111.7008ms request_id=1761788680767101300 version=0.12.7
time=2025-10-29T21:45:11.007-04:00 level=INFO source=ui.go:168 msg=site.serveHTTP http.method=GET http.path=/api/v1/models http.pattern="GET /api/v1/models" http.status=200 http.d=86.2728ms request_id=1761788710921244800 version=0.12.7
time=2025-10-29T21:45:41.072-04:00 level=INFO source=ui.go:168 msg=site.serveHTTP http.method=GET http.path=/api/v1/models http.pattern="GET /api/v1/models" http.status=200 http.d=58.9496ms request_id=1761788741013530600 version=0.12.7
time=2025-10-29T21:46:11.160-04:00 level=INFO source=ui.go:168 msg=site.serveHTTP http.method=GET http.path=/api/v1/models http.pattern="GET /api/v1/models" http.status=200 http.d=81.4457ms request_id=1761788771078929500 version=0.12.7
time=2025-10-29T21:46:41.391-04:00 level=INFO source=ui.go:168 msg=site.serveHTTP http.method=GET http.path=/api/v1/models http.pattern="GET /api/v1/models" http.status=200 http.d=212.4071ms request_id=1761788801178638100 version=0.12.7
time=2025-10-29T21:47:08.047-04:00 level=ERROR source=ui.go:1179 msg="chat stream error" error="Post "http://127.0.0.1:11434/api/chat": context canceled"
time=2025-10-29T21:47:08.047-04:00 level=INFO source=ui.go:168 msg=site.serveHTTP http.method=POST http.path=/api/v1/chat/new http.pattern="POST /api/v1/chat/{id}" http.status=200 http.d=2m27.4542528s request_id=1761788680592933600 version=0.12.7
time=2025-10-29T21:47:11.535-04:00 level=INFO source=ui.go:168 msg=site.serveHTTP http.method=GET http.path=/api/v1/models http.pattern="GET /api/v1/models" http.status=200 http.d=79.1006ms request_id=1761788831456019700 version=0.12.7
time=2025-10-29T21:47:14.028-04:00 level=INFO source=ui.go:168 msg=site.serveHTTP http.method=POST http.path=/api/v1/settings http.pattern="POST /api/v1/settings" http.status=200 http.d=594µs request_id=1761788834028069900 version=0.12.7
time=2025-10-29T21:47:14.035-04:00 level=INFO source=ui.go:168 msg=site.serveHTTP http.method=GET http.path=/api/v1/settings http.pattern="GET /api/v1/settings" http.status=200 http.d=0s request_id=1761788834035388500 version=0.12.7
time=2025-10-29T21:47:14.084-04:00 level=INFO source=ui.go:168 msg=site.serveHTTP http.method=GET http.path=/api/v1/model/gpt-oss:20b-cloud/capabilities http.pattern="GET /api/v1/model/{model}/capabilities" http.status=200 http.d=31.9895ms request_id=1761788834052226600 version=0.12.7
time=2025-10-29T21:47:14.396-04:00 level=INFO source=ui.go:168 msg=site.serveHTTP http.method=POST http.path=/api/v1/model/upstream http.pattern="POST /api/v1/model/upstream" http.status=200 http.d=351.1287ms request_id=1761788834045034700 version=0.12.7
time=2025-10-29T21:47:30.128-04:00 level=INFO source=ui.go:168 msg=site.serveHTTP http.method=POST http.path=/api/v1/chat/019a32c9-d991-770c-882b-2ab0187daa95 http.pattern="POST /api/v1/chat/{id}" http.status=200 http.d=6.3121519s request_id=1761788843815980500 version=0.12.7
time=2025-10-29T21:47:30.138-04:00 level=INFO source=ui.go:168 msg=site.serveHTTP http.method=GET http.path=/api/v1/chat/019a32c9-d991-770c-882b-2ab0187daa95 http.pattern="GET /api/v1/chat/{id}" http.status=200 http.d=9.1821ms request_id=1761788850129584200 version=0.12.7
time=2025-10-29T21:47:41.608-04:00 level=INFO source=ui.go:168 msg=site.serveHTTP http.method=GET http.path=/api/v1/models http.pattern="GET /api/v1/models" http.status=200 http.d=61.4188ms request_id=1761788861546597500 version=0.12.7
time=2025-10-29T21:48:11.680-04:00 level=INFO source=ui.go:168 msg=site.serveHTTP http.method=GET http.path=/api/v1/models http.pattern="GET /api/v1/models" http.status=200 http.d=58.3517ms request_id=1761788891622309800 version=0.12.7
time=2025-10-29T21:48:41.752-04:00 level=INFO source=ui.go:168 msg=site.serveHTTP http.method=GET http.path=/api/v1/models http.pattern="GET /api/v1/models" http.status=200 http.d=51.5636ms request_id=1761788921700600300 version=0.12.7
time=2025-10-29T21:49:11.828-04:00 level=INFO source=ui.go:168 msg=site.serveHTTP http.method=GET http.path=/api/v1/models http.pattern="GET /api/v1/models" http.status=200 http.d=66.4161ms request_id=1761788951762197100 version=0.12.7
time=2025-10-29T21:49:41.898-04:00 level=INFO source=ui.go:168 msg=site.serveHTTP http.method=GET http.path=/api/v1/models http.pattern="GET /api/v1/models" http.status=200 http.d=56.9438ms request_id=1761788981841884900 version=0.12.7
time=2025-10-29T21:50:11.983-04:00 level=INFO source=ui.go:168 msg=site.serveHTTP http.method=GET http.path=/api/v1/models http.pattern="GET /api/v1/models" http.status=200 http.d=65.2983ms request_id=1761789011918117000 version=0.12.7
time=2025-10-29T21:50:42.065-04:00 level=INFO source=ui.go:168 msg=site.serveHTTP http.method=GET http.path=/api/v1/models http.pattern="GET /api/v1/models" http.status=200 http.d=57.7472ms request_id=1761789042007958300 version=0.12.7
time=2025-10-29T21:51:12.137-04:00 level=INFO source=ui.go:168 msg=site.serveHTTP http.method=GET http.path=/api/v1/models http.pattern="GET /api/v1/models" http.status=200 http.d=56.72ms request_id=1761789072080460500 version=0.12.7
time=2025-10-29T21:51:42.216-04:00 level=INFO source=ui.go:168 msg=site.serveHTTP http.method=GET http.path=/api/v1/models http.pattern="GET /api/v1/models" http.status=200 http.d=63.3792ms request_id=1761789102152736300 version=0.12.7
time=2025-10-29T21:52:12.303-04:00 level=INFO source=ui.go:168 msg=site.serveHTTP http.method=GET http.path=/api/v1/models http.pattern="GET /api/v1/models" http.status=200 http.d=71.2135ms request_id=1761789132232153000 version=0.12.7
time=2025-10-29T21:52:42.392-04:00 level=INFO source=ui.go:168 msg=site.serveHTTP http.method=GET http.path=/api/v1/models http.pattern="GET /api/v1/models" http.status=200 http.d=65.8005ms request_id=1761789162327049800 version=0.12.7
@Panican-Whyasker commented on GitHub (Oct 31, 2025):
Waiting forever to load a 0.6B model with Ollama 0.12.7 on a Windows 11 Pro laptop with a 5th-gen Core i7 CPU, 8 GB of RAM, and an nVidia GeForce M840 GPU with 2 GB of dedicated VRAM. OLLAMA_DEBUG=2 was added to Environment Variables. As of 08:23 AM, here is the app.log:
time=2025-10-31T08:08:00.575+01:00 level=INFO source=app_windows.go:270 msg="starting Ollama" app=C:\Users\Joro\AppData\Local\Programs\Ollama version=0.12.7 OS=Windows/10.0.26100
time=2025-10-31T08:08:00.583+01:00 level=INFO source=app.go:231 msg="initialized tools registry" tool_count=0
time=2025-10-31T08:08:00.616+01:00 level=INFO source=app.go:246 msg="starting ollama server"
time=2025-10-31T08:08:00.965+01:00 level=INFO source=app.go:275 msg="starting ui server" port=53147
time=2025-10-31T08:08:03.966+01:00 level=INFO source=updater.go:252 msg="beginning update checker" interval=1h0m0s
...as well as the server.log:
time=2025-10-31T08:08:02.049+01:00 level=INFO source=routes.go:1524 msg="server config" env="map[CUDA_VISIBLE_DEVICES: GGML_VK_VISIBLE_DEVICES: GPU_DEVICE_ORDINAL: HIP_VISIBLE_DEVICES: HSA_OVERRIDE_GFX_VERSION: HTTPS_PROXY: HTTP_PROXY: NO_PROXY: OLLAMA_CONTEXT_LENGTH:4096 OLLAMA_DEBUG:DEBUG-4 OLLAMA_FLASH_ATTENTION:false OLLAMA_GPU_OVERHEAD:0 OLLAMA_HOST:http://127.0.0.1:11434 OLLAMA_INTEL_GPU:false OLLAMA_KEEP_ALIVE:5m0s OLLAMA_KV_CACHE_TYPE: OLLAMA_LLM_LIBRARY: OLLAMA_LOAD_TIMEOUT:5m0s OLLAMA_MAX_LOADED_MODELS:0 OLLAMA_MAX_QUEUE:512 OLLAMA_MODELS:C:\Users\Joro\.ollama\models OLLAMA_MULTIUSER_CACHE:false OLLAMA_NEW_ENGINE:false OLLAMA_NOHISTORY:false OLLAMA_NOPRUNE:false OLLAMA_NUM_PARALLEL:1 OLLAMA_ORIGINS:[http://localhost https://localhost http://localhost:* https://localhost:* http://127.0.0.1 https://127.0.0.1 http://127.0.0.1:* https://127.0.0.1:* http://0.0.0.0 https://0.0.0.0 http://0.0.0.0:* https://0.0.0.0:* app://* file://* tauri://* vscode-webview://* vscode-file://*] OLLAMA_REMOTES:[ollama.com] OLLAMA_SCHED_SPREAD:false ROCR_VISIBLE_DEVICES:]"
time=2025-10-31T08:08:02.110+01:00 level=INFO source=images.go:522 msg="total blobs: 12"
time=2025-10-31T08:08:02.113+01:00 level=INFO source=images.go:529 msg="total unused blobs removed: 0"
time=2025-10-31T08:08:02.119+01:00 level=INFO source=routes.go:1577 msg="Listening on 127.0.0.1:11434 (version 0.12.7)"
time=2025-10-31T08:08:02.120+01:00 level=DEBUG source=sched.go:120 msg="starting llm scheduler"
time=2025-10-31T08:08:02.125+01:00 level=INFO source=runner.go:76 msg="discovering available GPUs..."
time=2025-10-31T08:08:02.126+01:00 level=TRACE source=runner.go:471 msg="starting runner for device discovery" libDirs="[C:\Users\Joro\AppData\Local\Programs\Ollama\lib\ollama C:\Users\Joro\AppData\Local\Programs\Ollama\lib\ollama\cuda_v12]" extraEnvs=map[]
time=2025-10-31T08:08:02.148+01:00 level=INFO source=server.go:385 msg="starting runner" cmd="C:\Users\Joro\AppData\Local\Programs\Ollama\ollama.exe runner --ollama-engine --port 53150"
time=2025-10-31T08:08:02.149+01:00 level=DEBUG source=server.go:386 msg=subprocess OLLAMA_CONTEXT_LENGTH=4096 PATH="C:\Users\Joro\AppData\Local\Programs\Ollama\lib\ollama;C:\Users\Joro\AppData\Local\Programs\Ollama\lib\ollama\cuda_v12;C:\WINDOWS\system32;C:\WINDOWS;C:\WINDOWS\System32\Wbem;C:\WINDOWS\System32\WindowsPowerShell\v1.0\;C:\WINDOWS\System32\OpenSSH\;C:\Program Files (x86)\NVIDIA Corporation\PhysX\Common;C:\Program Files\PowerShell\7\;C:\Users\Joro\AppData\Local\Microsoft\WindowsApps;;C:\Users\Joro\AppData\Local\Programs\Ollama" OLLAMA_DEBUG=2 OLLAMA_MODELS=C:\Users\Joro.ollama\models OLLAMA_LIBRARY_PATH=C:\Users\Joro\AppData\Local\Programs\Ollama\lib\ollama;C:\Users\Joro\AppData\Local\Programs\Ollama\lib\ollama\cuda_v12
time=2025-10-31T08:08:02.226+01:00 level=INFO source=runner.go:1337 msg="starting ollama engine"
time=2025-10-31T08:08:02.231+01:00 level=INFO source=runner.go:1372 msg="Server listening on 127.0.0.1:53150"
time=2025-10-31T08:08:02.244+01:00 level=DEBUG source=gguf.go:590 msg=general.architecture type=string
time=2025-10-31T08:08:02.245+01:00 level=DEBUG source=gguf.go:590 msg=tokenizer.ggml.model type=string
time=2025-10-31T08:08:02.246+01:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=general.alignment default=32
time=2025-10-31T08:08:02.259+01:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=general.alignment default=32
time=2025-10-31T08:08:02.259+01:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=general.file_type default=0
time=2025-10-31T08:08:02.259+01:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=general.name default=""
time=2025-10-31T08:08:02.259+01:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=general.description default=""
time=2025-10-31T08:08:02.260+01:00 level=INFO source=ggml.go:135 msg="" architecture=llama file_type=unknown name="" description="" num_tensors=0 num_key_values=3
time=2025-10-31T08:08:02.260+01:00 level=DEBUG source=ggml.go:94 msg="ggml backend load all from path" path=C:\Users\Joro\AppData\Local\Programs\Ollama\lib\ollama
load_backend: loaded CPU backend from C:\Users\Joro\AppData\Local\Programs\Ollama\lib\ollama\ggml-cpu-haswell.dll
time=2025-10-31T08:08:02.955+01:00 level=DEBUG source=ggml.go:94 msg="ggml backend load all from path" path=C:\Users\Joro\AppData\Local\Programs\Ollama\lib\ollama\cuda_v12
time=2025-10-31T08:08:32.127+01:00 level=INFO source=runner.go:495 msg="failure during GPU discovery" OLLAMA_LIBRARY_PATH="[C:\Users\Joro\AppData\Local\Programs\Ollama\lib\ollama C:\Users\Joro\AppData\Local\Programs\Ollama\lib\ollama\cuda_v12]" extra_envs=map[] error="failed to finish discovery before timeout"
time=2025-10-31T08:08:32.127+01:00 level=TRACE source=runner.go:498 msg="runner enumerated devices" OLLAMA_LIBRARY_PATH="[C:\Users\Joro\AppData\Local\Programs\Ollama\lib\ollama C:\Users\Joro\AppData\Local\Programs\Ollama\lib\ollama\cuda_v12]" devices=[]
time=2025-10-31T08:08:32.127+01:00 level=DEBUG source=runner.go:468 msg="bootstrap discovery took" duration=30.0015049s OLLAMA_LIBRARY_PATH="[C:\Users\Joro\AppData\Local\Programs\Ollama\lib\ollama C:\Users\Joro\AppData\Local\Programs\Ollama\lib\ollama\cuda_v12]" extra_envs=map[]
time=2025-10-31T08:08:32.127+01:00 level=TRACE source=runner.go:471 msg="starting runner for device discovery" libDirs="[C:\Users\Joro\AppData\Local\Programs\Ollama\lib\ollama C:\Users\Joro\AppData\Local\Programs\Ollama\lib\ollama\cuda_v13]" extraEnvs=map[]
time=2025-10-31T08:08:32.133+01:00 level=INFO source=server.go:385 msg="starting runner" cmd="C:\Users\Joro\AppData\Local\Programs\Ollama\ollama.exe runner --ollama-engine --port 62942"
time=2025-10-31T08:08:32.133+01:00 level=DEBUG source=server.go:386 msg=subprocess OLLAMA_CONTEXT_LENGTH=4096 PATH="C:\Users\Joro\AppData\Local\Programs\Ollama\lib\ollama;C:\Users\Joro\AppData\Local\Programs\Ollama\lib\ollama\cuda_v13;C:\WINDOWS\system32;C:\WINDOWS;C:\WINDOWS\System32\Wbem;C:\WINDOWS\System32\WindowsPowerShell\v1.0\;C:\WINDOWS\System32\OpenSSH\;C:\Program Files (x86)\NVIDIA Corporation\PhysX\Common;C:\Program Files\PowerShell\7\;C:\Users\Joro\AppData\Local\Microsoft\WindowsApps;;C:\Users\Joro\AppData\Local\Programs\Ollama" OLLAMA_DEBUG=2 OLLAMA_MODELS=C:\Users\Joro.ollama\models OLLAMA_LIBRARY_PATH=C:\Users\Joro\AppData\Local\Programs\Ollama\lib\ollama;C:\Users\Joro\AppData\Local\Programs\Ollama\lib\ollama\cuda_v13
time=2025-10-31T08:08:32.194+01:00 level=INFO source=runner.go:1337 msg="starting ollama engine"
time=2025-10-31T08:08:32.196+01:00 level=INFO source=runner.go:1372 msg="Server listening on 127.0.0.1:62942"
time=2025-10-31T08:08:32.201+01:00 level=DEBUG source=gguf.go:590 msg=general.architecture type=string
time=2025-10-31T08:08:32.201+01:00 level=DEBUG source=gguf.go:590 msg=tokenizer.ggml.model type=string
time=2025-10-31T08:08:32.202+01:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=general.alignment default=32
time=2025-10-31T08:08:32.203+01:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=general.alignment default=32
time=2025-10-31T08:08:32.203+01:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=general.file_type default=0
time=2025-10-31T08:08:32.203+01:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=general.name default=""
time=2025-10-31T08:08:32.203+01:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=general.description default=""
time=2025-10-31T08:08:32.203+01:00 level=INFO source=ggml.go:135 msg="" architecture=llama file_type=unknown name="" description="" num_tensors=0 num_key_values=3
time=2025-10-31T08:08:32.203+01:00 level=DEBUG source=ggml.go:94 msg="ggml backend load all from path" path=C:\Users\Joro\AppData\Local\Programs\Ollama\lib\ollama
load_backend: loaded CPU backend from C:\Users\Joro\AppData\Local\Programs\Ollama\lib\ollama\ggml-cpu-haswell.dll
time=2025-10-31T08:08:32.232+01:00 level=DEBUG source=ggml.go:94 msg="ggml backend load all from path" path=C:\Users\Joro\AppData\Local\Programs\Ollama\lib\ollama\cuda_v13
time=2025-10-31T08:09:04.206+01:00 level=INFO source=runner.go:495 msg="failure during GPU discovery" OLLAMA_LIBRARY_PATH="[C:\Users\Joro\AppData\Local\Programs\Ollama\lib\ollama C:\Users\Joro\AppData\Local\Programs\Ollama\lib\ollama\cuda_v13]" extra_envs=map[] error="failed to finish discovery before timeout"
time=2025-10-31T08:09:04.206+01:00 level=TRACE source=runner.go:498 msg="runner enumerated devices" OLLAMA_LIBRARY_PATH="[C:\Users\Joro\AppData\Local\Programs\Ollama\lib\ollama C:\Users\Joro\AppData\Local\Programs\Ollama\lib\ollama\cuda_v13]" devices=[]
time=2025-10-31T08:09:04.206+01:00 level=DEBUG source=runner.go:468 msg="bootstrap discovery took" duration=32.0789732s OLLAMA_LIBRARY_PATH="[C:\Users\Joro\AppData\Local\Programs\Ollama\lib\ollama C:\Users\Joro\AppData\Local\Programs\Ollama\lib\ollama\cuda_v13]" extra_envs=map[]
time=2025-10-31T08:09:04.206+01:00 level=TRACE source=runner.go:471 msg="starting runner for device discovery" libDirs="[C:\Users\Joro\AppData\Local\Programs\Ollama\lib\ollama C:\Users\Joro\AppData\Local\Programs\Ollama\lib\ollama\rocm]" extraEnvs=map[]
time=2025-10-31T08:09:04.243+01:00 level=INFO source=server.go:385 msg="starting runner" cmd="C:\Users\Joro\AppData\Local\Programs\Ollama\ollama.exe runner --ollama-engine --port 62949"
time=2025-10-31T08:09:04.243+01:00 level=DEBUG source=server.go:386 msg=subprocess OLLAMA_CONTEXT_LENGTH=4096 PATH="C:\Users\Joro\AppData\Local\Programs\Ollama\lib\ollama;C:\Users\Joro\AppData\Local\Programs\Ollama\lib\ollama\rocm;C:\WINDOWS\system32;C:\WINDOWS;C:\WINDOWS\System32\Wbem;C:\WINDOWS\System32\WindowsPowerShell\v1.0\;C:\WINDOWS\System32\OpenSSH\;C:\Program Files (x86)\NVIDIA Corporation\PhysX\Common;C:\Program Files\PowerShell\7\;C:\Users\Joro\AppData\Local\Microsoft\WindowsApps;;C:\Users\Joro\AppData\Local\Programs\Ollama" OLLAMA_DEBUG=2 OLLAMA_MODELS=C:\Users\Joro.ollama\models OLLAMA_LIBRARY_PATH=C:\Users\Joro\AppData\Local\Programs\Ollama\lib\ollama;C:\Users\Joro\AppData\Local\Programs\Ollama\lib\ollama\rocm
time=2025-10-31T08:09:21.729+01:00 level=INFO source=runner.go:1337 msg="starting ollama engine"
time=2025-10-31T08:09:21.732+01:00 level=INFO source=runner.go:1372 msg="Server listening on 127.0.0.1:62949"
time=2025-10-31T08:09:21.817+01:00 level=DEBUG source=gguf.go:590 msg=general.architecture type=string
time=2025-10-31T08:09:21.817+01:00 level=DEBUG source=gguf.go:590 msg=tokenizer.ggml.model type=string
time=2025-10-31T08:09:21.817+01:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=general.alignment default=32
time=2025-10-31T08:09:21.819+01:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=general.alignment default=32
time=2025-10-31T08:09:21.819+01:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=general.file_type default=0
time=2025-10-31T08:09:21.819+01:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=general.name default=""
time=2025-10-31T08:09:21.819+01:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=general.description default=""
time=2025-10-31T08:09:21.819+01:00 level=INFO source=ggml.go:135 msg="" architecture=llama file_type=unknown name="" description="" num_tensors=0 num_key_values=3
time=2025-10-31T08:09:21.819+01:00 level=DEBUG source=ggml.go:94 msg="ggml backend load all from path" path=C:\Users\Joro\AppData\Local\Programs\Ollama\lib\ollama
load_backend: loaded CPU backend from C:\Users\Joro\AppData\Local\Programs\Ollama\lib\ollama\ggml-cpu-haswell.dll
time=2025-10-31T08:09:21.852+01:00 level=DEBUG source=ggml.go:94 msg="ggml backend load all from path" path=C:\Users\Joro\AppData\Local\Programs\Ollama\lib\ollama\rocm
time=2025-10-31T08:10:04.207+01:00 level=INFO source=runner.go:495 msg="failure during GPU discovery" OLLAMA_LIBRARY_PATH="[C:\Users\Joro\AppData\Local\Programs\Ollama\lib\ollama C:\Users\Joro\AppData\Local\Programs\Ollama\lib\ollama\rocm]" extra_envs=map[] error="failed to finish discovery before timeout"
time=2025-10-31T08:10:04.207+01:00 level=TRACE source=runner.go:498 msg="runner enumerated devices" OLLAMA_LIBRARY_PATH="[C:\Users\Joro\AppData\Local\Programs\Ollama\lib\ollama C:\Users\Joro\AppData\Local\Programs\Ollama\lib\ollama\rocm]" devices=[]
time=2025-10-31T08:10:04.207+01:00 level=DEBUG source=runner.go:468 msg="bootstrap discovery took" duration=1m0.0009716s OLLAMA_LIBRARY_PATH="[C:\Users\Joro\AppData\Local\Programs\Ollama\lib\ollama C:\Users\Joro\AppData\Local\Programs\Ollama\lib\ollama\rocm]" extra_envs=map[]
time=2025-10-31T08:10:04.207+01:00 level=DEBUG source=runner.go:120 msg="evluating which if any devices to filter out" initial_count=0
time=2025-10-31T08:10:04.208+01:00 level=TRACE source=runner.go:179 msg="supported GPU library combinations before filtering" supported=map[]
time=2025-10-31T08:10:04.208+01:00 level=DEBUG source=runner.go:41 msg="GPU bootstrap discovery took" duration=2m2.0872522s
time=2025-10-31T08:10:04.209+01:00 level=INFO source=types.go:60 msg="inference compute" id=cpu library=cpu compute="" name=cpu description=cpu libdirs=ollama driver="" pci_id="" type="" total="7.9 GiB" available="802.4 MiB"
time=2025-10-31T08:10:04.209+01:00 level=INFO source=routes.go:1618 msg="entering low vram mode" "total vram"="0 B" threshold="20.0 GiB"
[GIN] 2025/10/31 - 08:12:58 | 200 | 8.1601ms | 127.0.0.1 | GET "/api/version"
[GIN] 2025/10/31 - 08:13:03 | 200 | 0s | 127.0.0.1 | HEAD "/"
[GIN] 2025/10/31 - 08:13:03 | 200 | 38.5777ms | 127.0.0.1 | GET "/api/tags"
[GIN] 2025/10/31 - 08:13:24 | 200 | 0s | 127.0.0.1 | HEAD "/"
time=2025-10-31T08:13:24.904+01:00 level=DEBUG source=ggml.go:276 msg="key with type not found" key=general.alignment default=32
[GIN] 2025/10/31 - 08:13:24 | 200 | 173.1176ms | 127.0.0.1 | POST "/api/show"
time=2025-10-31T08:13:25.046+01:00 level=DEBUG source=runner.go:267 msg="refreshing free memory"
time=2025-10-31T08:13:25.046+01:00 level=DEBUG source=runner.go:41 msg="overall device VRAM discovery took" duration=0s
@dhiltgen commented on GitHub (Oct 31, 2025):
I made a small fix to some logging logic for Windows in 0.12.8 which likely won't fix this, but may help us get more details on the failure. Anyone still seeing the hang, please install 0.12.8, set `$env:OLLAMA_DEBUG="2"`, and share the server log from startup through a request that hangs so we can try to narrow down what's going wrong.
@katmandoo212 commented on GitHub (Nov 1, 2025):
I ran 0.12.8 with OLLAMA_DEBUG=2 as suggested. Trying to run a local model hangs, but Cloud models work. I have attached my server.log and app.log.
@katmandoo212 commented on GitHub (Nov 1, 2025):
Just to let everyone know, I tried 0.12.9 on Windows 10 with no GPU, and it still hangs (the spinner keeps spinning) when loading local models, but cloud models do work.
@dhiltgen commented on GitHub (Nov 4, 2025):
It sounds like there's a deadlock someplace, but I'm not sure where the system is getting hung up. Let's try to isolate things a little more. @katmandoo212, can you quit the GUI app by exiting the tray application, then run the server and CLI in a terminal.
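The command snippet itself did not survive here; a plausible reconstruction, assuming PowerShell and a foreground server with debug logging enabled, is:

```powershell
# Terminal 1: run the server in the foreground with verbose logging
$env:OLLAMA_DEBUG="2"
ollama serve
```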
then in another terminal
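This snippet is also missing; presumably a plain model run, where `qwen3:0.6b` stands in for whichever local model hangs:

```powershell
# Terminal 2: trigger a local model load so the hang can be reproduced
ollama run qwen3:0.6b "hello"
```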
Also, when it gets into this stuck state, do you see the `ollama serve` process chewing up a CPU core in Task Manager, or is the system completely idle?
Comparing logs with other systems, it seems like it may be gathering information about the CPUs in the system. Is there anything unusual about your CPU setup?
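For what it's worth, one way to sample this outside Task Manager (assuming the relevant processes match `ollama*`):

```powershell
# Poll cumulative CPU seconds for ollama processes; if the numbers keep
# climbing while loading is stuck, the server is spinning rather than idle
1..5 | ForEach-Object { Get-Process ollama* | Select-Object Name, Id, CPU; Start-Sleep -Seconds 2 }
```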
@katmandoo212 commented on GitHub (Nov 5, 2025):
Here is my server log from the latest run using your instructions.
I hope that helps.