mirror of
https://github.com/ollama/ollama.git
synced 2026-05-22 21:51:37 -05:00
Closed
opened 2026-05-04 12:48:59 -05:00 by GiteaMirror
·
7 comments
No Branch/Tag Specified
main
dhiltgen/llama-runner
parth-migrate-pi
codex/make-integration-hidden-and-lunchable
hoyyeva/migrate-pi
hoyyeva/opencode-thinking
hoyyeva/anthropic-local-image-path
parth-launch-codex-app
hoyyeva/anthropic-reference-images-path
parth-anthropic-reference-images-path
brucemacd/download-before-remove
hoyyeva/editor-config-repair
parth-mlx-decode-checkpoints
hoyyeva/qwen
parth/hide-claude-desktop-till-release
parth-add-claude-code-autoinstall
release_v0.22.0
pdevine/manifest-list
codex/fix-codex-model-metadata-warning
pdevine/addressable-manifest
brucemacd/launch-fetch-reccomended
jmorganca/llama-compat
launch-copilot-cli
release_v0.20.7
parth-auto-save-backup
parth-test
jmorganca/gemma4-audio-replacements
fix-manifest-digest-on-pull
hoyyeva/vscode-improve
brucemacd/install-server-wait
parth/update-claude-docs
brucemac/start-ap-install
pdevine/mlx-update
pdevine/qwen35_vision
drifkin/api-show-fallback
mintlify/image-generation-1773352582
hoyyeva/server-context-length-local-config
jmorganca/faster-reptition-penalties
jmorganca/convert-nemotron
parth-pi-thinking
pdevine/sampling-penalties
jmorganca/fix-create-quantization-memory
dongchen/resumable_transfer_fix
pdevine/sampling-cache-error
jessegross/mlx-usage
hoyyeva/openclaw-config
hoyyeva/app-html
pdevine/qwen3next
brucemacd/sign-sh-install
brucemacd/tui-update
brucemacd/usage-api
jmorganca/launch-empty
fix-app-dist-embed
mxyng/mlx-compile
mxyng/mlx-quant
mxyng/mlx-glm4.7
mxyng/mlx
brucemacd/simplify-model-picker
jmorganca/qwen3-concurrent
fix-glm-4.7-flash-mla-config
drifkin/qwen3-coder-opening-tag
brucemacd/usage-cli
fix-cuda12-fattn-shmem
ollama-imagegen-docs
parth/fix-multiline-inputs
brucemacd/config-docs
mxyng/model-files
mxyng/simple-execute
fix-imagegen-ollama-models
mxyng/async-upload
jmorganca/lazy-no-dtype-changes
imagegen-auto-detect-create
parth/decrease-concurrent-download-hf
fix-mlx-quantize-init
jmorganca/x-cleanup
usage
imagegen-readme
jmorganca/glm-image
mlx-gpu-cd
jmorganca/imagegen-modelfile
parth/agent-skills
parth/agent-allowlist
parth/signed-in-offline
parth/agents
parth/fix-context-chopping
improve-cloud-flow
parth/add-models-websearch
parth/prompt-renderer-mcp
jmorganca/native-settings
jmorganca/download-stream-hash
jmorganca/client2-rebased
brucemacd/oai-chat-req-multipart
jessegross/multi_chunk_reserve
grace/additional-omit-empty
grace/mistral-3-large
mxyng/tokenizer2
mxyng/tokenizer
jessegross/flash
hoyyeva/windows-nacked-app
mxyng/cleanup-attention
grace/deepseek-parser
hoyyeva/remember-unsent-prompt
parth/add-lfs-pointer-error-conversion
parth/olmo2-test2
hoyyeva/ollama-launchagent-plist
nicole/olmo-model
parth/olmo-test
mxyng/remove-embedded
parth/render-template
jmorganca/intellect-3
parth/remove-prealloc-linter
jmorganca/cmd-eval
nicole/nomic-embed-text-fix
mxyng/lint-2
hoyyeva/add-gemini-3-pro-preview
hoyyeva/load-model-list
mxyng/expand-path
mxyng/environ-2
hoyyeva/deeplink-json-encoding
parth/improve-tool-calling-tests
hoyyeva/conversation
hoyyeva/assistant-edit-response
hoyyeva/thinking
origin/brucemacd/invalid-char-i-err
parth/improve-tool-calling
jmorganca/required-omitempty
grace/qwen3-vl-tests
mxyng/iter-client
parth/docs-readme
nicole/embed-test
pdevine/integration-benchstat
parth/remove-generate-cmd
parth/add-toolcall-id
mxyng/server-tests
jmorganca/glm-4.6
jmorganca/gin-h-compat
drifkin/stable-tool-args
pdevine/qwen3-more-thinking
parth/add-websearch-client
nicole/websearch_local
jmorganca/qwen3-coder-updates
grace/deepseek-v3-migration-tests
mxyng/fix-create
jmorganca/cloud-errors
pdevine/parser-tidy
revert-12233-parth/simplify-entrypoints-runner
parth/enable-so-gpt-oss
brucemacd/qwen3vl
jmorganca/readme-simplify
parth/gpt-oss-structured-outputs
revert-12039-jmorganca/tools-braces
mxyng/embeddings
mxyng/gguf
mxyng/benchmark
mxyng/types-null
parth/move-parsing
mxyng/gemma2
jmorganca/docs
mxyng/16-bit
mxyng/create-stdin
pdevine/authorizedkeys
mxyng/quant
parth/opt-in-error-context-window
brucemacd/cache-models
brucemacd/runner-completion
jmorganca/llama-update-6
brucemacd/benchmark-list
brucemacd/partial-read-caps
parth/deepseek-r1-tools
mxyng/omit-array
parth/tool-prefix-temp
brucemacd/runner-test
jmorganca/qwen25vl
brucemacd/model-forward-test-ext
parth/python-function-parsing
jmorganca/cuda-compression-none
drifkin/num-parallel
drifkin/chat-truncation-fix
jmorganca/sync
parth/python-tools-calling
drifkin/array-head-count
brucemacd/create-no-loop
parth/server-enable-content-stream-with-tools
qwen25omni
mxyng/v3
brucemacd/ropeconfig
jmorganca/silence-tokenizer
parth/sample-so-test
parth/sampling-structured-outputs
brucemacd/doc-go-engine
parth/constrained-sampling-json
jmorganca/mistral-wip
brucemacd/mistral-small-convert
parth/sample-unmarshal-json-for-params
brucemacd/jomorganca/mistral
pdevine/bfloat16
jmorganca/mistral
brucemacd/mistral
pdevine/logging
parth/sample-correctness-fix
parth/sample-fix-sorting
jmorgan/sample-fix-sorting-extras
jmorganca/temp-0-images
brucemacd/parallel-embed-models
brucemacd/shim-grammar
jmorganca/fix-gguf-error
bmizerany/nameswork
jmorganca/faster-releases
bmizerany/validatenames
brucemacd/err-no-vocab
brucemacd/rope-config
brucemacd/err-hint
brucemacd/qwen2_5
brucemacd/logprobs
brucemacd/new_runner_graph_bench
progress-flicker
brucemacd/forward-test
brucemacd/go_qwen2
pdevine/gemma2
jmorganca/add-missing-symlink-eval
mxyng/next-debug
parth/set-context-size-openai
brucemacd/next-bpe-bench
brucemacd/next-bpe-test
brucemacd/new_runner_e2e
brucemacd/new_runner_qwen2
pdevine/convert-cohere2
brucemacd/convert-cli
parth/log-probs
mxyng/next-mlx
mxyng/cmd-history
parth/templating
parth/tokenize-detokenize
brucemacd/check-key-register
bmizerany/grammar
jmorganca/vendor-081b29bd
mxyng/func-checks
jmorganca/fix-null-format
parth/fix-default-to-warn-json
jmorganca/qwen2vl
jmorganca/no-concat
parth/cmd-cleanup-SO
brucemacd/check-key-register-structured-err
parth/openai-stream-usage
parth/fix-referencing-so
stream-tools-stop
jmorganca/degin-1
brucemacd/install-path-clean
brucemacd/push-name-validation
brucemacd/browser-key-register
jmorganca/openai-fix-first-message
jmorganca/fix-proxy
jessegross/sample
parth/disallow-streaming-tools
dhiltgen/remove_submodule
jmorganca/ga
jmorganca/mllama
pdevine/newlines
pdevine/geems-2b
jmorganca/llama-bump
mxyng/modelname-7
mxyng/gin-slog
mxyng/modelname-6
jyan/convert-prog
jyan/quant5
paligemma-support
pdevine/import-docs
jmorganca/openai-context
jyan/paligemma
jyan/p2
jyan/palitest
bmizerany/embedspeedup
jmorganca/llama-vit
brucemacd/allow-ollama
royh/ep-methods
royh/whisper
mxyng/api-models
mxyng/fix-memory
jyan/q4_4/8
jyan/ollama-v
royh/stream-tools
roy-embed-parallel
bmizerany/hrm
revert-5963-revert-5924-mxyng/llama3.1-rope
royh/embed-viz
jyan/local2
jyan/auth
jyan/local
jyan/parse-temp
jmorganca/template-mistral
jyan/reord-g
royh-openai-suffixdocs
royh-imgembed
royh-embed-parallel
jyan/quant4
royh-precision
jyan/progress
pdevine/fix-template
jyan/quant3
pdevine/ggla
mxyng/update-registry-domain
jmorganca/ggml-static
mxyng/create-context
jyan/v0.146
mxyng/layers-from-files
build_dist
bmizerany/noseek
royh-ls
royh-name
timeout
mxyng/server-timestamp
bmizerany/nosillyggufslurps
royh-params
jmorganca/llama-cpp-7c26775
royh-openai-delete
royh-show-rigid
jmorganca/enable-fa
jmorganca/no-error-template
jyan/format
royh-testdelete
bmizerany/fastverify
language_support
pdevine/ps-glitches
brucemacd/tokenize
bruce/iq-quants
bmizerany/filepathwithcoloninhost
mxyng/split-bin
bmizerany/client-registry
jmorganca/if-none-match
native
jmorganca/native
jmorganca/batch-embeddings
jmorganca/initcmake
jmorganca/mm
pdevine/showggmlinfo
modenameenforcealphanum
bmizerany/modenameenforcealphanum
jmorganca/done-reason
jmorganca/llama-cpp-8960fe8
ollama.com
bmizerany/filepathnobuild
bmizerany/types/model/defaultfix
rmdisplaylong
nogogen
bmizerany/x
modelfile-readme
bmizerany/replacecolon
jmorganca/limit
jmorganca/execstack
jmorganca/replace-assets
mxyng/tune-concurrency
jmorganca/testing
whitespace-detection
jmorganca/options
upgrade-all
scratch
cuda-search
mattw/airenamer
mattw/allmodelsonhuggingface
mattw/quantcontext
mattw/whatneedstorun
brucemacd/llama-mem-calc
mattw/faq-context
mattw/communitylinks
mattw/noprune
mattw/python-functioncalling
rename
mxyng/install
pulse
remove-first
editor
mattw/selfqueryingretrieval
cgo
mattw/howtoquant
api
matt/streamingapi
format-config
mxyng/extra-args
shell
update-nous-hermes
cp-model
upload-progress
fix-unknown-model
fix-model-names
delete-fix
insecure-registry
ls
deletemodels
progressbar
readme-updates
license-layers
skip-list
list-models
modelpath
matt/examplemodelfiles
distribution
go-opts
v0.30.0-rc23
v0.30.0-rc22
v0.30.0-rc21
v0.30.0-rc20
v0.30.0-rc19
v0.30.0-rc18
v0.25.0-rc0
v0.30.0-rc17
v0.30.0-rc16
v0.24.0-rc1
v0.24.0
v0.24.0-rc0
v0.23.4
v0.23.4-rc0
v0.30.0-rc15
v0.23.3
v0.23.3-rc1
v0.30.0-rc14
v0.23.3-rc0
v0.30.0-rc13
v0.30.0-rc12
v0.30.0-rc11
v0.30.0-rc10
v0.30.0-rc9
v0.30.0-rc8
v0.30.0-rc7
v0.30.0-rc6
v0.30.0-rc5
v0.23.2
v0.23.2-rc0
v0.30.0-rc4
v0.30.0-rc3
v0.30.0-rc2
v0.30.0-rc1
v0.30.0-rc0
v0.23.1
v0.23.1-rc0
v0.23.0
v0.23.0-rc0
v0.22.1
v0.22.1-rc1
v0.22.1-rc0
v0.22.0
v0.22.0-rc1
v0.21.3-rc0
v0.21.2-rc1
v0.21.2
v0.21.2-rc0
v0.21.1
v0.21.1-rc1
v0.21.1-rc0
v0.21.0
v0.21.0-rc1
v0.21.0-rc0
v0.20.8-rc0
v0.20.7
v0.20.7-rc1
v0.20.7-rc0
v0.20.6
v0.20.6-rc1
v0.20.6-rc0
v0.20.5
v0.20.5-rc2
v0.20.5-rc1
v0.20.5-rc0
v0.20.4
v0.20.4-rc2
v0.20.4-rc1
v0.20.4-rc0
v0.20.3
v0.20.3-rc0
v0.20.2
v0.20.1
v0.20.1-rc2
v0.20.1-rc1
v0.20.1-rc0
v0.20.0
v0.20.0-rc1
v0.20.0-rc0
v0.19.0
v0.19.0-rc2
v0.19.0-rc1
v0.19.0-rc0
v0.18.4-rc1
v0.18.4-rc0
v0.18.3
v0.18.3-rc2
v0.18.3-rc1
v0.18.3-rc0
v0.18.2
v0.18.2-rc1
v0.18.2-rc0
v0.18.1
v0.18.1-rc1
v0.18.1-rc0
v0.18.0
v0.18.0-rc2
v0.18.0-rc1
v0.18.0-rc0
v0.17.8-rc4
v0.17.8-rc3
v0.17.8-rc2
v0.17.8-rc1
v0.17.8-rc0
v0.17.7
v0.17.7-rc2
v0.17.7-rc1
v0.17.7-rc0
v0.17.6
v0.17.5
v0.17.4
v0.17.3
v0.17.2
v0.17.1
v0.17.1-rc2
v0.17.1-rc1
v0.17.1-rc0
v0.17.0
v0.17.0-rc2
v0.17.0-rc1
v0.17.0-rc0
v0.16.3
v0.16.3-rc2
v0.16.3-rc1
v0.16.3-rc0
v0.16.2
v0.16.2-rc0
v0.16.1
v0.16.0
v0.16.0-rc2
v0.16.0-rc0
v0.16.0-rc1
v0.15.6
v0.15.5
v0.15.5-rc5
v0.15.5-rc4
v0.15.5-rc3
v0.15.5-rc2
v0.15.5-rc1
v0.15.5-rc0
v0.15.4
v0.15.3
v0.15.2
v0.15.1
v0.15.1-rc1
v0.15.1-rc0
v0.15.0-rc6
v0.15.0
v0.15.0-rc5
v0.15.0-rc4
v0.15.0-rc3
v0.15.0-rc2
v0.15.0-rc1
v0.15.0-rc0
v0.14.3
v0.14.3-rc3
v0.14.3-rc2
v0.14.3-rc1
v0.14.3-rc0
v0.14.2
v0.14.2-rc1
v0.14.2-rc0
v0.14.1
v0.14.0-rc11
v0.14.0
v0.14.0-rc10
v0.14.0-rc9
v0.14.0-rc8
v0.14.0-rc7
v0.14.0-rc6
v0.14.0-rc5
v0.14.0-rc4
v0.14.0-rc3
v0.14.0-rc2
v0.14.0-rc1
v0.14.0-rc0
v0.13.5
v0.13.5-rc1
v0.13.5-rc0
v0.13.4-rc2
v0.13.4
v0.13.4-rc1
v0.13.4-rc0
v0.13.3
v0.13.3-rc1
v0.13.3-rc0
v0.13.2
v0.13.2-rc2
v0.13.2-rc1
v0.13.2-rc0
v0.13.1
v0.13.1-rc2
v0.13.1-rc1
v0.13.1-rc0
v0.13.0
v0.13.0-rc0
v0.12.11
v0.12.11-rc1
v0.12.11-rc0
v0.12.10
v0.12.10-rc1
v0.12.10-rc0
v0.12.9-rc0
v0.12.9
v0.12.8
v0.12.8-rc0
v0.12.7
v0.12.7-rc1
v0.12.7-rc0
v0.12.7-citest0
v0.12.6
v0.12.6-rc1
v0.12.6-rc0
v0.12.5
v0.12.5-rc0
v0.12.4
v0.12.4-rc7
v0.12.4-rc6
v0.12.4-rc5
v0.12.4-rc4
v0.12.4-rc3
v0.12.4-rc2
v0.12.4-rc1
v0.12.4-rc0
v0.12.3
v0.12.2
v0.12.2-rc0
v0.12.1
v0.12.1-rc1
v0.12.1-rc2
v0.12.1-rc0
v0.12.0
v0.12.0-rc1
v0.12.0-rc0
v0.11.11
v0.11.11-rc3
v0.11.11-rc2
v0.11.11-rc1
v0.11.11-rc0
v0.11.10
v0.11.9
v0.11.9-rc0
v0.11.8
v0.11.8-rc0
v0.11.7-rc1
v0.11.7-rc0
v0.11.7
v0.11.6
v0.11.6-rc0
v0.11.5-rc4
v0.11.5-rc3
v0.11.5
v0.11.5-rc5
v0.11.5-rc2
v0.11.5-rc1
v0.11.5-rc0
v0.11.4
v0.11.4-rc0
v0.11.3
v0.11.3-rc0
v0.11.2
v0.11.1
v0.11.0-rc0
v0.11.0-rc1
v0.11.0-rc2
v0.11.0
v0.10.2-int1
v0.10.1
v0.10.0
v0.10.0-rc4
v0.10.0-rc3
v0.10.0-rc2
v0.10.0-rc1
v0.10.0-rc0
v0.9.7-rc1
v0.9.7-rc0
v0.9.6
v0.9.6-rc0
v0.9.6-ci0
v0.9.5
v0.9.4-rc5
v0.9.4-rc6
v0.9.4
v0.9.4-rc3
v0.9.4-rc4
v0.9.4-rc1
v0.9.4-rc2
v0.9.4-rc0
v0.9.3
v0.9.3-rc5
v0.9.4-citest0
v0.9.3-rc4
v0.9.3-rc3
v0.9.3-rc2
v0.9.3-rc1
v0.9.3-rc0
v0.9.2
v0.9.1
v0.9.1-rc1
v0.9.1-rc0
v0.9.1-ci1
v0.9.1-ci0
v0.9.0
v0.9.0-rc0
v0.8.0
v0.8.0-rc0
v0.7.1-rc2
v0.7.1
v0.7.1-rc1
v0.7.1-rc0
v0.7.0
v0.7.0-rc1
v0.7.0-rc0
v0.6.9-rc0
v0.6.8
v0.6.8-rc0
v0.6.7
v0.6.7-rc2
v0.6.7-rc1
v0.6.7-rc0
v0.6.6
v0.6.6-rc2
v0.6.6-rc1
v0.6.6-rc0
v0.6.5-rc1
v0.6.5
v0.6.5-rc0
v0.6.4-rc0
v0.6.4
v0.6.3-rc1
v0.6.3
v0.6.3-rc0
v0.6.2
v0.6.2-rc0
v0.6.1
v0.6.1-rc0
v0.6.0-rc0
v0.6.0
v0.5.14-rc0
v0.5.13
v0.5.13-rc6
v0.5.13-rc5
v0.5.13-rc4
v0.5.13-rc3
v0.5.13-rc2
v0.5.13-rc1
v0.5.13-rc0
v0.5.12
v0.5.12-rc1
v0.5.12-rc0
v0.5.11
v0.5.10
v0.5.9
v0.5.9-rc0
v0.5.8-rc13
v0.5.8
v0.5.8-rc12
v0.5.8-rc11
v0.5.8-rc10
v0.5.8-rc9
v0.5.8-rc8
v0.5.8-rc7
v0.5.8-rc6
v0.5.8-rc5
v0.5.8-rc4
v0.5.8-rc3
v0.5.8-rc2
v0.5.8-rc1
v0.5.8-rc0
v0.5.7
v0.5.6
v0.5.5
v0.5.5-rc0
v0.5.4
v0.5.3
v0.5.3-rc0
v0.5.2
v0.5.2-rc3
v0.5.2-rc2
v0.5.2-rc1
v0.5.2-rc0
v0.5.1
v0.5.0
v0.5.0-rc1
v0.4.8-rc0
v0.4.7
v0.4.6
v0.4.5
v0.4.4
v0.4.3
v0.4.3-rc0
v0.4.2
v0.4.2-rc1
v0.4.2-rc0
v0.4.1
v0.4.1-rc0
v0.4.0
v0.4.0-rc8
v0.4.0-rc7
v0.4.0-rc6
v0.4.0-rc5
v0.4.0-rc4
v0.4.0-rc3
v0.4.0-rc2
v0.4.0-rc1
v0.4.0-rc0
v0.4.0-ci3
v0.3.14
v0.3.14-rc0
v0.3.13
v0.3.12
v0.3.12-rc5
v0.3.12-rc4
v0.3.12-rc3
v0.3.12-rc2
v0.3.12-rc1
v0.3.11
v0.3.11-rc4
v0.3.11-rc3
v0.3.11-rc2
v0.3.11-rc1
v0.3.10
v0.3.10-rc1
v0.3.9
v0.3.8
v0.3.7
v0.3.7-rc6
v0.3.7-rc5
v0.3.7-rc4
v0.3.7-rc3
v0.3.7-rc2
v0.3.7-rc1
v0.3.6
v0.3.5
v0.3.4
v0.3.3
v0.3.2
v0.3.1
v0.3.0
v0.2.8
v0.2.8-rc2
v0.2.8-rc1
v0.2.7
v0.2.6
v0.2.5
v0.2.4
v0.2.3
v0.2.2
v0.2.2-rc2
v0.2.2-rc1
v0.2.1
v0.2.0
v0.1.49-rc14
v0.1.49-rc13
v0.1.49-rc12
v0.1.49-rc11
v0.1.49-rc10
v0.1.49-rc9
v0.1.49-rc8
v0.1.49-rc7
v0.1.49-rc6
v0.1.49-rc4
v0.1.49-rc5
v0.1.49-rc3
v0.1.49-rc2
v0.1.49-rc1
v0.1.48
v0.1.47
v0.1.46
v0.1.45-rc5
v0.1.45
v0.1.45-rc4
v0.1.45-rc3
v0.1.45-rc2
v0.1.45-rc1
v0.1.44
v0.1.43
v0.1.42
v0.1.41
v0.1.40
v0.1.40-rc1
v0.1.39
v0.1.39-rc2
v0.1.39-rc1
v0.1.38
v0.1.37
v0.1.36
v0.1.35
v0.1.35-rc1
v0.1.34
v0.1.34-rc1
v0.1.33
v0.1.33-rc7
v0.1.33-rc6
v0.1.33-rc5
v0.1.33-rc4
v0.1.33-rc3
v0.1.33-rc2
v0.1.33-rc1
v0.1.32
v0.1.32-rc2
v0.1.32-rc1
v0.1.31
v0.1.30
v0.1.29
v0.1.28
v0.1.27
v0.1.26
v0.1.25
v0.1.24
v0.1.23
v0.1.22
v0.1.21
v0.1.20
v0.1.19
v0.1.18
v0.1.17
v0.1.16
v0.1.15
v0.1.14
v0.1.13
v0.1.12
v0.1.11
v0.1.10
v0.1.9
v0.1.8
v0.1.7
v0.1.6
v0.1.5
v0.1.4
v0.1.3
v0.1.2
v0.1.1
v0.1.0
v0.0.21
v0.0.20
v0.0.19
v0.0.18
v0.0.17
v0.0.16
v0.0.15
v0.0.14
v0.0.13
v0.0.12
v0.0.11
v0.0.10
v0.0.9
v0.0.8
v0.0.7
v0.0.6
v0.0.5
v0.0.4
v0.0.3
v0.0.2
v0.0.1
Labels
Clear labels
amd
api
app
bug
build
cli
cloud
compatibility
context-length
create
docker
documentation
embeddings
feature request
feedback wanted
good first issue
gpt-oss
gpu
harmony
help wanted
image
install
intel
js
launch
linux
macos
memory
mlx
model
needs more info
networking
nvidia
ollama.com
performance
pull-request
python
question
registry
rendering
thinking
tools
top
vulkan
windows
wsl
Mirrored from GitHub Pull Request
No Label
bug
Milestone
No items
No Milestone
Projects
Clear projects
No project
No Assignees
Notifications
Due Date
No due date set.
Dependencies
No dependencies set.
Reference: github-starred/ollama#68196
Reference in New Issue
Block a user
Blocking a user prevents them from interacting with repositories, such as opening or commenting on pull requests or issues. Learn more about blocking a user.
Delete Branch "%!s()"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
Originally created by @ProjectMoon on GitHub (Feb 28, 2025).
Original GitHub issue: https://github.com/ollama/ollama/issues/9416
Updated this to reference the specific problem of the vision part of Granite 3.2 seeming to use the CPU, rather than the GPU, on 0.5.13.
What is the issue?
After upgrading to 0.5.13-rc1, I have noticed that ROCm fails to actually run. When the model is loaded, it loads onto the GPU (confirmed via
rocm-smi), but when trying to chat, it seems to reload the model and I guess uses the CPU? This causes my computer to lock up too (but that might just be RAM thrashing).In the log output, you can see it first loading on to ROCm, and then it reloads the model when the chat endpoint is called, and that seems to skip the GPU for some reason.
Downgrading back to 0.5.12 works perfectly. I am not using the system ROCm as far as I know. I always untar the ROCm package from ollama when upgrading.
Debug Logs
Logs:OS
Gentoo Linux
GPU
AMD, NVidia
CPU
AMD
Ollama version
0.5.13-rc1
@jmorganca commented on GitHub (Feb 28, 2025):
@ProjectMoon thanks! Possible to share the logs?
@jmorganca commented on GitHub (Feb 28, 2025):
Oops, you did! Thanks so much. Looking
@githubdebugger commented on GitHub (Mar 1, 2025):
Maybe these issues are related, hence adding the logs here as I am facing issue with 0.5.13-rc2-rocm and this one in docker container. The model loads in GPU, but as soon as I am sending a message, it exits:
Click to view logs
➜ ~ docker stop ollama
ollama
➜ ~ docker rm ollama
ollama
➜ ~ docker run -d --device /dev/kfd --device /dev/dri -v ollama:/root/.ollama -e OLLAMA_ORIGINS="" -e HSA_OVERRIDE_GFX_VERSION=10.3.0 -p 11434:11434 --name ollama ollama/ollama:0.5.13-rc2-rocm
993105481ed3bcd488bd82b5477a9749248437e15aee10d99de73a104a37e090
➜ ~
➜ ~
➜ ~ docker logs -f ollama
2025/03/01 07:09:11 routes.go:1215: INFO server config env="map[CUDA_VISIBLE_DEVICES: GPU_DEVICE_ORDINAL: HIP_VISIBLE_DEVICES: HSA_OVERRIDE_GFX_VERSION:10.3.0 HTTPS_PROXY: HTTP_PROXY: NO_PROXY: OLLAMA_CONTEXT_LENGTH:2048 OLLAMA_DEBUG:false OLLAMA_FLASH_ATTENTION:false OLLAMA_GPU_OVERHEAD:0 OLLAMA_HOST:http://0.0.0.0:11434 OLLAMA_INTEL_GPU:false OLLAMA_KEEP_ALIVE:5m0s OLLAMA_KV_CACHE_TYPE: OLLAMA_LLM_LIBRARY: OLLAMA_LOAD_TIMEOUT:5m0s OLLAMA_MAX_LOADED_MODELS:0 OLLAMA_MAX_QUEUE:512 OLLAMA_MODELS:/root/.ollama/models OLLAMA_MULTIUSER_CACHE:false OLLAMA_NEW_ENGINE:false OLLAMA_NOHISTORY:false OLLAMA_NOPRUNE:false OLLAMA_NUM_PARALLEL:0 OLLAMA_ORIGINS:[ http://localhost https://localhost http://localhost:* https://localhost:* http://127.0.0.1 https://127.0.0.1 http://127.0.0.1:* https://127.0.0.1:* http://0.0.0.0 https://0.0.0.0 http://0.0.0.0:* https://0.0.0.0:* app://* file://* tauri://* vscode-webview://* vscode-file://] OLLAMA_SCHED_SPREAD:false ROCR_VISIBLE_DEVICES: http_proxy: https_proxy: no_proxy:]"
time=2025-03-01T07:09:11.854Z level=INFO source=images.go:432 msg="total blobs: 23"
time=2025-03-01T07:09:11.855Z level=INFO source=images.go:439 msg="total unused blobs removed: 0"
time=2025-03-01T07:09:11.856Z level=INFO source=routes.go:1281 msg="Listening on [::]:11434 (version 0.5.13-rc2)"
time=2025-03-01T07:09:11.856Z level=INFO source=gpu.go:217 msg="looking for compatible GPUs"
time=2025-03-01T07:09:11.859Z level=INFO source=amd_linux.go:389 msg="skipping rocm gfx compatibility check" HSA_OVERRIDE_GFX_VERSION=10.3.0
time=2025-03-01T07:09:11.864Z level=INFO source=types.go:130 msg="inference compute" id=0 library=rocm variant="" compute=gfx1035 driver=6.8 name=1002:1681 total="16.0 GiB" available="16.0 GiB"
^C% ➜ ~ docker exec -it ollama ollama ls
NAME ID SIZE MODIFIED
phi4-mini:latest 78fad5d182a7 2.5 GB 7 minutes ago
deepseek-r1:1.5b a42b25d8c10a 1.1 GB 3 hours ago
qwen2.5-coder:latest 2b0496514337 4.7 GB 12 days ago
nomic-embed-text:latest 0a109f422b47 274 MB 12 days ago
deepseek-r1:latest 0a8c26691023 4.7 GB 12 days ago
bge-m3:latest 790764642607 1.2 GB 12 days ago
➜ ~ docker exec -it ollama ollama ps
NAME ID SIZE PROCESSOR UNTIL
➜ ~ docker exec -it ollama ollama ps
NAME ID SIZE PROCESSOR UNTIL
phi4-mini:latest 78fad5d182a7 4.7 GB 100% GPU 4 minutes from now
➜ ~
➜ ~ docker exec -it ollama ollama ps
NAME ID SIZE PROCESSOR UNTIL
phi4-mini:latest 78fad5d182a7 4.7 GB 100% GPU 4 minutes from now
➜ ~ docker logs -f ollama
2025/03/01 07:09:11 routes.go:1215: INFO server config env="map[CUDA_VISIBLE_DEVICES: GPU_DEVICE_ORDINAL: HIP_VISIBLE_DEVICES: HSA_OVERRIDE_GFX_VERSION:10.3.0 HTTPS_PROXY: HTTP_PROXY: NO_PROXY: OLLAMA_CONTEXT_LENGTH:2048 OLLAMA_DEBUG:false OLLAMA_FLASH_ATTENTION:false OLLAMA_GPU_OVERHEAD:0 OLLAMA_HOST:http://0.0.0.0:11434 OLLAMA_INTEL_GPU:false OLLAMA_KEEP_ALIVE:5m0s OLLAMA_KV_CACHE_TYPE: OLLAMA_LLM_LIBRARY: OLLAMA_LOAD_TIMEOUT:5m0s OLLAMA_MAX_LOADED_MODELS:0 OLLAMA_MAX_QUEUE:512 OLLAMA_MODELS:/root/.ollama/models OLLAMA_MULTIUSER_CACHE:false OLLAMA_NEW_ENGINE:false OLLAMA_NOHISTORY:false OLLAMA_NOPRUNE:false OLLAMA_NUM_PARALLEL:0 OLLAMA_ORIGINS:[ http://localhost https://localhost http://localhost:* https://localhost:* http://127.0.0.1 https://127.0.0.1 http://127.0.0.1:* https://127.0.0.1:* http://0.0.0.0 https://0.0.0.0 http://0.0.0.0:* https://0.0.0.0:* app://* file://* tauri://* vscode-webview://* vscode-file://*] OLLAMA_SCHED_SPREAD:false ROCR_VISIBLE_DEVICES: http_proxy: https_proxy: no_proxy:]"
time=2025-03-01T07:09:11.854Z level=INFO source=images.go:432 msg="total blobs: 23"
time=2025-03-01T07:09:11.855Z level=INFO source=images.go:439 msg="total unused blobs removed: 0"
time=2025-03-01T07:09:11.856Z level=INFO source=routes.go:1281 msg="Listening on [::]:11434 (version 0.5.13-rc2)"
time=2025-03-01T07:09:11.856Z level=INFO source=gpu.go:217 msg="looking for compatible GPUs"
time=2025-03-01T07:09:11.859Z level=INFO source=amd_linux.go:389 msg="skipping rocm gfx compatibility check" HSA_OVERRIDE_GFX_VERSION=10.3.0
time=2025-03-01T07:09:11.864Z level=INFO source=types.go:130 msg="inference compute" id=0 library=rocm variant="" compute=gfx1035 driver=6.8 name=1002:1681 total="16.0 GiB" available="16.0 GiB"
[GIN] 2025/03/01 - 07:09:44 | 200 | 106.108µs | 127.0.0.1 | HEAD "/"
[GIN] 2025/03/01 - 07:09:44 | 200 | 1.294668ms | 127.0.0.1 | GET "/api/tags"
[GIN] 2025/03/01 - 07:09:47 | 200 | 20.921µs | 127.0.0.1 | HEAD "/"
[GIN] 2025/03/01 - 07:09:47 | 200 | 118.183µs | 127.0.0.1 | GET "/api/ps"
[GIN] 2025/03/01 - 07:09:52 | 200 | 29.288µs | 127.0.0.1 | HEAD "/"
[GIN] 2025/03/01 - 07:09:52 | 200 | 23.668722ms | 127.0.0.1 | POST "/api/show"
time=2025-03-01T07:09:52.534Z level=WARN source=ggml.go:136 msg="key not found" key=phi3.attention.key_length default=128
time=2025-03-01T07:09:52.534Z level=WARN source=ggml.go:136 msg="key not found" key=phi3.attention.value_length default=128
time=2025-03-01T07:09:52.534Z level=INFO source=sched.go:715 msg="new model will fit in available VRAM in single GPU, loading" model=/root/.ollama/models/blobs/sha256-3c168af1dea0a414299c7d9077e100ac763370e5a98b3c53801a958a47f0a5db gpu=0 parallel=4 available=17163341824 required="4.4 GiB"
time=2025-03-01T07:09:52.534Z level=INFO source=server.go:97 msg="system memory" total="15.4 GiB" free="12.2 GiB" free_swap="0 B"
time=2025-03-01T07:09:52.534Z level=WARN source=ggml.go:136 msg="key not found" key=phi3.attention.key_length default=128
time=2025-03-01T07:09:52.534Z level=WARN source=ggml.go:136 msg="key not found" key=phi3.attention.value_length default=128
time=2025-03-01T07:09:52.534Z level=INFO source=server.go:130 msg=offload library=rocm layers.requested=-1 layers.model=33 layers.offload=33 layers.split="" memory.available="[16.0 GiB]" memory.gpu_overhead="0 B" memory.required.full="4.4 GiB" memory.required.partial="4.4 GiB" memory.required.kv="1.0 GiB" memory.required.allocations="[4.4 GiB]" memory.weights.total="2.8 GiB" memory.weights.repeating="2.4 GiB" memory.weights.nonrepeating="480.8 MiB" memory.graph.full="512.0 MiB" memory.graph.partial="512.0 MiB"
time=2025-03-01T07:09:52.535Z level=INFO source=server.go:380 msg="starting llama server" cmd="/usr/bin/ollama runner --model /root/.ollama/models/blobs/sha256-3c168af1dea0a414299c7d9077e100ac763370e5a98b3c53801a958a47f0a5db --ctx-size 8192 --batch-size 512 --n-gpu-layers 33 --threads 8 --parallel 4 --port 44845"
time=2025-03-01T07:09:52.535Z level=INFO source=sched.go:450 msg="loaded runners" count=1
time=2025-03-01T07:09:52.535Z level=INFO source=server.go:557 msg="waiting for llama runner to start responding"
time=2025-03-01T07:09:52.536Z level=INFO source=server.go:591 msg="waiting for server to become available" status="llm server error"
time=2025-03-01T07:09:52.552Z level=INFO source=runner.go:931 msg="starting go runner"
/opt/amdgpu/share/libdrm/amdgpu.ids: No such file or directory
ggml_cuda_init: GGML_CUDA_FORCE_MMQ: no
ggml_cuda_init: GGML_CUDA_FORCE_CUBLAS: no
ggml_cuda_init: found 1 ROCm devices:
Device 0: AMD Radeon Graphics, gfx1030 (0x1030), VMM: yes, Wave Size: 32
load_backend: loaded ROCm backend from /usr/lib/ollama/rocm/libggml-hip.so
load_backend: loaded CPU backend from /usr/lib/ollama/libggml-cpu-haswell.so
time=2025-03-01T07:09:54.075Z level=INFO source=runner.go:934 msg=system info="CPU : LLAMAFILE = 1 | ROCm : PEER_MAX_BATCH_SIZE = 128 | CPU : LLAMAFILE = 1 | cgo(gcc)" threads=8
llama_model_load_from_file_impl: using device ROCm0 (AMD Radeon Graphics) - 7860 MiB free
time=2025-03-01T07:09:54.076Z level=INFO source=runner.go:992 msg="Server listening on 127.0.0.1:44845"
llama_model_loader: loaded meta data with 36 key-value pairs and 196 tensors from /root/.ollama/models/blobs/sha256-3c168af1dea0a414299c7d9077e100ac763370e5a98b3c53801a958a47f0a5db (version GGUF V3 (latest))
llama_model_loader: Dumping metadata keys/values. Note: KV overrides do not apply in this output.
llama_model_loader: - kv 0: general.architecture str = phi3
llama_model_loader: - kv 1: phi3.rope.scaling.attn_factor f32 = 1.190238
llama_model_loader: - kv 2: general.type str = model
llama_model_loader: - kv 3: general.name str = Phi 4 Mini Instruct
llama_model_loader: - kv 4: general.finetune str = instruct
llama_model_loader: - kv 5: general.basename str = Phi-4
llama_model_loader: - kv 6: general.size_label str = mini
llama_model_loader: - kv 7: general.license str = mit
llama_model_loader: - kv 8: general.license.link str = https://huggingface.co/microsoft/Phi-...
llama_model_loader: - kv 9: general.tags arr[str,3] = ["nlp", "code", "text-generation"]
llama_model_loader: - kv 10: general.languages arr[str,24] = ["multilingual", "ar", "zh", "cs", "d...
llama_model_loader: - kv 11: phi3.context_length u32 = 131072
llama_model_loader: - kv 12: phi3.rope.scaling.original_context_length u32 = 4096
llama_model_loader: - kv 13: phi3.embedding_length u32 = 3072
llama_model_loader: - kv 14: phi3.feed_forward_length u32 = 8192
llama_model_loader: - kv 15: phi3.block_count u32 = 32
llama_model_loader: - kv 16: phi3.attention.head_count u32 = 24
llama_model_loader: - kv 17: phi3.attention.head_count_kv u32 = 8
llama_model_loader: - kv 18: phi3.attention.layer_norm_rms_epsilon f32 = 0.000010
llama_model_loader: - kv 19: phi3.rope.dimension_count u32 = 96
llama_model_loader: - kv 20: phi3.rope.freq_base f32 = 10000.000000
llama_model_loader: - kv 21: phi3.attention.sliding_window u32 = 262144
llama_model_loader: - kv 22: tokenizer.ggml.model str = gpt2
llama_model_loader: - kv 23: tokenizer.ggml.pre str = gpt-4o
llama_model_loader: - kv 24: tokenizer.ggml.tokens arr[str,200064] = ["!", """, "#", "$", "%", "&", "'", ...
llama_model_loader: - kv 25: tokenizer.ggml.token_type arr[i32,200064] = [1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, ...
llama_model_loader: - kv 26: tokenizer.ggml.merges arr[str,199742] = ["Ġ Ġ", "ĠĠ ĠĠ", "i n", "e r", ...
llama_model_loader: - kv 27: tokenizer.ggml.bos_token_id u32 = 199999
llama_model_loader: - kv 28: tokenizer.ggml.eos_token_id u32 = 199999
llama_model_loader: - kv 29: tokenizer.ggml.unknown_token_id u32 = 199999
llama_model_loader: - kv 30: tokenizer.ggml.padding_token_id u32 = 199999
llama_model_loader: - kv 31: tokenizer.ggml.add_bos_token bool = false
llama_model_loader: - kv 32: tokenizer.ggml.add_eos_token bool = false
llama_model_loader: - kv 33: tokenizer.chat_template str = {% for message in messages %}{% if me...
llama_model_loader: - kv 34: general.quantization_version u32 = 2
llama_model_loader: - kv 35: general.file_type u32 = 15
llama_model_loader: - type f32: 67 tensors
llama_model_loader: - type q4_K: 80 tensors
llama_model_loader: - type q5_K: 32 tensors
llama_model_loader: - type q6_K: 17 tensors
print_info: file format = GGUF V3 (latest)
print_info: file type = Q4_K - Medium
print_info: file size = 2.31 GiB (5.18 BPW)
load: special tokens cache size = 12
time=2025-03-01T07:09:54.293Z level=INFO source=server.go:591 msg="waiting for server to become available" status="llm server loading model"
load: token to piece cache size = 1.3333 MB
print_info: arch = phi3
print_info: vocab_only = 0
print_info: n_ctx_train = 131072
print_info: n_embd = 3072
print_info: n_layer = 32
print_info: n_head = 24
print_info: n_head_kv = 8
print_info: n_rot = 96
print_info: n_swa = 262144
print_info: n_embd_head_k = 128
print_info: n_embd_head_v = 128
print_info: n_gqa = 3
print_info: n_embd_k_gqa = 1024
print_info: n_embd_v_gqa = 1024
print_info: f_norm_eps = 0.0e+00
print_info: f_norm_rms_eps = 1.0e-05
print_info: f_clamp_kqv = 0.0e+00
print_info: f_max_alibi_bias = 0.0e+00
print_info: f_logit_scale = 0.0e+00
print_info: n_ff = 8192
print_info: n_expert = 0
print_info: n_expert_used = 0
print_info: causal attn = 1
print_info: pooling type = 0
print_info: rope type = 2
print_info: rope scaling = linear
print_info: freq_base_train = 10000.0
print_info: freq_scale_train = 1
print_info: n_ctx_orig_yarn = 4096
print_info: rope_finetuned = unknown
print_info: ssm_d_conv = 0
print_info: ssm_d_inner = 0
print_info: ssm_d_state = 0
print_info: ssm_dt_rank = 0
print_info: ssm_dt_b_c_rms = 0
print_info: model type = 3B
print_info: model params = 3.84 B
print_info: general.name = Phi 4 Mini Instruct
print_info: vocab type = BPE
print_info: n_vocab = 200064
print_info: n_merges = 199742
print_info: BOS token = 199999 '<|endoftext|>'
print_info: EOS token = 199999 '<|endoftext|>'
print_info: EOT token = 199999 '<|endoftext|>'
print_info: UNK token = 199999 '<|endoftext|>'
print_info: PAD token = 199999 '<|endoftext|>'
print_info: LF token = 198 'Ċ'
print_info: EOG token = 199999 '<|endoftext|>'
print_info: EOG token = 200020 '<|end|>'
print_info: max token length = 256
load_tensors: loading model tensors, this can take a while... (mmap = true)
load_tensors: offloading 32 repeating layers to GPU
load_tensors: offloading output layer to GPU
load_tensors: offloaded 33/33 layers to GPU
load_tensors: CPU_Mapped model buffer size = 480.81 MiB
load_tensors: ROCm0 model buffer size = 2368.57 MiB
llama_init_from_model: n_seq_max = 4
llama_init_from_model: n_ctx = 8192
llama_init_from_model: n_ctx_per_seq = 2048
llama_init_from_model: n_batch = 2048
llama_init_from_model: n_ubatch = 512
llama_init_from_model: flash_attn = 0
llama_init_from_model: freq_base = 10000.0
llama_init_from_model: freq_scale = 1
llama_init_from_model: n_ctx_per_seq (2048) < n_ctx_train (131072) -- the full capacity of the model will not be utilized
llama_kv_cache_init: kv_size = 8192, offload = 1, type_k = 'f16', type_v = 'f16', n_layer = 32, can_shift = 1
llama_kv_cache_init: ROCm0 KV buffer size = 1024.00 MiB
llama_init_from_model: KV self size = 1024.00 MiB, K (f16): 512.00 MiB, V (f16): 512.00 MiB
llama_init_from_model: ROCm_Host output buffer size = 3.10 MiB
llama_init_from_model: ROCm0 compute buffer size = 428.00 MiB
llama_init_from_model: ROCm_Host compute buffer size = 22.01 MiB
llama_init_from_model: graph nodes = 1286
llama_init_from_model: graph splits = 2
time=2025-03-01T07:09:55.298Z level=INFO source=server.go:596 msg="llama runner started in 2.76 seconds"
[GIN] 2025/03/01 - 07:09:55 | 200 | 2.832161663s | 127.0.0.1 | POST "/api/generate"
[GIN] 2025/03/01 - 07:09:58 | 200 | 25.49µs | 127.0.0.1 | HEAD "/"
[GIN] 2025/03/01 - 07:09:58 | 200 | 26.983µs | 127.0.0.1 | GET "/api/ps"
[GIN] 2025/03/01 - 07:10:00 | 200 | 34.017µs | 127.0.0.1 | HEAD "/"
[GIN] 2025/03/01 - 07:10:00 | 200 | 29.648µs | 127.0.0.1 | GET "/api/ps"
//ml/backend/ggml/ggml/src/ggml-cuda/ggml-cuda.cu:449: HipVMM Failure: out of memory
Memory critical error by agent node-0 (Agent handle: 0x63996d6c1680) on address 0x70d978500000. Reason: Memory in use.
SIGABRT: abort
PC=0x70d9e8efe00b m=0 sigcode=18446744073709551610
signal arrived during cgo execution
goroutine 16 gp=0xc000504a80 m=0 mp=0x6399654f96c0 [syscall]:
runtime.cgocall(0x639964647600, 0xc000093bc8)
runtime/cgocall.go:167 +0x4b fp=0xc000093ba0 sp=0xc000093b68 pc=0x6399639f76ab
github.com/ollama/ollama/llama._Cfunc_llama_decode(0x70d7cc836840, {0x4, 0x70d7cc857970, 0x0, 0x0, 0x70d7cc8610d0, 0x70d7cc859e30, 0x70d7cc78bbc0, 0x70d97c6252d0})
_cgo_gotypes.go:557 +0x4a fp=0xc000093bc8 sp=0xc000093ba0 pc=0x639963d7d46a
github.com/ollama/ollama/llama.(*Context).Decode.func1(...)
github.com/ollama/ollama/llama/llama.go:157
github.com/ollama/ollama/llama.(*Context).Decode(0xc00011c5d0?, 0x0?)
github.com/ollama/ollama/llama/llama.go:157 +0xf6 fp=0xc000093cc8 sp=0xc000093bc8 pc=0x639963d80076
github.com/ollama/ollama/runner/llamarunner.(*Server).processBatch(0xc0004ba000, 0xc0004264e0, 0xc00011c720)
github.com/ollama/ollama/runner/llamarunner/runner.go:435 +0x23e fp=0xc000093ee0 sp=0xc000093cc8 pc=0x639963d990be
github.com/ollama/ollama/runner/llamarunner.(*Server).run(0xc0004ba000, {0x639964ca7e60, 0xc0001280a0})
github.com/ollama/ollama/runner/llamarunner/runner.go:343 +0x1d5 fp=0xc000093fb8 sp=0xc000093ee0 pc=0x639963d98d15
github.com/ollama/ollama/runner/llamarunner.Execute.gowrap2()
github.com/ollama/ollama/runner/llamarunner/runner.go:973 +0x28 fp=0xc000093fe0 sp=0xc000093fb8 pc=0x639963d9d6a8
runtime.goexit({})
runtime/asm_amd64.s:1700 +0x1 fp=0xc000093fe8 sp=0xc000093fe0 pc=0x639963a020c1
created by github.com/ollama/ollama/runner/llamarunner.Execute in goroutine 1
github.com/ollama/ollama/runner/llamarunner/runner.go:973 +0xd97
goroutine 1 gp=0xc000002380 m=nil [IO wait]:
runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?)
runtime/proc.go:435 +0xce fp=0xc0005875b8 sp=0xc000587598 pc=0x6399639fa98e
runtime.netpollblock(0xc000587608?, 0x639942c6?, 0x99?)
runtime/netpoll.go:575 +0xf7 fp=0xc0005875f0 sp=0xc0005875b8 pc=0x6399639bf797
internal/poll.runtime_pollWait(0x70d9a23c7eb0, 0x72)
runtime/netpoll.go:351 +0x85 fp=0xc000587610 sp=0xc0005875f0 pc=0x6399639f9ba5
internal/poll.(*pollDesc).wait(0xc00004e100?, 0x900000036?, 0x0)
internal/poll/fd_poll_runtime.go:84 +0x27 fp=0xc000587638 sp=0xc000587610 pc=0x639963a81027
internal/poll.(*pollDesc).waitRead(...)
internal/poll/fd_poll_runtime.go:89
internal/poll.(*FD).Accept(0xc00004e100)
internal/poll/fd_unix.go:620 +0x295 fp=0xc0005876e0 sp=0xc000587638 pc=0x639963a863f5
net.(*netFD).accept(0xc00004e100)
net/fd_unix.go:172 +0x29 fp=0xc000587798 sp=0xc0005876e0 pc=0x639963af8869
net.(*TCPListener).accept(0xc00071a080)
net/tcpsock_posix.go:159 +0x1b fp=0xc0005877e8 sp=0xc000587798 pc=0x639963b0e21b
net.(*TCPListener).Accept(0xc00071a080)
net/tcpsock.go:380 +0x30 fp=0xc000587818 sp=0xc0005877e8 pc=0x639963b0d0d0
net/http.(*onceCloseListener).Accept(0xc0004ba120?)
:1 +0x24 fp=0xc000587830 sp=0xc000587818 pc=0x639963d23f84
net/http.(*Server).Serve(0xc000534200, {0x639964ca5be8, 0xc00071a080})
net/http/server.go:3424 +0x30c fp=0xc000587960 sp=0xc000587830 pc=0x639963cfb84c
github.com/ollama/ollama/runner/llamarunner.Execute({0xc000034120, 0xe, 0xe})
github.com/ollama/ollama/runner/llamarunner/runner.go:993 +0x116a fp=0xc000587d08 sp=0xc000587960 pc=0x639963d9d3ea
github.com/ollama/ollama/runner.Execute({0xc000034110?, 0x0?, 0x0?})
github.com/ollama/ollama/runner/runner.go:22 +0xd4 fp=0xc000587d30 sp=0xc000587d08 pc=0x639963fc6514
github.com/ollama/ollama/cmd.NewCLI.func2(0xc000035300?, {0x639964825055?, 0x4?, 0x639964825059?})
github.com/ollama/ollama/cmd/cmd.go:1280 +0x45 fp=0xc000587d58 sp=0xc000587d30 pc=0x6399645daae5
github.com/spf13/cobra.(*Command).execute(0xc00013ef08, {0xc0005149a0, 0xe, 0xe})
github.com/spf13/cobra@v1.7.0/command.go:940 +0x85c fp=0xc000587e78 sp=0xc000587d58 pc=0x639963b71afc
github.com/spf13/cobra.(*Command).ExecuteC(0xc00054e908)
github.com/spf13/cobra@v1.7.0/command.go:1068 +0x3a5 fp=0xc000587f30 sp=0xc000587e78 pc=0x639963b72345
github.com/spf13/cobra.(*Command).Execute(...)
github.com/spf13/cobra@v1.7.0/command.go:992
github.com/spf13/cobra.(*Command).ExecuteContext(...)
github.com/spf13/cobra@v1.7.0/command.go:985
main.main()
github.com/ollama/ollama/main.go:12 +0x4d fp=0xc000587f50 sp=0xc000587f30 pc=0x6399645dae4d
runtime.main()
runtime/proc.go:283 +0x29d fp=0xc000587fe0 sp=0xc000587f50 pc=0x6399639c6d9d
runtime.goexit({})
runtime/asm_amd64.s:1700 +0x1 fp=0xc000587fe8 sp=0xc000587fe0 pc=0x639963a020c1
goroutine 2 gp=0xc000002e00 m=nil [force gc (idle)]:
runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?)
runtime/proc.go:435 +0xce fp=0xc000084fa8 sp=0xc000084f88 pc=0x6399639fa98e
runtime.goparkunlock(...)
runtime/proc.go:441
runtime.forcegchelper()
runtime/proc.go:348 +0xb8 fp=0xc000084fe0 sp=0xc000084fa8 pc=0x6399639c70d8
runtime.goexit({})
runtime/asm_amd64.s:1700 +0x1 fp=0xc000084fe8 sp=0xc000084fe0 pc=0x639963a020c1
created by runtime.init.7 in goroutine 1
runtime/proc.go:336 +0x1a
goroutine 3 gp=0xc000003340 m=nil [GC sweep wait]:
runtime.gopark(0x1?, 0x0?, 0x0?, 0x0?, 0x0?)
runtime/proc.go:435 +0xce fp=0xc000085780 sp=0xc000085760 pc=0x6399639fa98e
runtime.goparkunlock(...)
runtime/proc.go:441
runtime.bgsweep(0xc00003e080)
runtime/mgcsweep.go:316 +0xdf fp=0xc0000857c8 sp=0xc000085780 pc=0x6399639b18ff
runtime.gcenable.gowrap1()
runtime/mgc.go:204 +0x25 fp=0xc0000857e0 sp=0xc0000857c8 pc=0x6399639a5ce5
runtime.goexit({})
runtime/asm_amd64.s:1700 +0x1 fp=0xc0000857e8 sp=0xc0000857e0 pc=0x639963a020c1
created by runtime.gcenable in goroutine 1
runtime/mgc.go:204 +0x66
goroutine 4 gp=0xc000003500 m=nil [GC scavenge wait]:
runtime.gopark(0x10000?, 0x6399649d76b8?, 0x0?, 0x0?, 0x0?)
runtime/proc.go:435 +0xce fp=0xc000085f78 sp=0xc000085f58 pc=0x6399639fa98e
runtime.goparkunlock(...)
runtime/proc.go:441
runtime.(*scavengerState).park(0x6399654f68a0)
runtime/mgcscavenge.go:425 +0x49 fp=0xc000085fa8 sp=0xc000085f78 pc=0x6399639af349
runtime.bgscavenge(0xc00003e080)
runtime/mgcscavenge.go:658 +0x59 fp=0xc000085fc8 sp=0xc000085fa8 pc=0x6399639af8d9
runtime.gcenable.gowrap2()
runtime/mgc.go:205 +0x25 fp=0xc000085fe0 sp=0xc000085fc8 pc=0x6399639a5c85
runtime.goexit({})
runtime/asm_amd64.s:1700 +0x1 fp=0xc000085fe8 sp=0xc000085fe0 pc=0x639963a020c1
created by runtime.gcenable in goroutine 1
runtime/mgc.go:205 +0xa5
goroutine 5 gp=0xc000003dc0 m=nil [finalizer wait]:
runtime.gopark(0x1b8?, 0xc000002380?, 0x1?, 0x23?, 0xc000084688?)
runtime/proc.go:435 +0xce fp=0xc000084630 sp=0xc000084610 pc=0x6399639fa98e
runtime.runfinq()
runtime/mfinal.go:196 +0x107 fp=0xc0000847e0 sp=0xc000084630 pc=0x6399639a4ca7
runtime.goexit({})
runtime/asm_amd64.s:1700 +0x1 fp=0xc0000847e8 sp=0xc0000847e0 pc=0x639963a020c1
created by runtime.createfing in goroutine 1
runtime/mfinal.go:166 +0x3d
goroutine 6 gp=0xc0001e08c0 m=nil [chan receive]:
runtime.gopark(0xc0000ff900?, 0xc000588018?, 0x60?, 0x67?, 0x639963adf5a8?)
runtime/proc.go:435 +0xce fp=0xc000086718 sp=0xc0000866f8 pc=0x6399639fa98e
runtime.chanrecv(0xc0000b6380, 0x0, 0x1)
runtime/chan.go:664 +0x445 fp=0xc000086790 sp=0xc000086718 pc=0x639963996ea5
runtime.chanrecv1(0x0?, 0x0?)
runtime/chan.go:506 +0x12 fp=0xc0000867b8 sp=0xc000086790 pc=0x639963996a32
runtime.unique_runtime_registerUniqueMapCleanup.func2(...)
runtime/mgc.go:1796
runtime.unique_runtime_registerUniqueMapCleanup.gowrap1()
runtime/mgc.go:1799 +0x2f fp=0xc0000867e0 sp=0xc0000867b8 pc=0x6399639a8e8f
runtime.goexit({})
runtime/asm_amd64.s:1700 +0x1 fp=0xc0000867e8 sp=0xc0000867e0 pc=0x639963a020c1
created by unique.runtime_registerUniqueMapCleanup in goroutine 1
runtime/mgc.go:1794 +0x85
goroutine 7 gp=0xc0001e1180 m=nil [GC worker (idle)]:
runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?)
runtime/proc.go:435 +0xce fp=0xc000086f38 sp=0xc000086f18 pc=0x6399639fa98e
runtime.gcBgMarkWorker(0xc0000b7b20)
runtime/mgc.go:1423 +0xe9 fp=0xc000086fc8 sp=0xc000086f38 pc=0x6399639a81a9
runtime.gcBgMarkStartWorkers.gowrap1()
runtime/mgc.go:1339 +0x25 fp=0xc000086fe0 sp=0xc000086fc8 pc=0x6399639a8085
runtime.goexit({})
runtime/asm_amd64.s:1700 +0x1 fp=0xc000086fe8 sp=0xc000086fe0 pc=0x639963a020c1
created by runtime.gcBgMarkStartWorkers in goroutine 1
runtime/mgc.go:1339 +0x105
goroutine 8 gp=0xc0001e1340 m=nil [GC worker (idle)]:
runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?)
runtime/proc.go:435 +0xce fp=0xc000087738 sp=0xc000087718 pc=0x6399639fa98e
runtime.gcBgMarkWorker(0xc0000b7b20)
runtime/mgc.go:1423 +0xe9 fp=0xc0000877c8 sp=0xc000087738 pc=0x6399639a81a9
runtime.gcBgMarkStartWorkers.gowrap1()
runtime/mgc.go:1339 +0x25 fp=0xc0000877e0 sp=0xc0000877c8 pc=0x6399639a8085
runtime.goexit({})
runtime/asm_amd64.s:1700 +0x1 fp=0xc0000877e8 sp=0xc0000877e0 pc=0x639963a020c1
created by runtime.gcBgMarkStartWorkers in goroutine 1
runtime/mgc.go:1339 +0x105
goroutine 18 gp=0xc000504000 m=nil [GC worker (idle)]:
runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?)
runtime/proc.go:435 +0xce fp=0xc000080738 sp=0xc000080718 pc=0x6399639fa98e
runtime.gcBgMarkWorker(0xc0000b7b20)
runtime/mgc.go:1423 +0xe9 fp=0xc0000807c8 sp=0xc000080738 pc=0x6399639a81a9
runtime.gcBgMarkStartWorkers.gowrap1()
runtime/mgc.go:1339 +0x25 fp=0xc0000807e0 sp=0xc0000807c8 pc=0x6399639a8085
runtime.goexit({})
runtime/asm_amd64.s:1700 +0x1 fp=0xc0000807e8 sp=0xc0000807e0 pc=0x639963a020c1
created by runtime.gcBgMarkStartWorkers in goroutine 1
runtime/mgc.go:1339 +0x105
goroutine 34 gp=0xc000102380 m=nil [GC worker (idle)]:
runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?)
runtime/proc.go:435 +0xce fp=0xc00011a738 sp=0xc00011a718 pc=0x6399639fa98e
runtime.gcBgMarkWorker(0xc0000b7b20)
runtime/mgc.go:1423 +0xe9 fp=0xc00011a7c8 sp=0xc00011a738 pc=0x6399639a81a9
runtime.gcBgMarkStartWorkers.gowrap1()
runtime/mgc.go:1339 +0x25 fp=0xc00011a7e0 sp=0xc00011a7c8 pc=0x6399639a8085
runtime.goexit({})
runtime/asm_amd64.s:1700 +0x1 fp=0xc00011a7e8 sp=0xc00011a7e0 pc=0x639963a020c1
created by runtime.gcBgMarkStartWorkers in goroutine 1
runtime/mgc.go:1339 +0x105
goroutine 9 gp=0xc0001e1500 m=nil [GC worker (idle)]:
runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?)
runtime/proc.go:435 +0xce fp=0xc000087f38 sp=0xc000087f18 pc=0x6399639fa98e
runtime.gcBgMarkWorker(0xc0000b7b20)
runtime/mgc.go:1423 +0xe9 fp=0xc000087fc8 sp=0xc000087f38 pc=0x6399639a81a9
runtime.gcBgMarkStartWorkers.gowrap1()
runtime/mgc.go:1339 +0x25 fp=0xc000087fe0 sp=0xc000087fc8 pc=0x6399639a8085
runtime.goexit({})
runtime/asm_amd64.s:1700 +0x1 fp=0xc000087fe8 sp=0xc000087fe0 pc=0x639963a020c1
created by runtime.gcBgMarkStartWorkers in goroutine 1
runtime/mgc.go:1339 +0x105
goroutine 10 gp=0xc0001e16c0 m=nil [GC worker (idle)]:
runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?)
runtime/proc.go:435 +0xce fp=0xc000116738 sp=0xc000116718 pc=0x6399639fa98e
runtime.gcBgMarkWorker(0xc0000b7b20)
runtime/mgc.go:1423 +0xe9 fp=0xc0001167c8 sp=0xc000116738 pc=0x6399639a81a9
runtime.gcBgMarkStartWorkers.gowrap1()
runtime/mgc.go:1339 +0x25 fp=0xc0001167e0 sp=0xc0001167c8 pc=0x6399639a8085
runtime.goexit({})
runtime/asm_amd64.s:1700 +0x1 fp=0xc0001167e8 sp=0xc0001167e0 pc=0x639963a020c1
created by runtime.gcBgMarkStartWorkers in goroutine 1
runtime/mgc.go:1339 +0x105
goroutine 11 gp=0xc0001e1880 m=nil [GC worker (idle)]:
runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?)
runtime/proc.go:435 +0xce fp=0xc000116f38 sp=0xc000116f18 pc=0x6399639fa98e
runtime.gcBgMarkWorker(0xc0000b7b20)
runtime/mgc.go:1423 +0xe9 fp=0xc000116fc8 sp=0xc000116f38 pc=0x6399639a81a9
runtime.gcBgMarkStartWorkers.gowrap1()
runtime/mgc.go:1339 +0x25 fp=0xc000116fe0 sp=0xc000116fc8 pc=0x6399639a8085
runtime.goexit({})
runtime/asm_amd64.s:1700 +0x1 fp=0xc000116fe8 sp=0xc000116fe0 pc=0x639963a020c1
created by runtime.gcBgMarkStartWorkers in goroutine 1
runtime/mgc.go:1339 +0x105
goroutine 19 gp=0xc0005041c0 m=nil [GC worker (idle)]:
runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?)
runtime/proc.go:435 +0xce fp=0xc000080f38 sp=0xc000080f18 pc=0x6399639fa98e
runtime.gcBgMarkWorker(0xc0000b7b20)
runtime/mgc.go:1423 +0xe9 fp=0xc000080fc8 sp=0xc000080f38 pc=0x6399639a81a9
runtime.gcBgMarkStartWorkers.gowrap1()
runtime/mgc.go:1339 +0x25 fp=0xc000080fe0 sp=0xc000080fc8 pc=0x6399639a8085
runtime.goexit({})
runtime/asm_amd64.s:1700 +0x1 fp=0xc000080fe8 sp=0xc000080fe0 pc=0x639963a020c1
created by runtime.gcBgMarkStartWorkers in goroutine 1
runtime/mgc.go:1339 +0x105
goroutine 20 gp=0xc000504380 m=nil [GC worker (idle)]:
runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?)
runtime/proc.go:435 +0xce fp=0xc000081738 sp=0xc000081718 pc=0x6399639fa98e
runtime.gcBgMarkWorker(0xc0000b7b20)
runtime/mgc.go:1423 +0xe9 fp=0xc0000817c8 sp=0xc000081738 pc=0x6399639a81a9
runtime.gcBgMarkStartWorkers.gowrap1()
runtime/mgc.go:1339 +0x25 fp=0xc0000817e0 sp=0xc0000817c8 pc=0x6399639a8085
runtime.goexit({})
runtime/asm_amd64.s:1700 +0x1 fp=0xc0000817e8 sp=0xc0000817e0 pc=0x639963a020c1
created by runtime.gcBgMarkStartWorkers in goroutine 1
runtime/mgc.go:1339 +0x105
goroutine 35 gp=0xc000102540 m=nil [GC worker (idle)]:
runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?)
runtime/proc.go:435 +0xce fp=0xc00011af38 sp=0xc00011af18 pc=0x6399639fa98e
runtime.gcBgMarkWorker(0xc0000b7b20)
runtime/mgc.go:1423 +0xe9 fp=0xc00011afc8 sp=0xc00011af38 pc=0x6399639a81a9
runtime.gcBgMarkStartWorkers.gowrap1()
runtime/mgc.go:1339 +0x25 fp=0xc00011afe0 sp=0xc00011afc8 pc=0x6399639a8085
runtime.goexit({})
runtime/asm_amd64.s:1700 +0x1 fp=0xc00011afe8 sp=0xc00011afe0 pc=0x639963a020c1
created by runtime.gcBgMarkStartWorkers in goroutine 1
runtime/mgc.go:1339 +0x105
goroutine 12 gp=0xc0001e1a40 m=nil [GC worker (idle)]:
runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?)
runtime/proc.go:435 +0xce fp=0xc000117738 sp=0xc000117718 pc=0x6399639fa98e
runtime.gcBgMarkWorker(0xc0000b7b20)
runtime/mgc.go:1423 +0xe9 fp=0xc0001177c8 sp=0xc000117738 pc=0x6399639a81a9
runtime.gcBgMarkStartWorkers.gowrap1()
runtime/mgc.go:1339 +0x25 fp=0xc0001177e0 sp=0xc0001177c8 pc=0x6399639a8085
runtime.goexit({})
runtime/asm_amd64.s:1700 +0x1 fp=0xc0001177e8 sp=0xc0001177e0 pc=0x639963a020c1
created by runtime.gcBgMarkStartWorkers in goroutine 1
runtime/mgc.go:1339 +0x105
goroutine 21 gp=0xc000504540 m=nil [GC worker (idle)]:
runtime.gopark(0x21a0c049ba1f?, 0x0?, 0x0?, 0x0?, 0x0?)
runtime/proc.go:435 +0xce fp=0xc000081f38 sp=0xc000081f18 pc=0x6399639fa98e
runtime.gcBgMarkWorker(0xc0000b7b20)
runtime/mgc.go:1423 +0xe9 fp=0xc000081fc8 sp=0xc000081f38 pc=0x6399639a81a9
runtime.gcBgMarkStartWorkers.gowrap1()
runtime/mgc.go:1339 +0x25 fp=0xc000081fe0 sp=0xc000081fc8 pc=0x6399639a8085
runtime.goexit({})
runtime/asm_amd64.s:1700 +0x1 fp=0xc000081fe8 sp=0xc000081fe0 pc=0x639963a020c1
created by runtime.gcBgMarkStartWorkers in goroutine 1
runtime/mgc.go:1339 +0x105
goroutine 13 gp=0xc0001e1c00 m=nil [GC worker (idle)]:
runtime.gopark(0x21a0c049ac80?, 0x0?, 0x0?, 0x0?, 0x0?)
runtime/proc.go:435 +0xce fp=0xc000117f38 sp=0xc000117f18 pc=0x6399639fa98e
runtime.gcBgMarkWorker(0xc0000b7b20)
runtime/mgc.go:1423 +0xe9 fp=0xc000117fc8 sp=0xc000117f38 pc=0x6399639a81a9
runtime.gcBgMarkStartWorkers.gowrap1()
runtime/mgc.go:1339 +0x25 fp=0xc000117fe0 sp=0xc000117fc8 pc=0x6399639a8085
runtime.goexit({})
runtime/asm_amd64.s:1700 +0x1 fp=0xc000117fe8 sp=0xc000117fe0 pc=0x639963a020c1
created by runtime.gcBgMarkStartWorkers in goroutine 1
runtime/mgc.go:1339 +0x105
goroutine 22 gp=0xc000504700 m=nil [GC worker (idle)]:
runtime.gopark(0x21a0c049b309?, 0x3?, 0xe9?, 0x5?, 0x0?)
runtime/proc.go:435 +0xce fp=0xc000082738 sp=0xc000082718 pc=0x6399639fa98e
runtime.gcBgMarkWorker(0xc0000b7b20)
runtime/mgc.go:1423 +0xe9 fp=0xc0000827c8 sp=0xc000082738 pc=0x6399639a81a9
runtime.gcBgMarkStartWorkers.gowrap1()
runtime/mgc.go:1339 +0x25 fp=0xc0000827e0 sp=0xc0000827c8 pc=0x6399639a8085
runtime.goexit({})
runtime/asm_amd64.s:1700 +0x1 fp=0xc0000827e8 sp=0xc0000827e0 pc=0x639963a020c1
created by runtime.gcBgMarkStartWorkers in goroutine 1
runtime/mgc.go:1339 +0x105
goroutine 14 gp=0xc0001e1dc0 m=nil [GC worker (idle)]:
runtime.gopark(0x21a0c049b3b3?, 0x3?, 0x29?, 0x9?, 0x0?)
runtime/proc.go:435 +0xce fp=0xc000118738 sp=0xc000118718 pc=0x6399639fa98e
runtime.gcBgMarkWorker(0xc0000b7b20)
runtime/mgc.go:1423 +0xe9 fp=0xc0001187c8 sp=0xc000118738 pc=0x6399639a81a9
runtime.gcBgMarkStartWorkers.gowrap1()
runtime/mgc.go:1339 +0x25 fp=0xc0001187e0 sp=0xc0001187c8 pc=0x6399639a8085
runtime.goexit({})
runtime/asm_amd64.s:1700 +0x1 fp=0xc0001187e8 sp=0xc0001187e0 pc=0x639963a020c1
created by runtime.gcBgMarkStartWorkers in goroutine 1
runtime/mgc.go:1339 +0x105
goroutine 36 gp=0xc000102700 m=nil [GC worker (idle)]:
runtime.gopark(0x21a0c049bbce?, 0x0?, 0x0?, 0x0?, 0x0?)
runtime/proc.go:435 +0xce fp=0xc00011b738 sp=0xc00011b718 pc=0x6399639fa98e
runtime.gcBgMarkWorker(0xc0000b7b20)
runtime/mgc.go:1423 +0xe9 fp=0xc00011b7c8 sp=0xc00011b738 pc=0x6399639a81a9
runtime.gcBgMarkStartWorkers.gowrap1()
runtime/mgc.go:1339 +0x25 fp=0xc00011b7e0 sp=0xc00011b7c8 pc=0x6399639a8085
runtime.goexit({})
runtime/asm_amd64.s:1700 +0x1 fp=0xc00011b7e8 sp=0xc00011b7e0 pc=0x639963a020c1
created by runtime.gcBgMarkStartWorkers in goroutine 1
runtime/mgc.go:1339 +0x105
goroutine 50 gp=0xc000504c40 m=nil [select]:
runtime.gopark(0xc000047a58?, 0x2?, 0x40?, 0x68?, 0xc000047834?)
runtime/proc.go:435 +0xce fp=0xc000047648 sp=0xc000047628 pc=0x6399639fa98e
runtime.selectgo(0xc000047a58, 0xc000047830, 0x4?, 0x0, 0x1?, 0x1)
runtime/select.go:351 +0x837 fp=0xc000047780 sp=0xc000047648 pc=0x6399639d9297
github.com/ollama/ollama/runner/llamarunner.(*Server).completion(0xc0004ba000, {0x639964ca5dc8, 0xc000514d20}, 0xc000154280)
github.com/ollama/ollama/runner/llamarunner/runner.go:688 +0xa25 fp=0xc000047ac0 sp=0xc000047780 pc=0x639963d9aac5
github.com/ollama/ollama/runner/llamarunner.(*Server).completion-fm({0x639964ca5dc8?, 0xc000514d20?}, 0xc0004e3b40?)
:1 +0x36 fp=0xc000047af0 sp=0xc000047ac0 pc=0x639963d9dad6
net/http.HandlerFunc.ServeHTTP(0xc0000ea240?, {0x639964ca5dc8?, 0xc000514d20?}, 0xc0004e3b60?)
net/http/server.go:2294 +0x29 fp=0xc000047b18 sp=0xc000047af0 pc=0x639963cf7e89
net/http.(*ServeMux).ServeHTTP(0x63996399f1c5?, {0x639964ca5dc8, 0xc000514d20}, 0xc000154280)
net/http/server.go:2822 +0x1c4 fp=0xc000047b68 sp=0xc000047b18 pc=0x639963cf9d84
net/http.serverHandler.ServeHTTP({0x639964ca2370?}, {0x639964ca5dc8?, 0xc000514d20?}, 0x1?)
net/http/server.go:3301 +0x8e fp=0xc000047b98 sp=0xc000047b68 pc=0x639963d1780e
net/http.(*conn).serve(0xc0004ba120, {0x639964ca7e28, 0xc00071c3f0})
net/http/server.go:2102 +0x625 fp=0xc000047fb8 sp=0xc000047b98 pc=0x639963cf6385
net/http.(*Server).Serve.gowrap3()
net/http/server.go:3454 +0x28 fp=0xc000047fe0 sp=0xc000047fb8 pc=0x639963cfbc48
runtime.goexit({})
runtime/asm_amd64.s:1700 +0x1 fp=0xc000047fe8 sp=0xc000047fe0 pc=0x639963a020c1
created by net/http.(*Server).Serve in goroutine 1
net/http/server.go:3454 +0x485
goroutine 39 gp=0xc000102a80 m=nil [IO wait]:
runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0xb?)
runtime/proc.go:435 +0xce fp=0xc0002445d8 sp=0xc0002445b8 pc=0x6399639fa98e
runtime.netpollblock(0x639963a1de18?, 0x639942c6?, 0x99?)
runtime/netpoll.go:575 +0xf7 fp=0xc000244610 sp=0xc0002445d8 pc=0x6399639bf797
internal/poll.runtime_pollWait(0x70d9a23c7d98, 0x72)
runtime/netpoll.go:351 +0x85 fp=0xc000244630 sp=0xc000244610 pc=0x6399639f9ba5
internal/poll.(*pollDesc).wait(0xc00004eb80?, 0xc00071c521?, 0x0)
internal/poll/fd_poll_runtime.go:84 +0x27 fp=0xc000244658 sp=0xc000244630 pc=0x639963a81027
internal/poll.(*pollDesc).waitRead(...)
internal/poll/fd_poll_runtime.go:89
internal/poll.(*FD).Read(0xc00004eb80, {0xc00071c521, 0x1, 0x1})
internal/poll/fd_unix.go:165 +0x27a fp=0xc0002446f0 sp=0xc000244658 pc=0x639963a8231a
net.(*netFD).Read(0xc00004eb80, {0xc00071c521?, 0xc00071a158?, 0xc000244770?})
net/fd_posix.go:55 +0x25 fp=0xc000244738 sp=0xc0002446f0 pc=0x639963af68c5
net.(*conn).Read(0xc00051e078, {0xc00071c521?, 0x0?, 0x0?})
net/net.go:194 +0x45 fp=0xc000244780 sp=0xc000244738 pc=0x639963b04c85
net/http.(*connReader).backgroundRead(0xc00071c510)
net/http/server.go:690 +0x37 fp=0xc0002447c8 sp=0xc000244780 pc=0x639963cf0257
net/http.(*connReader).startBackgroundRead.gowrap2()
net/http/server.go:686 +0x25 fp=0xc0002447e0 sp=0xc0002447c8 pc=0x639963cf0185
runtime.goexit({})
runtime/asm_amd64.s:1700 +0x1 fp=0xc0002447e8 sp=0xc0002447e0 pc=0x639963a020c1
created by net/http.(*connReader).startBackgroundRead in goroutine 50
net/http/server.go:686 +0xb6
rax 0x0
rbx 0x70d9e8eb9fc0
rcx 0x70d9e8efe00b
rdx 0x0
rdi 0x2
rsi 0x7ffec1e71dd0
rbp 0x70d978500000
rsp 0x7ffec1e71dd0
r8 0x0
r9 0x7ffec1e71dd0
r10 0x8
r11 0x246
r12 0x7ffec1e72050
r13 0x0
r14 0x1000
r15 0x0
rip 0x70d9e8efe00b
rflags 0x246
cs 0x33
fs 0x0
gs 0x0
SIGABRT: abort
PC=0x70d9e8efe00b m=0 sigcode=18446744073709551610
signal arrived during cgo execution
goroutine 16 gp=0xc000504a80 m=0 mp=0x6399654f96c0 [syscall]:
runtime.cgocall(0x639964647600, 0xc000093bc8)
runtime/cgocall.go:167 +0x4b fp=0xc000093ba0 sp=0xc000093b68 pc=0x6399639f76ab
github.com/ollama/ollama/llama._Cfunc_llama_decode(0x70d7cc836840, {0x4, 0x70d7cc857970, 0x0, 0x0, 0x70d7cc8610d0, 0x70d7cc859e30, 0x70d7cc78bbc0, 0x70d97c6252d0})
_cgo_gotypes.go:557 +0x4a fp=0xc000093bc8 sp=0xc000093ba0 pc=0x639963d7d46a
github.com/ollama/ollama/llama.(*Context).Decode.func1(...)
github.com/ollama/ollama/llama/llama.go:157
github.com/ollama/ollama/llama.(*Context).Decode(0xc00011c5d0?, 0x0?)
github.com/ollama/ollama/llama/llama.go:157 +0xf6 fp=0xc000093cc8 sp=0xc000093bc8 pc=0x639963d80076
github.com/ollama/ollama/runner/llamarunner.(*Server).processBatch(0xc0004ba000, 0xc0004264e0, 0xc00011c720)
github.com/ollama/ollama/runner/llamarunner/runner.go:435 +0x23e fp=0xc000093ee0 sp=0xc000093cc8 pc=0x639963d990be
github.com/ollama/ollama/runner/llamarunner.(*Server).run(0xc0004ba000, {0x639964ca7e60, 0xc0001280a0})
github.com/ollama/ollama/runner/llamarunner/runner.go:343 +0x1d5 fp=0xc000093fb8 sp=0xc000093ee0 pc=0x639963d98d15
github.com/ollama/ollama/runner/llamarunner.Execute.gowrap2()
github.com/ollama/ollama/runner/llamarunner/runner.go:973 +0x28 fp=0xc000093fe0 sp=0xc000093fb8 pc=0x639963d9d6a8
runtime.goexit({})
runtime/asm_amd64.s:1700 +0x1 fp=0xc000093fe8 sp=0xc000093fe0 pc=0x639963a020c1
created by github.com/ollama/ollama/runner/llamarunner.Execute in goroutine 1
github.com/ollama/ollama/runner/llamarunner/runner.go:973 +0xd97
goroutine 1 gp=0xc000002380 m=nil [IO wait]:
runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?)
runtime/proc.go:435 +0xce fp=0xc0005875b8 sp=0xc000587598 pc=0x6399639fa98e
runtime.netpollblock(0xc000587608?, 0x639942c6?, 0x99?)
runtime/netpoll.go:575 +0xf7 fp=0xc0005875f0 sp=0xc0005875b8 pc=0x6399639bf797
internal/poll.runtime_pollWait(0x70d9a23c7eb0, 0x72)
runtime/netpoll.go:351 +0x85 fp=0xc000587610 sp=0xc0005875f0 pc=0x6399639f9ba5
internal/poll.(*pollDesc).wait(0xc00004e100?, 0x900000036?, 0x0)
internal/poll/fd_poll_runtime.go:84 +0x27 fp=0xc000587638 sp=0xc000587610 pc=0x639963a81027
internal/poll.(*pollDesc).waitRead(...)
internal/poll/fd_poll_runtime.go:89
internal/poll.(*FD).Accept(0xc00004e100)
internal/poll/fd_unix.go:620 +0x295 fp=0xc0005876e0 sp=0xc000587638 pc=0x639963a863f5
net.(*netFD).accept(0xc00004e100)
net/fd_unix.go:172 +0x29 fp=0xc000587798 sp=0xc0005876e0 pc=0x639963af8869
net.(*TCPListener).accept(0xc00071a080)
net/tcpsock_posix.go:159 +0x1b fp=0xc0005877e8 sp=0xc000587798 pc=0x639963b0e21b
net.(*TCPListener).Accept(0xc00071a080)
net/tcpsock.go:380 +0x30 fp=0xc000587818 sp=0xc0005877e8 pc=0x639963b0d0d0
net/http.(*onceCloseListener).Accept(0xc0004ba120?)
:1 +0x24 fp=0xc000587830 sp=0xc000587818 pc=0x639963d23f84
net/http.(*Server).Serve(0xc000534200, {0x639964ca5be8, 0xc00071a080})
net/http/server.go:3424 +0x30c fp=0xc000587960 sp=0xc000587830 pc=0x639963cfb84c
github.com/ollama/ollama/runner/llamarunner.Execute({0xc000034120, 0xe, 0xe})
github.com/ollama/ollama/runner/llamarunner/runner.go:993 +0x116a fp=0xc000587d08 sp=0xc000587960 pc=0x639963d9d3ea
github.com/ollama/ollama/runner.Execute({0xc000034110?, 0x0?, 0x0?})
github.com/ollama/ollama/runner/runner.go:22 +0xd4 fp=0xc000587d30 sp=0xc000587d08 pc=0x639963fc6514
github.com/ollama/ollama/cmd.NewCLI.func2(0xc000035300?, {0x639964825055?, 0x4?, 0x639964825059?})
github.com/ollama/ollama/cmd/cmd.go:1280 +0x45 fp=0xc000587d58 sp=0xc000587d30 pc=0x6399645daae5
github.com/spf13/cobra.(*Command).execute(0xc00013ef08, {0xc0005149a0, 0xe, 0xe})
github.com/spf13/cobra@v1.7.0/command.go:940 +0x85c fp=0xc000587e78 sp=0xc000587d58 pc=0x639963b71afc
github.com/spf13/cobra.(*Command).ExecuteC(0xc00054e908)
github.com/spf13/cobra@v1.7.0/command.go:1068 +0x3a5 fp=0xc000587f30 sp=0xc000587e78 pc=0x639963b72345
github.com/spf13/cobra.(*Command).Execute(...)
github.com/spf13/cobra@v1.7.0/command.go:992
github.com/spf13/cobra.(*Command).ExecuteContext(...)
github.com/spf13/cobra@v1.7.0/command.go:985
main.main()
github.com/ollama/ollama/main.go:12 +0x4d fp=0xc000587f50 sp=0xc000587f30 pc=0x6399645dae4d
runtime.main()
runtime/proc.go:283 +0x29d fp=0xc000587fe0 sp=0xc000587f50 pc=0x6399639c6d9d
runtime.goexit({})
runtime/asm_amd64.s:1700 +0x1 fp=0xc000587fe8 sp=0xc000587fe0 pc=0x639963a020c1
goroutine 2 gp=0xc000002e00 m=nil [force gc (idle)]:
runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?)
runtime/proc.go:435 +0xce fp=0xc000084fa8 sp=0xc000084f88 pc=0x6399639fa98e
runtime.goparkunlock(...)
runtime/proc.go:441
runtime.forcegchelper()
runtime/proc.go:348 +0xb8 fp=0xc000084fe0 sp=0xc000084fa8 pc=0x6399639c70d8
runtime.goexit({})
runtime/asm_amd64.s:1700 +0x1 fp=0xc000084fe8 sp=0xc000084fe0 pc=0x639963a020c1
created by runtime.init.7 in goroutine 1
runtime/proc.go:336 +0x1a
goroutine 3 gp=0xc000003340 m=nil [GC sweep wait]:
runtime.gopark(0x1?, 0x0?, 0x0?, 0x0?, 0x0?)
runtime/proc.go:435 +0xce fp=0xc000085780 sp=0xc000085760 pc=0x6399639fa98e
runtime.goparkunlock(...)
runtime/proc.go:441
runtime.bgsweep(0xc00003e080)
runtime/mgcsweep.go:316 +0xdf fp=0xc0000857c8 sp=0xc000085780 pc=0x6399639b18ff
runtime.gcenable.gowrap1()
runtime/mgc.go:204 +0x25 fp=0xc0000857e0 sp=0xc0000857c8 pc=0x6399639a5ce5
runtime.goexit({})
runtime/asm_amd64.s:1700 +0x1 fp=0xc0000857e8 sp=0xc0000857e0 pc=0x639963a020c1
created by runtime.gcenable in goroutine 1
runtime/mgc.go:204 +0x66
goroutine 4 gp=0xc000003500 m=nil [GC scavenge wait]:
runtime.gopark(0x10000?, 0x6399649d76b8?, 0x0?, 0x0?, 0x0?)
runtime/proc.go:435 +0xce fp=0xc000085f78 sp=0xc000085f58 pc=0x6399639fa98e
runtime.goparkunlock(...)
runtime/proc.go:441
runtime.(*scavengerState).park(0x6399654f68a0)
runtime/mgcscavenge.go:425 +0x49 fp=0xc000085fa8 sp=0xc000085f78 pc=0x6399639af349
runtime.bgscavenge(0xc00003e080)
runtime/mgcscavenge.go:658 +0x59 fp=0xc000085fc8 sp=0xc000085fa8 pc=0x6399639af8d9
runtime.gcenable.gowrap2()
runtime/mgc.go:205 +0x25 fp=0xc000085fe0 sp=0xc000085fc8 pc=0x6399639a5c85
runtime.goexit({})
runtime/asm_amd64.s:1700 +0x1 fp=0xc000085fe8 sp=0xc000085fe0 pc=0x639963a020c1
created by runtime.gcenable in goroutine 1
runtime/mgc.go:205 +0xa5
goroutine 5 gp=0xc000003dc0 m=nil [finalizer wait]:
runtime.gopark(0x1b8?, 0xc000002380?, 0x1?, 0x23?, 0xc000084688?)
runtime/proc.go:435 +0xce fp=0xc000084630 sp=0xc000084610 pc=0x6399639fa98e
runtime.runfinq()
runtime/mfinal.go:196 +0x107 fp=0xc0000847e0 sp=0xc000084630 pc=0x6399639a4ca7
runtime.goexit({})
runtime/asm_amd64.s:1700 +0x1 fp=0xc0000847e8 sp=0xc0000847e0 pc=0x639963a020c1
created by runtime.createfing in goroutine 1
runtime/mfinal.go:166 +0x3d
goroutine 6 gp=0xc0001e08c0 m=nil [chan receive]:
runtime.gopark(0xc0000ff900?, 0xc000588018?, 0x60?, 0x67?, 0x639963adf5a8?)
runtime/proc.go:435 +0xce fp=0xc000086718 sp=0xc0000866f8 pc=0x6399639fa98e
runtime.chanrecv(0xc0000b6380, 0x0, 0x1)
runtime/chan.go:664 +0x445 fp=0xc000086790 sp=0xc000086718 pc=0x639963996ea5
runtime.chanrecv1(0x0?, 0x0?)
runtime/chan.go:506 +0x12 fp=0xc0000867b8 sp=0xc000086790 pc=0x639963996a32
runtime.unique_runtime_registerUniqueMapCleanup.func2(...)
runtime/mgc.go:1796
runtime.unique_runtime_registerUniqueMapCleanup.gowrap1()
runtime/mgc.go:1799 +0x2f fp=0xc0000867e0 sp=0xc0000867b8 pc=0x6399639a8e8f
runtime.goexit({})
runtime/asm_amd64.s:1700 +0x1 fp=0xc0000867e8 sp=0xc0000867e0 pc=0x639963a020c1
created by unique.runtime_registerUniqueMapCleanup in goroutine 1
runtime/mgc.go:1794 +0x85
goroutine 7 gp=0xc0001e1180 m=nil [GC worker (idle)]:
runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?)
runtime/proc.go:435 +0xce fp=0xc000086f38 sp=0xc000086f18 pc=0x6399639fa98e
runtime.gcBgMarkWorker(0xc0000b7b20)
runtime/mgc.go:1423 +0xe9 fp=0xc000086fc8 sp=0xc000086f38 pc=0x6399639a81a9
runtime.gcBgMarkStartWorkers.gowrap1()
runtime/mgc.go:1339 +0x25 fp=0xc000086fe0 sp=0xc000086fc8 pc=0x6399639a8085
runtime.goexit({})
runtime/asm_amd64.s:1700 +0x1 fp=0xc000086fe8 sp=0xc000086fe0 pc=0x639963a020c1
created by runtime.gcBgMarkStartWorkers in goroutine 1
runtime/mgc.go:1339 +0x105
goroutine 8 gp=0xc0001e1340 m=nil [GC worker (idle)]:
runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?)
runtime/proc.go:435 +0xce fp=0xc000087738 sp=0xc000087718 pc=0x6399639fa98e
runtime.gcBgMarkWorker(0xc0000b7b20)
runtime/mgc.go:1423 +0xe9 fp=0xc0000877c8 sp=0xc000087738 pc=0x6399639a81a9
runtime.gcBgMarkStartWorkers.gowrap1()
runtime/mgc.go:1339 +0x25 fp=0xc0000877e0 sp=0xc0000877c8 pc=0x6399639a8085
runtime.goexit({})
runtime/asm_amd64.s:1700 +0x1 fp=0xc0000877e8 sp=0xc0000877e0 pc=0x639963a020c1
created by runtime.gcBgMarkStartWorkers in goroutine 1
runtime/mgc.go:1339 +0x105
goroutine 18 gp=0xc000504000 m=nil [GC worker (idle)]:
runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?)
runtime/proc.go:435 +0xce fp=0xc000080738 sp=0xc000080718 pc=0x6399639fa98e
runtime.gcBgMarkWorker(0xc0000b7b20)
runtime/mgc.go:1423 +0xe9 fp=0xc0000807c8 sp=0xc000080738 pc=0x6399639a81a9
runtime.gcBgMarkStartWorkers.gowrap1()
runtime/mgc.go:1339 +0x25 fp=0xc0000807e0 sp=0xc0000807c8 pc=0x6399639a8085
runtime.goexit({})
runtime/asm_amd64.s:1700 +0x1 fp=0xc0000807e8 sp=0xc0000807e0 pc=0x639963a020c1
created by runtime.gcBgMarkStartWorkers in goroutine 1
runtime/mgc.go:1339 +0x105
goroutine 34 gp=0xc000102380 m=nil [GC worker (idle)]:
runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?)
runtime/proc.go:435 +0xce fp=0xc00011a738 sp=0xc00011a718 pc=0x6399639fa98e
runtime.gcBgMarkWorker(0xc0000b7b20)
runtime/mgc.go:1423 +0xe9 fp=0xc00011a7c8 sp=0xc00011a738 pc=0x6399639a81a9
runtime.gcBgMarkStartWorkers.gowrap1()
runtime/mgc.go:1339 +0x25 fp=0xc00011a7e0 sp=0xc00011a7c8 pc=0x6399639a8085
runtime.goexit({})
runtime/asm_amd64.s:1700 +0x1 fp=0xc00011a7e8 sp=0xc00011a7e0 pc=0x639963a020c1
created by runtime.gcBgMarkStartWorkers in goroutine 1
runtime/mgc.go:1339 +0x105
goroutine 9 gp=0xc0001e1500 m=nil [GC worker (idle)]:
runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?)
runtime/proc.go:435 +0xce fp=0xc000087f38 sp=0xc000087f18 pc=0x6399639fa98e
runtime.gcBgMarkWorker(0xc0000b7b20)
runtime/mgc.go:1423 +0xe9 fp=0xc000087fc8 sp=0xc000087f38 pc=0x6399639a81a9
runtime.gcBgMarkStartWorkers.gowrap1()
runtime/mgc.go:1339 +0x25 fp=0xc000087fe0 sp=0xc000087fc8 pc=0x6399639a8085
runtime.goexit({})
runtime/asm_amd64.s:1700 +0x1 fp=0xc000087fe8 sp=0xc000087fe0 pc=0x639963a020c1
created by runtime.gcBgMarkStartWorkers in goroutine 1
runtime/mgc.go:1339 +0x105
goroutine 10 gp=0xc0001e16c0 m=nil [GC worker (idle)]:
runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?)
runtime/proc.go:435 +0xce fp=0xc000116738 sp=0xc000116718 pc=0x6399639fa98e
runtime.gcBgMarkWorker(0xc0000b7b20)
runtime/mgc.go:1423 +0xe9 fp=0xc0001167c8 sp=0xc000116738 pc=0x6399639a81a9
runtime.gcBgMarkStartWorkers.gowrap1()
runtime/mgc.go:1339 +0x25 fp=0xc0001167e0 sp=0xc0001167c8 pc=0x6399639a8085
runtime.goexit({})
runtime/asm_amd64.s:1700 +0x1 fp=0xc0001167e8 sp=0xc0001167e0 pc=0x639963a020c1
created by runtime.gcBgMarkStartWorkers in goroutine 1
runtime/mgc.go:1339 +0x105
goroutine 11 gp=0xc0001e1880 m=nil [GC worker (idle)]:
runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?)
runtime/proc.go:435 +0xce fp=0xc000116f38 sp=0xc000116f18 pc=0x6399639fa98e
runtime.gcBgMarkWorker(0xc0000b7b20)
runtime/mgc.go:1423 +0xe9 fp=0xc000116fc8 sp=0xc000116f38 pc=0x6399639a81a9
runtime.gcBgMarkStartWorkers.gowrap1()
runtime/mgc.go:1339 +0x25 fp=0xc000116fe0 sp=0xc000116fc8 pc=0x6399639a8085
runtime.goexit({})
runtime/asm_amd64.s:1700 +0x1 fp=0xc000116fe8 sp=0xc000116fe0 pc=0x639963a020c1
created by runtime.gcBgMarkStartWorkers in goroutine 1
runtime/mgc.go:1339 +0x105
goroutine 19 gp=0xc0005041c0 m=nil [GC worker (idle)]:
runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?)
runtime/proc.go:435 +0xce fp=0xc000080f38 sp=0xc000080f18 pc=0x6399639fa98e
runtime.gcBgMarkWorker(0xc0000b7b20)
runtime/mgc.go:1423 +0xe9 fp=0xc000080fc8 sp=0xc000080f38 pc=0x6399639a81a9
runtime.gcBgMarkStartWorkers.gowrap1()
runtime/mgc.go:1339 +0x25 fp=0xc000080fe0 sp=0xc000080fc8 pc=0x6399639a8085
runtime.goexit({})
runtime/asm_amd64.s:1700 +0x1 fp=0xc000080fe8 sp=0xc000080fe0 pc=0x639963a020c1
created by runtime.gcBgMarkStartWorkers in goroutine 1
runtime/mgc.go:1339 +0x105
goroutine 20 gp=0xc000504380 m=nil [GC worker (idle)]:
runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?)
runtime/proc.go:435 +0xce fp=0xc000081738 sp=0xc000081718 pc=0x6399639fa98e
runtime.gcBgMarkWorker(0xc0000b7b20)
runtime/mgc.go:1423 +0xe9 fp=0xc0000817c8 sp=0xc000081738 pc=0x6399639a81a9
runtime.gcBgMarkStartWorkers.gowrap1()
runtime/mgc.go:1339 +0x25 fp=0xc0000817e0 sp=0xc0000817c8 pc=0x6399639a8085
runtime.goexit({})
runtime/asm_amd64.s:1700 +0x1 fp=0xc0000817e8 sp=0xc0000817e0 pc=0x639963a020c1
created by runtime.gcBgMarkStartWorkers in goroutine 1
runtime/mgc.go:1339 +0x105
goroutine 35 gp=0xc000102540 m=nil [GC worker (idle)]:
runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?)
runtime/proc.go:435 +0xce fp=0xc00011af38 sp=0xc00011af18 pc=0x6399639fa98e
runtime.gcBgMarkWorker(0xc0000b7b20)
runtime/mgc.go:1423 +0xe9 fp=0xc00011afc8 sp=0xc00011af38 pc=0x6399639a81a9
runtime.gcBgMarkStartWorkers.gowrap1()
runtime/mgc.go:1339 +0x25 fp=0xc00011afe0 sp=0xc00011afc8 pc=0x6399639a8085
runtime.goexit({})
runtime/asm_amd64.s:1700 +0x1 fp=0xc00011afe8 sp=0xc00011afe0 pc=0x639963a020c1
created by runtime.gcBgMarkStartWorkers in goroutine 1
runtime/mgc.go:1339 +0x105
goroutine 12 gp=0xc0001e1a40 m=nil [GC worker (idle)]:
runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?)
runtime/proc.go:435 +0xce fp=0xc000117738 sp=0xc000117718 pc=0x6399639fa98e
runtime.gcBgMarkWorker(0xc0000b7b20)
runtime/mgc.go:1423 +0xe9 fp=0xc0001177c8 sp=0xc000117738 pc=0x6399639a81a9
runtime.gcBgMarkStartWorkers.gowrap1()
runtime/mgc.go:1339 +0x25 fp=0xc0001177e0 sp=0xc0001177c8 pc=0x6399639a8085
runtime.goexit({})
runtime/asm_amd64.s:1700 +0x1 fp=0xc0001177e8 sp=0xc0001177e0 pc=0x639963a020c1
created by runtime.gcBgMarkStartWorkers in goroutine 1
runtime/mgc.go:1339 +0x105
goroutine 21 gp=0xc000504540 m=nil [GC worker (idle)]:
runtime.gopark(0x21a0c049ba1f?, 0x0?, 0x0?, 0x0?, 0x0?)
runtime/proc.go:435 +0xce fp=0xc000081f38 sp=0xc000081f18 pc=0x6399639fa98e
runtime.gcBgMarkWorker(0xc0000b7b20)
runtime/mgc.go:1423 +0xe9 fp=0xc000081fc8 sp=0xc000081f38 pc=0x6399639a81a9
runtime.gcBgMarkStartWorkers.gowrap1()
runtime/mgc.go:1339 +0x25 fp=0xc000081fe0 sp=0xc000081fc8 pc=0x6399639a8085
runtime.goexit({})
runtime/asm_amd64.s:1700 +0x1 fp=0xc000081fe8 sp=0xc000081fe0 pc=0x639963a020c1
created by runtime.gcBgMarkStartWorkers in goroutine 1
runtime/mgc.go:1339 +0x105
goroutine 13 gp=0xc0001e1c00 m=nil [GC worker (idle)]:
runtime.gopark(0x21a0c049ac80?, 0x0?, 0x0?, 0x0?, 0x0?)
runtime/proc.go:435 +0xce fp=0xc000117f38 sp=0xc000117f18 pc=0x6399639fa98e
runtime.gcBgMarkWorker(0xc0000b7b20)
runtime/mgc.go:1423 +0xe9 fp=0xc000117fc8 sp=0xc000117f38 pc=0x6399639a81a9
runtime.gcBgMarkStartWorkers.gowrap1()
runtime/mgc.go:1339 +0x25 fp=0xc000117fe0 sp=0xc000117fc8 pc=0x6399639a8085
runtime.goexit({})
runtime/asm_amd64.s:1700 +0x1 fp=0xc000117fe8 sp=0xc000117fe0 pc=0x639963a020c1
created by runtime.gcBgMarkStartWorkers in goroutine 1
runtime/mgc.go:1339 +0x105
goroutine 22 gp=0xc000504700 m=nil [GC worker (idle)]:
runtime.gopark(0x21a0c049b309?, 0x3?, 0xe9?, 0x5?, 0x0?)
runtime/proc.go:435 +0xce fp=0xc000082738 sp=0xc000082718 pc=0x6399639fa98e
runtime.gcBgMarkWorker(0xc0000b7b20)
runtime/mgc.go:1423 +0xe9 fp=0xc0000827c8 sp=0xc000082738 pc=0x6399639a81a9
runtime.gcBgMarkStartWorkers.gowrap1()
runtime/mgc.go:1339 +0x25 fp=0xc0000827e0 sp=0xc0000827c8 pc=0x6399639a8085
runtime.goexit({})
runtime/asm_amd64.s:1700 +0x1 fp=0xc0000827e8 sp=0xc0000827e0 pc=0x639963a020c1
created by runtime.gcBgMarkStartWorkers in goroutine 1
runtime/mgc.go:1339 +0x105
goroutine 14 gp=0xc0001e1dc0 m=nil [GC worker (idle)]:
runtime.gopark(0x21a0c049b3b3?, 0x3?, 0x29?, 0x9?, 0x0?)
runtime/proc.go:435 +0xce fp=0xc000118738 sp=0xc000118718 pc=0x6399639fa98e
runtime.gcBgMarkWorker(0xc0000b7b20)
runtime/mgc.go:1423 +0xe9 fp=0xc0001187c8 sp=0xc000118738 pc=0x6399639a81a9
runtime.gcBgMarkStartWorkers.gowrap1()
runtime/mgc.go:1339 +0x25 fp=0xc0001187e0 sp=0xc0001187c8 pc=0x6399639a8085
runtime.goexit({})
runtime/asm_amd64.s:1700 +0x1 fp=0xc0001187e8 sp=0xc0001187e0 pc=0x639963a020c1
created by runtime.gcBgMarkStartWorkers in goroutine 1
runtime/mgc.go:1339 +0x105
goroutine 36 gp=0xc000102700 m=nil [GC worker (idle)]:
runtime.gopark(0x21a0c049bbce?, 0x0?, 0x0?, 0x0?, 0x0?)
runtime/proc.go:435 +0xce fp=0xc00011b738 sp=0xc00011b718 pc=0x6399639fa98e
runtime.gcBgMarkWorker(0xc0000b7b20)
runtime/mgc.go:1423 +0xe9 fp=0xc00011b7c8 sp=0xc00011b738 pc=0x6399639a81a9
runtime.gcBgMarkStartWorkers.gowrap1()
runtime/mgc.go:1339 +0x25 fp=0xc00011b7e0 sp=0xc00011b7c8 pc=0x6399639a8085
runtime.goexit({})
runtime/asm_amd64.s:1700 +0x1 fp=0xc00011b7e8 sp=0xc00011b7e0 pc=0x639963a020c1
created by runtime.gcBgMarkStartWorkers in goroutine 1
runtime/mgc.go:1339 +0x105
goroutine 50 gp=0xc000504c40 m=nil [select]:
runtime.gopark(0xc000047a58?, 0x2?, 0x40?, 0x68?, 0xc000047834?)
runtime/proc.go:435 +0xce fp=0xc000047648 sp=0xc000047628 pc=0x6399639fa98e
runtime.selectgo(0xc000047a58, 0xc000047830, 0x4?, 0x0, 0x1?, 0x1)
runtime/select.go:351 +0x837 fp=0xc000047780 sp=0xc000047648 pc=0x6399639d9297
github.com/ollama/ollama/runner/llamarunner.(*Server).completion(0xc0004ba000, {0x639964ca5dc8, 0xc000514d20}, 0xc000154280)
github.com/ollama/ollama/runner/llamarunner/runner.go:688 +0xa25 fp=0xc000047ac0 sp=0xc000047780 pc=0x639963d9aac5
github.com/ollama/ollama/runner/llamarunner.(*Server).completion-fm({0x639964ca5dc8?, 0xc000514d20?}, 0xc0004e3b40?)
:1 +0x36 fp=0xc000047af0 sp=0xc000047ac0 pc=0x639963d9dad6
net/http.HandlerFunc.ServeHTTP(0xc0000ea240?, {0x639964ca5dc8?, 0xc000514d20?}, 0xc0004e3b60?)
net/http/server.go:2294 +0x29 fp=0xc000047b18 sp=0xc000047af0 pc=0x639963cf7e89
net/http.(*ServeMux).ServeHTTP(0x63996399f1c5?, {0x639964ca5dc8, 0xc000514d20}, 0xc000154280)
net/http/server.go:2822 +0x1c4 fp=0xc000047b68 sp=0xc000047b18 pc=0x639963cf9d84
net/http.serverHandler.ServeHTTP({0x639964ca2370?}, {0x639964ca5dc8?, 0xc000514d20?}, 0x1?)
net/http/server.go:3301 +0x8e fp=0xc000047b98 sp=0xc000047b68 pc=0x639963d1780e
net/http.(*conn).serve(0xc0004ba120, {0x639964ca7e28, 0xc00071c3f0})
net/http/server.go:2102 +0x625 fp=0xc000047fb8 sp=0xc000047b98 pc=0x639963cf6385
net/http.(*Server).Serve.gowrap3()
net/http/server.go:3454 +0x28 fp=0xc000047fe0 sp=0xc000047fb8 pc=0x639963cfbc48
runtime.goexit({})
runtime/asm_amd64.s:1700 +0x1 fp=0xc000047fe8 sp=0xc000047fe0 pc=0x639963a020c1
created by net/http.(*Server).Serve in goroutine 1
net/http/server.go:3454 +0x485
goroutine 39 gp=0xc000102a80 m=nil [IO wait]:
runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0xb?)
runtime/proc.go:435 +0xce fp=0xc0002445d8 sp=0xc0002445b8 pc=0x6399639fa98e
runtime.netpollblock(0x639963a1de18?, 0x639942c6?, 0x99?)
runtime/netpoll.go:575 +0xf7 fp=0xc000244610 sp=0xc0002445d8 pc=0x6399639bf797
internal/poll.runtime_pollWait(0x70d9a23c7d98, 0x72)
runtime/netpoll.go:351 +0x85 fp=0xc000244630 sp=0xc000244610 pc=0x6399639f9ba5
internal/poll.(*pollDesc).wait(0xc00004eb80?, 0xc00071c521?, 0x0)
internal/poll/fd_poll_runtime.go:84 +0x27 fp=0xc000244658 sp=0xc000244630 pc=0x639963a81027
internal/poll.(*pollDesc).waitRead(...)
internal/poll/fd_poll_runtime.go:89
internal/poll.(*FD).Read(0xc00004eb80, {0xc00071c521, 0x1, 0x1})
internal/poll/fd_unix.go:165 +0x27a fp=0xc0002446f0 sp=0xc000244658 pc=0x639963a8231a
net.(*netFD).Read(0xc00004eb80, {0xc00071c521?, 0xc00071a158?, 0xc000244770?})
net/fd_posix.go:55 +0x25 fp=0xc000244738 sp=0xc0002446f0 pc=0x639963af68c5
net.(*conn).Read(0xc00051e078, {0xc00071c521?, 0x0?, 0x0?})
net/net.go:194 +0x45 fp=0xc000244780 sp=0xc000244738 pc=0x639963b04c85
net/http.(*connReader).backgroundRead(0xc00071c510)
net/http/server.go:690 +0x37 fp=0xc0002447c8 sp=0xc000244780 pc=0x639963cf0257
net/http.(*connReader).startBackgroundRead.gowrap2()
net/http/server.go:686 +0x25 fp=0xc0002447e0 sp=0xc0002447c8 pc=0x639963cf0185
runtime.goexit({})
runtime/asm_amd64.s:1700 +0x1 fp=0xc0002447e8 sp=0xc0002447e0 pc=0x639963a020c1
created by net/http.(*connReader).startBackgroundRead in goroutine 50
net/http/server.go:686 +0xb6
rax 0x0
rbx 0x70d9e8eb9fc0
rcx 0x70d9e8efe00b
rdx 0x0
rdi 0x2
rsi 0x7ffec1e72040
rbp 0x70d94eeb6303
rsp 0x7ffec1e72040
r8 0x0
r9 0x7ffec1e72040
r10 0x8
r11 0x246
r12 0x70d94eebee13
r13 0x1c1
r14 0x63996e7b6fd0
r15 0x63996e7b6fc0
rip 0x70d9e8efe00b
rflags 0x246
cs 0x33
fs 0x0
gs 0x0
[GIN] 2025/03/01 - 07:10:04 | 200 | 117.845349ms | 127.0.0.1 | POST "/api/chat"
^C% ➜ ~
This is on another terminal where I am running the model:
➜ ~ docker exec -it ollama ollama run phi4-mini
hi
Error: POST predict: Post "http://127.0.0.1:44845/completion": EOF
➜ ~
As you can see it exits from the prompt with the log dump as above. As phi4-mini has a requirement of 0.5.13 I could never run phi4-mini on 0.5.12, but, however with the same commands/step I was able to deepseek-r1:latest distilled models on 0.5.12.
OS: Docker container running on Linuxmint with rocm support
CPU/iGPU: 7735HS hence passing HSA_OVERRIDE_GFX_VERSION=10.3.0 and device driver location in docker run command
PS: If the OP issue and the issue I am facing are different (in my case its docker and able to load the model into GPU, can see with ollama ps command, however, added here as both are ROCm issue), I will create a new issue. Please let me know.
@githubdebugger commented on GitHub (Mar 2, 2025):
FYI: Upgraded to latest 0.5.13-rc4-rocm and I do not see this anymore in 0.5.13-rc4-rocm, the model loads fine in iGPU and everything works fine.
@ProjectMoon commented on GitHub (Mar 3, 2025):
OK, latest updates from 0.5.13 RC5:
Edit: With Granite, it also just seems to be stuck on "Image added!" with the little spinny progress indicator thing. I dunno if I should wait longer, but I let it spin for about 10 mins before killing the ollama runner.
Here are some debug logs. When the image is added, it just gets stuck at this point:
@ProjectMoon commented on GitHub (Mar 4, 2025):
This still seems to be an issue with the final version of 0.5.13.
Edit: And I'm not sure it has anything to do with ROCm. As the same seems to be happening on my machine with a small nvidia GPU. Thing just gets locked up; I would expect at least SOME response after a few minutes, even on CPU mode.
@ProjectMoon commented on GitHub (Mar 5, 2025):
Closing this in favor of #9514 because I'm pretty sure that's the problem (I have flash attention and q8_0 cache).