[GH-ISSUE #13803] AMD GPU 500 EOF Error #71101

Open
opened 2026-05-05 00:18:32 -05:00 by GiteaMirror · 20 comments

Originally created by @yogeshbabu on GitHub (Jan 20, 2026).
Original GitHub issue: https://github.com/ollama/ollama/issues/13803

What is the issue?

I started seeing this error after the latest Ubuntu updates. I tried uninstalling and reinstalling amdgpu as well as Ollama, but that did not help. I tried ROCm 6.4 as well as the latest 7.1.x.

ollama run gpt-oss
Error: 500 Internal Server Error: do load request: Post "http://127.0.0.1:40065/load": EOF

Relevant log output

Error: 500 Internal Server Error: do load request: Post "http://127.0.0.1:40065/load": EOF

OS

Ubuntu 24.04
Kernel: 6.14.0-1019-oem

GPU

AMD Strix Halo AI 395+
ROCm: 7.1.x

CPU

AMD Strix Halo AI 395+ 128GB (Framework)

Ollama version

0.14.x

GiteaMirror added the bug label 2026-05-05 00:18:32 -05:00

@rick-github commented on GitHub (Jan 20, 2026):

The server log (https://docs.ollama.com/troubleshooting) may aid in debugging.

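When a server log contains a Go crash dump, most of the goroutines listed are usually idle runtime workers, and the one that matters is easy to miss. The following is a minimal sketch, not part of Ollama, that filters such a dump down to the goroutines that are not parked in housekeeping states. The function names, the list of "idle" states, and the handling of a docker-compose style `name  | ` prefix are all assumptions made for illustration.

```python
import re

# Wait states treated as runtime housekeeping. This list is an assumption
# chosen for illustration, not an official Go classification.
IDLE_STATES = (
    "GC worker (idle)",
    "GC sweep wait",
    "GC scavenge wait",
    "force gc (idle)",
    "finalizer wait",
)

# Matches a goroutine header such as:
#   goroutine 57 gp=0xc000455500 m=0 mp=0x5599229453c0 [syscall]:
HEADER = re.compile(r"^goroutine (\d+) .*\[([^\]]+)\]:$")


def strip_prefix(line: str) -> str:
    """Drop a docker-compose style 'name  | ' prefix if one is present."""
    return re.sub(r"^\S+\s+\|\s?", "", line.rstrip("\n"))


def interesting_goroutines(log_text: str) -> list[tuple[int, str, list[str]]]:
    """Return (id, state, frames) for goroutines not in an idle state."""
    result = []
    current = None
    for raw in log_text.splitlines():
        line = strip_prefix(raw)
        match = HEADER.match(line)
        if match:
            current = (int(match.group(1)), match.group(2), [])
            # startswith also covers states with a trailing duration.
            if not match.group(2).startswith(IDLE_STATES):
                result.append(current)
            continue
        if not line.strip():
            current = None  # a blank line ends the current goroutine block
        elif current is not None:
            current[2].append(line.strip())
    return result
```

Assuming the log was saved to a file (the name `server.log` is hypothetical), `interesting_goroutines(open("server.log").read())` returns the remaining goroutines with their frames. A goroutine in the `syscall` state under a "signal arrived during cgo execution" header is a reasonable first place to look.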

@rickytato commented on GitHub (Jan 21, 2026):

I think I have the same problem on the same architecture, running ollama:rocm in Docker:

ollama  | SIGSEGV: segmentation violation
ollama  | PC=0x7f8400ac0bf5 m=0 sigcode=1 addr=0x18
ollama  | signal arrived during cgo execution
ollama  |
ollama  | goroutine 57 gp=0xc000455500 m=0 mp=0x5599229453c0 [syscall]:
ollama  | runtime.cgocall(0x5599217b4c30, 0xc0000bab58)
ollama  |       runtime/cgocall.go:167 +0x4b fp=0xc0000bab30 sp=0xc0000baaf8 pc=0x559920a1494b
ollama  | github.com/ollama/ollama/llama._Cfunc_llama_model_load_from_file(0x55992ae36e70, {0xc000398af0, 0x0, 0x23, 0x1, 0x0, 0xc0003987a8, 0x5599217b4420, 0xc000398ae8, 0x0, ...})
ollama  |       _cgo_gotypes.go:910 +0x4b fp=0xc0000bab58 sp=0xc0000bab30 pc=0x559920e15a4b
ollama  | github.com/ollama/ollama/llama.LoadModelFromFile.func1(...)
ollama  |       github.com/ollama/ollama/llama/llama.go:308
ollama  | github.com/ollama/ollama/llama.LoadModelFromFile({0x7fff73e03de1, 0x62}, {{0xc0003987e0, 0x1, 0x1}, 0x23, 0x0, 0x1, {0xc0003987a8, 0x1, ...}, ...})
ollama  |       github.com/ollama/ollama/llama/llama.go:308 +0x56d fp=0xc0000bada0 sp=0xc0000bab58 pc=0x559920e18dcd
ollama  | github.com/ollama/ollama/runner/llamarunner.(*Server).loadModel(0xc00063cc80, {{0xc0003987e0, 0x1, 0x1}, 0x23, 0x0, 0x1, {0xc0003987a8, 0x1, 0x2}, ...}, ...)
ollama  |       github.com/ollama/ollama/runner/llamarunner/runner.go:841 +0x9e fp=0xc0000baee8 sp=0xc0000bada0 pc=0x559920ed2d5e
ollama  | github.com/ollama/ollama/runner/llamarunner.(*Server).load.gowrap2()
ollama  |       github.com/ollama/ollama/runner/llamarunner/runner.go:934 +0x114 fp=0xc0000bafe0 sp=0xc0000baee8 pc=0x559920ed4054
ollama  | runtime.goexit({})
ollama  |       runtime/asm_amd64.s:1700 +0x1 fp=0xc0000bafe8 sp=0xc0000bafe0 pc=0x559920a1fc61
ollama  | created by github.com/ollama/ollama/runner/llamarunner.(*Server).load in goroutine 54
ollama  |       github.com/ollama/ollama/runner/llamarunner/runner.go:934 +0x889
ollama  |
ollama  | goroutine 1 gp=0xc000002380 m=nil [IO wait]:
ollama  | runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?)
ollama  |       runtime/proc.go:435 +0xce fp=0xc000587790 sp=0xc000587770 pc=0x559920a17dce
ollama  | runtime.netpollblock(0xc0005877e0?, 0x209b1506?, 0x99?)
ollama  |       runtime/netpoll.go:575 +0xf7 fp=0xc0005877c8 sp=0xc000587790 pc=0x5599209dd0f7
ollama  | internal/poll.runtime_pollWait(0x7f8419253de0, 0x72)
ollama  |       runtime/netpoll.go:351 +0x85 fp=0xc0005877e8 sp=0xc0005877c8 pc=0x559920a16fe5
ollama  | internal/poll.(*pollDesc).wait(0xc000624d00?, 0x900000036?, 0x0)
ollama  |       internal/poll/fd_poll_runtime.go:84 +0x27 fp=0xc000587810 sp=0xc0005877e8 pc=0x559920a9f167
ollama  | internal/poll.(*pollDesc).waitRead(...)
ollama  |       internal/poll/fd_poll_runtime.go:89
ollama  | internal/poll.(*FD).Accept(0xc000624d00)
ollama  |       internal/poll/fd_unix.go:620 +0x295 fp=0xc0005878b8 sp=0xc000587810 pc=0x559920aa4535
ollama  | net.(*netFD).accept(0xc000624d00)
ollama  |       net/fd_unix.go:172 +0x29 fp=0xc000587970 sp=0xc0005878b8 pc=0x559920b17549
ollama  | net.(*TCPListener).accept(0xc000695b40)
ollama  |       net/tcpsock_posix.go:159 +0x1b fp=0xc0005879c0 sp=0xc000587970 pc=0x559920b2d45b
ollama  | net.(*TCPListener).Accept(0xc000695b40)
ollama  |       net/tcpsock.go:380 +0x30 fp=0xc0005879f0 sp=0xc0005879c0 pc=0x559920b2c310
ollama  | net/http.(*onceCloseListener).Accept(0xc00063a480?)
ollama  |       <autogenerated>:1 +0x24 fp=0xc000587a08 sp=0xc0005879f0 pc=0x559920d43ea4
ollama  | net/http.(*Server).Serve(0xc000051300, {0x5599220398c0, 0xc000695b40})
ollama  |       net/http/server.go:3424 +0x30c fp=0xc000587b38 sp=0xc000587a08 pc=0x559920d1b76c
ollama  | github.com/ollama/ollama/runner/llamarunner.Execute({0xc000130140, 0x4, 0x4})
ollama  |       github.com/ollama/ollama/runner/llamarunner/runner.go:1002 +0x8f5 fp=0xc000587d08 sp=0xc000587b38 pc=0x559920ed4a15
ollama  | github.com/ollama/ollama/runner.Execute({0xc000130130?, 0x0?, 0x0?})
ollama  |       github.com/ollama/ollama/runner/runner.go:30 +0x13b fp=0xc000587d30 sp=0xc000587d08 pc=0x559920f8057b
ollama  | github.com/ollama/ollama/cmd.NewCLI.func2(0xc000051000?, {0x559921afd0d0?, 0x4?, 0x559921afd0d4?})
ollama  |       github.com/ollama/ollama/cmd/cmd.go:1881 +0x45 fp=0xc000587d58 sp=0xc000587d30 pc=0x559921744c65
ollama  | github.com/spf13/cobra.(*Command).execute(0xc00063f508, {0xc000695940, 0x4, 0x4})
ollama  |       github.com/spf13/cobra@v1.7.0/command.go:940 +0x85c fp=0xc000587e78 sp=0xc000587d58 pc=0x559920b914bc
ollama  | github.com/spf13/cobra.(*Command).ExecuteC(0xc000610908)
ollama  |       github.com/spf13/cobra@v1.7.0/command.go:1068 +0x3a5 fp=0xc000587f30 sp=0xc000587e78 pc=0x559920b91d05
ollama  | github.com/spf13/cobra.(*Command).Execute(...)
ollama  |       github.com/spf13/cobra@v1.7.0/command.go:992
ollama  | github.com/spf13/cobra.(*Command).ExecuteContext(...)
ollama  |       github.com/spf13/cobra@v1.7.0/command.go:985
ollama  | main.main()
ollama  |       github.com/ollama/ollama/main.go:12 +0x4d fp=0xc000587f50 sp=0xc000587f30 pc=0x55992174574d
ollama  | runtime.main()
ollama  |       runtime/proc.go:283 +0x29d fp=0xc000587fe0 sp=0xc000587f50 pc=0x5599209e477d
ollama  | runtime.goexit({})
ollama  |       runtime/asm_amd64.s:1700 +0x1 fp=0xc000587fe8 sp=0xc000587fe0 pc=0x559920a1fc61
ollama  |
ollama  | goroutine 2 gp=0xc000002e00 m=nil [force gc (idle)]:
ollama  | runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?)
ollama  |       runtime/proc.go:435 +0xce fp=0xc0000aafa8 sp=0xc0000aaf88 pc=0x559920a17dce
ollama  | runtime.goparkunlock(...)
ollama  |       runtime/proc.go:441
ollama  | runtime.forcegchelper()
ollama  |       runtime/proc.go:348 +0xb8 fp=0xc0000aafe0 sp=0xc0000aafa8 pc=0x5599209e4ab8
ollama  | runtime.goexit({})
ollama  |       runtime/asm_amd64.s:1700 +0x1 fp=0xc0000aafe8 sp=0xc0000aafe0 pc=0x559920a1fc61
ollama  | created by runtime.init.7 in goroutine 1
ollama  |       runtime/proc.go:336 +0x1a
ollama  |
ollama  | goroutine 3 gp=0xc000003340 m=nil [GC sweep wait]:
ollama  | runtime.gopark(0x1?, 0x0?, 0x0?, 0x0?, 0x0?)
ollama  |       runtime/proc.go:435 +0xce fp=0xc0000ab780 sp=0xc0000ab760 pc=0x559920a17dce
ollama  | runtime.goparkunlock(...)
ollama  |       runtime/proc.go:441
ollama  | runtime.bgsweep(0xc0000d6000)
ollama  |       runtime/mgcsweep.go:316 +0xdf fp=0xc0000ab7c8 sp=0xc0000ab780 pc=0x5599209cf25f
ollama  | runtime.gcenable.gowrap1()
ollama  |       runtime/mgc.go:204 +0x25 fp=0xc0000ab7e0 sp=0xc0000ab7c8 pc=0x5599209c3645
ollama  | runtime.goexit({})
ollama  |       runtime/asm_amd64.s:1700 +0x1 fp=0xc0000ab7e8 sp=0xc0000ab7e0 pc=0x559920a1fc61
ollama  | created by runtime.gcenable in goroutine 1
ollama  |       runtime/mgc.go:204 +0x66
ollama  |
ollama  | goroutine 4 gp=0xc000003500 m=nil [GC scavenge wait]:
ollama  | runtime.gopark(0x10000?, 0x559921cdb140?, 0x0?, 0x0?, 0x0?)
ollama  |       runtime/proc.go:435 +0xce fp=0xc0000abf78 sp=0xc0000abf58 pc=0x559920a17dce
ollama  | runtime.goparkunlock(...)
ollama  |       runtime/proc.go:441
ollama  | runtime.(*scavengerState).park(0x5599229425a0)
ollama  |       runtime/mgcscavenge.go:425 +0x49 fp=0xc0000abfa8 sp=0xc0000abf78 pc=0x5599209ccca9
ollama  | runtime.bgscavenge(0xc0000d6000)
ollama  |       runtime/mgcscavenge.go:658 +0x59 fp=0xc0000abfc8 sp=0xc0000abfa8 pc=0x5599209cd239
ollama  | runtime.gcenable.gowrap2()
ollama  |       runtime/mgc.go:205 +0x25 fp=0xc0000abfe0 sp=0xc0000abfc8 pc=0x5599209c35e5
ollama  | runtime.goexit({})
ollama  |       runtime/asm_amd64.s:1700 +0x1 fp=0xc0000abfe8 sp=0xc0000abfe0 pc=0x559920a1fc61
ollama  | created by runtime.gcenable in goroutine 1
ollama  |       runtime/mgc.go:205 +0xa5
ollama  |
ollama  | goroutine 18 gp=0xc000102700 m=nil [finalizer wait]:
ollama  | runtime.gopark(0x1b8?, 0xc000002380?, 0x1?, 0x23?, 0xc0000aa688?)
ollama  |       runtime/proc.go:435 +0xce fp=0xc0000aa630 sp=0xc0000aa610 pc=0x559920a17dce
ollama  | runtime.runfinq()
ollama  |       runtime/mfinal.go:196 +0x107 fp=0xc0000aa7e0 sp=0xc0000aa630 pc=0x5599209c2607
ollama  | runtime.goexit({})
ollama  |       runtime/asm_amd64.s:1700 +0x1 fp=0xc0000aa7e8 sp=0xc0000aa7e0 pc=0x559920a1fc61
ollama  | created by runtime.createfing in goroutine 1
ollama  |       runtime/mfinal.go:166 +0x3d
ollama  |
ollama  | goroutine 19 gp=0xc000103180 m=nil [chan receive]:
ollama  | runtime.gopark(0xc00022d540?, 0xc000588018?, 0x60?, 0x67?, 0x559920afe188?)
ollama  |       runtime/proc.go:435 +0xce fp=0xc0000a6718 sp=0xc0000a66f8 pc=0x559920a17dce
ollama  | runtime.chanrecv(0xc000110310, 0x0, 0x1)
ollama  |       runtime/chan.go:664 +0x445 fp=0xc0000a6790 sp=0xc0000a6718 pc=0x5599209b40e5
ollama  | runtime.chanrecv1(0x0?, 0x0?)
ollama  |       runtime/chan.go:506 +0x12 fp=0xc0000a67b8 sp=0xc0000a6790 pc=0x5599209b3c72
ollama  | runtime.unique_runtime_registerUniqueMapCleanup.func2(...)
ollama  |       runtime/mgc.go:1796
ollama  | runtime.unique_runtime_registerUniqueMapCleanup.gowrap1()
ollama  |       runtime/mgc.go:1799 +0x2f fp=0xc0000a67e0 sp=0xc0000a67b8 pc=0x5599209c67ef
ollama  | runtime.goexit({})
ollama  |       runtime/asm_amd64.s:1700 +0x1 fp=0xc0000a67e8 sp=0xc0000a67e0 pc=0x559920a1fc61
ollama  | created by unique.runtime_registerUniqueMapCleanup in goroutine 1
ollama  |       runtime/mgc.go:1794 +0x85
ollama  |
ollama  | goroutine 20 gp=0xc000103500 m=nil [GC worker (idle)]:
ollama  | runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?)
ollama  |       runtime/proc.go:435 +0xce fp=0xc0000a6f38 sp=0xc0000a6f18 pc=0x559920a17dce
ollama  | runtime.gcBgMarkWorker(0xc000111570)
ollama  |       runtime/mgc.go:1423 +0xe9 fp=0xc0000a6fc8 sp=0xc0000a6f38 pc=0x5599209c5b09
ollama  | runtime.gcBgMarkStartWorkers.gowrap1()
ollama  |       runtime/mgc.go:1339 +0x25 fp=0xc0000a6fe0 sp=0xc0000a6fc8 pc=0x5599209c59e5
ollama  | runtime.goexit({})
ollama  |       runtime/asm_amd64.s:1700 +0x1 fp=0xc0000a6fe8 sp=0xc0000a6fe0 pc=0x559920a1fc61
ollama  | created by runtime.gcBgMarkStartWorkers in goroutine 1
ollama  |       runtime/mgc.go:1339 +0x105
ollama  |
ollama  | goroutine 21 gp=0xc0001036c0 m=nil [GC worker (idle)]:
ollama  | runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?)
ollama  |       runtime/proc.go:435 +0xce fp=0xc0000a7738 sp=0xc0000a7718 pc=0x559920a17dce
ollama  | runtime.gcBgMarkWorker(0xc000111570)
ollama  |       runtime/mgc.go:1423 +0xe9 fp=0xc0000a77c8 sp=0xc0000a7738 pc=0x5599209c5b09
ollama  | runtime.gcBgMarkStartWorkers.gowrap1()
ollama  |       runtime/mgc.go:1339 +0x25 fp=0xc0000a77e0 sp=0xc0000a77c8 pc=0x5599209c59e5
ollama  | runtime.goexit({})
ollama  |       runtime/asm_amd64.s:1700 +0x1 fp=0xc0000a77e8 sp=0xc0000a77e0 pc=0x559920a1fc61
ollama  | created by runtime.gcBgMarkStartWorkers in goroutine 1
ollama  |       runtime/mgc.go:1339 +0x105
ollama  |
ollama  | goroutine 22 gp=0xc000103880 m=nil [GC worker (idle)]:
ollama  | runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?)
ollama  |       runtime/proc.go:435 +0xce fp=0xc0000a7f38 sp=0xc0000a7f18 pc=0x559920a17dce
ollama  | runtime.gcBgMarkWorker(0xc000111570)
ollama  |       runtime/mgc.go:1423 +0xe9 fp=0xc0000a7fc8 sp=0xc0000a7f38 pc=0x5599209c5b09
ollama  | runtime.gcBgMarkStartWorkers.gowrap1()
ollama  |       runtime/mgc.go:1339 +0x25 fp=0xc0000a7fe0 sp=0xc0000a7fc8 pc=0x5599209c59e5
ollama  | runtime.goexit({})
ollama  |       runtime/asm_amd64.s:1700 +0x1 fp=0xc0000a7fe8 sp=0xc0000a7fe0 pc=0x559920a1fc61
ollama  | created by runtime.gcBgMarkStartWorkers in goroutine 1
ollama  |       runtime/mgc.go:1339 +0x105
ollama  |
ollama  | goroutine 5 gp=0xc000003dc0 m=nil [GC worker (idle)]:
ollama  | runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?)
ollama  |       runtime/proc.go:435 +0xce fp=0xc0000ac738 sp=0xc0000ac718 pc=0x559920a17dce
ollama  | runtime.gcBgMarkWorker(0xc000111570)
ollama  |       runtime/mgc.go:1423 +0xe9 fp=0xc0000ac7c8 sp=0xc0000ac738 pc=0x5599209c5b09
ollama  | runtime.gcBgMarkStartWorkers.gowrap1()
ollama  |       runtime/mgc.go:1339 +0x25 fp=0xc0000ac7e0 sp=0xc0000ac7c8 pc=0x5599209c59e5
ollama  | runtime.goexit({})
ollama  |       runtime/asm_amd64.s:1700 +0x1 fp=0xc0000ac7e8 sp=0xc0000ac7e0 pc=0x559920a1fc61
ollama  | created by runtime.gcBgMarkStartWorkers in goroutine 1
ollama  |       runtime/mgc.go:1339 +0x105
ollama  |
ollama  | goroutine 6 gp=0xc0000ec000 m=nil [GC worker (idle)]:
ollama  | runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?)
ollama  |       runtime/proc.go:435 +0xce fp=0xc0000acf38 sp=0xc0000acf18 pc=0x559920a17dce
ollama  | runtime.gcBgMarkWorker(0xc000111570)
ollama  |       runtime/mgc.go:1423 +0xe9 fp=0xc0000acfc8 sp=0xc0000acf38 pc=0x5599209c5b09
ollama  | runtime.gcBgMarkStartWorkers.gowrap1()
ollama  |       runtime/mgc.go:1339 +0x25 fp=0xc0000acfe0 sp=0xc0000acfc8 pc=0x5599209c59e5
ollama  | runtime.goexit({})
ollama  |       runtime/asm_amd64.s:1700 +0x1 fp=0xc0000acfe8 sp=0xc0000acfe0 pc=0x559920a1fc61
ollama  | created by runtime.gcBgMarkStartWorkers in goroutine 1
ollama  |       runtime/mgc.go:1339 +0x105
ollama  |
ollama  | goroutine 7 gp=0xc0000ec1c0 m=nil [GC worker (idle)]:
ollama  | runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?)
ollama  |       runtime/proc.go:435 +0xce fp=0xc0000ad738 sp=0xc0000ad718 pc=0x559920a17dce
ollama  | runtime.gcBgMarkWorker(0xc000111570)
ollama  |       runtime/mgc.go:1423 +0xe9 fp=0xc0000ad7c8 sp=0xc0000ad738 pc=0x5599209c5b09
ollama  | runtime.gcBgMarkStartWorkers.gowrap1()
ollama  |       runtime/mgc.go:1339 +0x25 fp=0xc0000ad7e0 sp=0xc0000ad7c8 pc=0x5599209c59e5
ollama  | runtime.goexit({})
ollama  |       runtime/asm_amd64.s:1700 +0x1 fp=0xc0000ad7e8 sp=0xc0000ad7e0 pc=0x559920a1fc61
ollama  | created by runtime.gcBgMarkStartWorkers in goroutine 1
ollama  |       runtime/mgc.go:1339 +0x105
ollama  |
ollama  | goroutine 8 gp=0xc0000ec380 m=nil [GC worker (idle)]:
ollama  | runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?)
ollama  |       runtime/proc.go:435 +0xce fp=0xc0000adf38 sp=0xc0000adf18 pc=0x559920a17dce
ollama  | runtime.gcBgMarkWorker(0xc000111570)
ollama  |       runtime/mgc.go:1423 +0xe9 fp=0xc0000adfc8 sp=0xc0000adf38 pc=0x5599209c5b09
ollama  | runtime.gcBgMarkStartWorkers.gowrap1()
ollama  |       runtime/mgc.go:1339 +0x25 fp=0xc0000adfe0 sp=0xc0000adfc8 pc=0x5599209c59e5
ollama  | runtime.goexit({})
ollama  |       runtime/asm_amd64.s:1700 +0x1 fp=0xc0000adfe8 sp=0xc0000adfe0 pc=0x559920a1fc61
ollama  | created by runtime.gcBgMarkStartWorkers in goroutine 1
ollama  |       runtime/mgc.go:1339 +0x105
ollama  |
ollama  | goroutine 9 gp=0xc0000ec540 m=nil [GC worker (idle)]:
ollama  | runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?)
ollama  |       runtime/proc.go:435 +0xce fp=0xc0000f2738 sp=0xc0000f2718 pc=0x559920a17dce
ollama  | runtime.gcBgMarkWorker(0xc000111570)
ollama  |       runtime/mgc.go:1423 +0xe9 fp=0xc0000f27c8 sp=0xc0000f2738 pc=0x5599209c5b09
ollama  | runtime.gcBgMarkStartWorkers.gowrap1()
ollama  |       runtime/mgc.go:1339 +0x25 fp=0xc0000f27e0 sp=0xc0000f27c8 pc=0x5599209c59e5
ollama  | runtime.goexit({})
ollama  |       runtime/asm_amd64.s:1700 +0x1 fp=0xc0000f27e8 sp=0xc0000f27e0 pc=0x559920a1fc61
ollama  | created by runtime.gcBgMarkStartWorkers in goroutine 1
ollama  |       runtime/mgc.go:1339 +0x105
ollama  |
ollama  | goroutine 10 gp=0xc0000ec700 m=nil [GC worker (idle)]:
ollama  | runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?)
ollama  |       runtime/proc.go:435 +0xce fp=0xc0000f2f38 sp=0xc0000f2f18 pc=0x559920a17dce
ollama  | runtime.gcBgMarkWorker(0xc000111570)
ollama  |       runtime/mgc.go:1423 +0xe9 fp=0xc0000f2fc8 sp=0xc0000f2f38 pc=0x5599209c5b09
ollama  | runtime.gcBgMarkStartWorkers.gowrap1()
ollama  |       runtime/mgc.go:1339 +0x25 fp=0xc0000f2fe0 sp=0xc0000f2fc8 pc=0x5599209c59e5
ollama  | runtime.goexit({})
ollama  |       runtime/asm_amd64.s:1700 +0x1 fp=0xc0000f2fe8 sp=0xc0000f2fe0 pc=0x559920a1fc61
ollama  | created by runtime.gcBgMarkStartWorkers in goroutine 1
ollama  |       runtime/mgc.go:1339 +0x105
ollama  |
ollama  | goroutine 11 gp=0xc0000ec8c0 m=nil [GC worker (idle)]:
ollama  | runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?)
ollama  |       runtime/proc.go:435 +0xce fp=0xc0000f3738 sp=0xc0000f3718 pc=0x559920a17dce
ollama  | runtime.gcBgMarkWorker(0xc000111570)
ollama  |       runtime/mgc.go:1423 +0xe9 fp=0xc0000f37c8 sp=0xc0000f3738 pc=0x5599209c5b09
ollama  | runtime.gcBgMarkStartWorkers.gowrap1()
ollama  |       runtime/mgc.go:1339 +0x25 fp=0xc0000f37e0 sp=0xc0000f37c8 pc=0x5599209c59e5
ollama  | runtime.goexit({})
ollama  |       runtime/asm_amd64.s:1700 +0x1 fp=0xc0000f37e8 sp=0xc0000f37e0 pc=0x559920a1fc61
ollama  | created by runtime.gcBgMarkStartWorkers in goroutine 1
ollama  |       runtime/mgc.go:1339 +0x105
ollama  |
ollama  | goroutine 12 gp=0xc0000eca80 m=nil [GC worker (idle)]:
ollama  | runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?)
ollama  |       runtime/proc.go:435 +0xce fp=0xc0000f3f38 sp=0xc0000f3f18 pc=0x559920a17dce
ollama  | runtime.gcBgMarkWorker(0xc000111570)
ollama  |       runtime/mgc.go:1423 +0xe9 fp=0xc0000f3fc8 sp=0xc0000f3f38 pc=0x5599209c5b09
ollama  | runtime.gcBgMarkStartWorkers.gowrap1()
ollama  |       runtime/mgc.go:1339 +0x25 fp=0xc0000f3fe0 sp=0xc0000f3fc8 pc=0x5599209c59e5
ollama  | runtime.goexit({})
ollama  |       runtime/asm_amd64.s:1700 +0x1 fp=0xc0000f3fe8 sp=0xc0000f3fe0 pc=0x559920a1fc61
ollama  | created by runtime.gcBgMarkStartWorkers in goroutine 1
ollama  |       runtime/mgc.go:1339 +0x105
ollama  |
ollama  | goroutine 23 gp=0xc000103a40 m=nil [GC worker (idle)]:
ollama  | runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?)
ollama  |       runtime/proc.go:435 +0xce fp=0xc0000a8738 sp=0xc0000a8718 pc=0x559920a17dce
ollama  | runtime.gcBgMarkWorker(0xc000111570)
ollama  |       runtime/mgc.go:1423 +0xe9 fp=0xc0000a87c8 sp=0xc0000a8738 pc=0x5599209c5b09
ollama  | runtime.gcBgMarkStartWorkers.gowrap1()
ollama  |       runtime/mgc.go:1339 +0x25 fp=0xc0000a87e0 sp=0xc0000a87c8 pc=0x5599209c59e5
ollama  | runtime.goexit({})
ollama  |       runtime/asm_amd64.s:1700 +0x1 fp=0xc0000a87e8 sp=0xc0000a87e0 pc=0x559920a1fc61
ollama  | created by runtime.gcBgMarkStartWorkers in goroutine 1
ollama  |       runtime/mgc.go:1339 +0x105
ollama  |
ollama  | goroutine 24 gp=0xc000103c00 m=nil [GC worker (idle)]:
ollama  | runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?)
ollama  |       runtime/proc.go:435 +0xce fp=0xc0000a8f38 sp=0xc0000a8f18 pc=0x559920a17dce
ollama  | runtime.gcBgMarkWorker(0xc000111570)
ollama  |       runtime/mgc.go:1423 +0xe9 fp=0xc0000a8fc8 sp=0xc0000a8f38 pc=0x5599209c5b09
ollama  | runtime.gcBgMarkStartWorkers.gowrap1()
ollama  |       runtime/mgc.go:1339 +0x25 fp=0xc0000a8fe0 sp=0xc0000a8fc8 pc=0x5599209c59e5
ollama  | runtime.goexit({})
ollama  |       runtime/asm_amd64.s:1700 +0x1 fp=0xc0000a8fe8 sp=0xc0000a8fe0 pc=0x559920a1fc61
ollama  | created by runtime.gcBgMarkStartWorkers in goroutine 1
ollama  |       runtime/mgc.go:1339 +0x105
ollama  |
ollama  | goroutine 25 gp=0xc000103dc0 m=nil [GC worker (idle)]:
ollama  | runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?)
ollama  |       runtime/proc.go:435 +0xce fp=0xc0000a9738 sp=0xc0000a9718 pc=0x559920a17dce
ollama  | runtime.gcBgMarkWorker(0xc000111570)
ollama  |       runtime/mgc.go:1423 +0xe9 fp=0xc0000a97c8 sp=0xc0000a9738 pc=0x5599209c5b09
ollama  | runtime.gcBgMarkStartWorkers.gowrap1()
ollama  |       runtime/mgc.go:1339 +0x25 fp=0xc0000a97e0 sp=0xc0000a97c8 pc=0x5599209c59e5
ollama  | runtime.goexit({})
ollama  |       runtime/asm_amd64.s:1700 +0x1 fp=0xc0000a97e8 sp=0xc0000a97e0 pc=0x559920a1fc61
ollama  | created by runtime.gcBgMarkStartWorkers in goroutine 1
ollama  |       runtime/mgc.go:1339 +0x105
ollama  |
ollama  | goroutine 26 gp=0xc000454000 m=nil [GC worker (idle)]:
ollama  | runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?)
ollama  |       runtime/proc.go:435 +0xce fp=0xc0000a9f38 sp=0xc0000a9f18 pc=0x559920a17dce
ollama  | runtime.gcBgMarkWorker(0xc000111570)
ollama  |       runtime/mgc.go:1423 +0xe9 fp=0xc0000a9fc8 sp=0xc0000a9f38 pc=0x5599209c5b09
ollama  | runtime.gcBgMarkStartWorkers.gowrap1()
ollama  |       runtime/mgc.go:1339 +0x25 fp=0xc0000a9fe0 sp=0xc0000a9fc8 pc=0x5599209c59e5
ollama  | runtime.goexit({})
ollama  |       runtime/asm_amd64.s:1700 +0x1 fp=0xc0000a9fe8 sp=0xc0000a9fe0 pc=0x559920a1fc61
ollama  | created by runtime.gcBgMarkStartWorkers in goroutine 1
ollama  |       runtime/mgc.go:1339 +0x105
ollama  |
ollama  | goroutine 27 gp=0xc0004541c0 m=nil [GC worker (idle)]:
ollama  | runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?)
ollama  |       runtime/proc.go:435 +0xce fp=0xc0000ee738 sp=0xc0000ee718 pc=0x559920a17dce
ollama  | runtime.gcBgMarkWorker(0xc000111570)
ollama  |       runtime/mgc.go:1423 +0xe9 fp=0xc0000ee7c8 sp=0xc0000ee738 pc=0x5599209c5b09
ollama  | runtime.gcBgMarkStartWorkers.gowrap1()
ollama  |       runtime/mgc.go:1339 +0x25 fp=0xc0000ee7e0 sp=0xc0000ee7c8 pc=0x5599209c59e5
ollama  | runtime.goexit({})
ollama  |       runtime/asm_amd64.s:1700 +0x1 fp=0xc0000ee7e8 sp=0xc0000ee7e0 pc=0x559920a1fc61
ollama  | created by runtime.gcBgMarkStartWorkers in goroutine 1
ollama  |       runtime/mgc.go:1339 +0x105
ollama  |
ollama  | goroutine 28 gp=0xc000454380 m=nil [GC worker (idle)]:
ollama  | runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?)
ollama  |       runtime/proc.go:435 +0xce fp=0xc0000eef38 sp=0xc0000eef18 pc=0x559920a17dce
ollama  | runtime.gcBgMarkWorker(0xc000111570)
ollama  |       runtime/mgc.go:1423 +0xe9 fp=0xc0000eefc8 sp=0xc0000eef38 pc=0x5599209c5b09
ollama  | runtime.gcBgMarkStartWorkers.gowrap1()
ollama  |       runtime/mgc.go:1339 +0x25 fp=0xc0000eefe0 sp=0xc0000eefc8 pc=0x5599209c59e5
ollama  | runtime.goexit({})
ollama  |       runtime/asm_amd64.s:1700 +0x1 fp=0xc0000eefe8 sp=0xc0000eefe0 pc=0x559920a1fc61
ollama  | created by runtime.gcBgMarkStartWorkers in goroutine 1
ollama  |       runtime/mgc.go:1339 +0x105
ollama  |
ollama  | goroutine 34 gp=0xc000484000 m=nil [GC worker (idle)]:
ollama  | runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?)
ollama  |       runtime/proc.go:435 +0xce fp=0xc00048a738 sp=0xc00048a718 pc=0x559920a17dce
ollama  | runtime.gcBgMarkWorker(0xc000111570)
ollama  |       runtime/mgc.go:1423 +0xe9 fp=0xc00048a7c8 sp=0xc00048a738 pc=0x5599209c5b09
ollama  | runtime.gcBgMarkStartWorkers.gowrap1()
ollama  |       runtime/mgc.go:1339 +0x25 fp=0xc00048a7e0 sp=0xc00048a7c8 pc=0x5599209c59e5
ollama  | runtime.goexit({})
ollama  |       runtime/asm_amd64.s:1700 +0x1 fp=0xc00048a7e8 sp=0xc00048a7e0 pc=0x559920a1fc61
ollama  | created by runtime.gcBgMarkStartWorkers in goroutine 1
ollama  |       runtime/mgc.go:1339 +0x105
ollama  |
ollama  | goroutine 13 gp=0xc0000ecc40 m=nil [GC worker (idle)]:
ollama  | runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?)
ollama  |       runtime/proc.go:435 +0xce fp=0xc0000f4738 sp=0xc0000f4718 pc=0x559920a17dce
ollama  | runtime.gcBgMarkWorker(0xc000111570)
ollama  |       runtime/mgc.go:1423 +0xe9 fp=0xc0000f47c8 sp=0xc0000f4738 pc=0x5599209c5b09
ollama  | runtime.gcBgMarkStartWorkers.gowrap1()
ollama  |       runtime/mgc.go:1339 +0x25 fp=0xc0000f47e0 sp=0xc0000f47c8 pc=0x5599209c59e5
ollama  | runtime.goexit({})
ollama  |       runtime/asm_amd64.s:1700 +0x1 fp=0xc0000f47e8 sp=0xc0000f47e0 pc=0x559920a1fc61
ollama  | created by runtime.gcBgMarkStartWorkers in goroutine 1
ollama  |       runtime/mgc.go:1339 +0x105
ollama  |
ollama  | goroutine 14 gp=0xc0000ece00 m=nil [GC worker (idle)]:
ollama  | runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?)
ollama  |       runtime/proc.go:435 +0xce fp=0xc0000f4f38 sp=0xc0000f4f18 pc=0x559920a17dce
ollama  | runtime.gcBgMarkWorker(0xc000111570)
ollama  |       runtime/mgc.go:1423 +0xe9 fp=0xc0000f4fc8 sp=0xc0000f4f38 pc=0x5599209c5b09
ollama  | runtime.gcBgMarkStartWorkers.gowrap1()
ollama  |       runtime/mgc.go:1339 +0x25 fp=0xc0000f4fe0 sp=0xc0000f4fc8 pc=0x5599209c59e5
ollama  | runtime.goexit({})
ollama  |       runtime/asm_amd64.s:1700 +0x1 fp=0xc0000f4fe8 sp=0xc0000f4fe0 pc=0x559920a1fc61
ollama  | created by runtime.gcBgMarkStartWorkers in goroutine 1
ollama  |       runtime/mgc.go:1339 +0x105
ollama  |
ollama  | goroutine 15 gp=0xc0000ecfc0 m=nil [GC worker (idle)]:
ollama  | runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?)
ollama  |       runtime/proc.go:435 +0xce fp=0xc0000f5738 sp=0xc0000f5718 pc=0x559920a17dce
ollama  | runtime.gcBgMarkWorker(0xc000111570)
ollama  |       runtime/mgc.go:1423 +0xe9 fp=0xc0000f57c8 sp=0xc0000f5738 pc=0x5599209c5b09
ollama  | runtime.gcBgMarkStartWorkers.gowrap1()
ollama  |       runtime/mgc.go:1339 +0x25 fp=0xc0000f57e0 sp=0xc0000f57c8 pc=0x5599209c59e5
ollama  | runtime.goexit({})
ollama  |       runtime/asm_amd64.s:1700 +0x1 fp=0xc0000f57e8 sp=0xc0000f57e0 pc=0x559920a1fc61
ollama  | created by runtime.gcBgMarkStartWorkers in goroutine 1
ollama  |       runtime/mgc.go:1339 +0x105
ollama  |
ollama  | goroutine 16 gp=0xc0000ed180 m=nil [GC worker (idle)]:
ollama  | runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?)
ollama  |       runtime/proc.go:435 +0xce fp=0xc0000f5f38 sp=0xc0000f5f18 pc=0x559920a17dce
ollama  | runtime.gcBgMarkWorker(0xc000111570)
ollama  |       runtime/mgc.go:1423 +0xe9 fp=0xc0000f5fc8 sp=0xc0000f5f38 pc=0x5599209c5b09
ollama  | runtime.gcBgMarkStartWorkers.gowrap1()
ollama  |       runtime/mgc.go:1339 +0x25 fp=0xc0000f5fe0 sp=0xc0000f5fc8 pc=0x5599209c59e5
ollama  | runtime.goexit({})
ollama  |       runtime/asm_amd64.s:1700 +0x1 fp=0xc0000f5fe8 sp=0xc0000f5fe0 pc=0x559920a1fc61
ollama  | created by runtime.gcBgMarkStartWorkers in goroutine 1
ollama  |       runtime/mgc.go:1339 +0x105
ollama  |
ollama  | goroutine 50 gp=0xc0000ed340 m=nil [GC worker (idle)]:
ollama  | runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?)
ollama  |       runtime/proc.go:435 +0xce fp=0xc000486738 sp=0xc000486718 pc=0x559920a17dce
ollama  | runtime.gcBgMarkWorker(0xc000111570)
ollama  |       runtime/mgc.go:1423 +0xe9 fp=0xc0004867c8 sp=0xc000486738 pc=0x5599209c5b09
ollama  | runtime.gcBgMarkStartWorkers.gowrap1()
ollama  |       runtime/mgc.go:1339 +0x25 fp=0xc0004867e0 sp=0xc0004867c8 pc=0x5599209c59e5
ollama  | runtime.goexit({})
ollama  |       runtime/asm_amd64.s:1700 +0x1 fp=0xc0004867e8 sp=0xc0004867e0 pc=0x559920a1fc61
ollama  | created by runtime.gcBgMarkStartWorkers in goroutine 1
ollama  |       runtime/mgc.go:1339 +0x105
ollama  |
ollama  | goroutine 51 gp=0xc0000ed500 m=nil [GC worker (idle)]:
ollama  | runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?)
ollama  |       runtime/proc.go:435 +0xce fp=0xc000486f38 sp=0xc000486f18 pc=0x559920a17dce
ollama  | runtime.gcBgMarkWorker(0xc000111570)
ollama  |       runtime/mgc.go:1423 +0xe9 fp=0xc000486fc8 sp=0xc000486f38 pc=0x5599209c5b09
ollama  | runtime.gcBgMarkStartWorkers.gowrap1()
ollama  |       runtime/mgc.go:1339 +0x25 fp=0xc000486fe0 sp=0xc000486fc8 pc=0x5599209c59e5
ollama  | runtime.goexit({})
ollama  |       runtime/asm_amd64.s:1700 +0x1 fp=0xc000486fe8 sp=0xc000486fe0 pc=0x559920a1fc61
ollama  | created by runtime.gcBgMarkStartWorkers in goroutine 1
ollama  |       runtime/mgc.go:1339 +0x105
ollama  |
ollama  | goroutine 52 gp=0xc0000ed6c0 m=nil [GC worker (idle)]:
ollama  | runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?)
ollama  |       runtime/proc.go:435 +0xce fp=0xc000487738 sp=0xc000487718 pc=0x559920a17dce
ollama  | runtime.gcBgMarkWorker(0xc000111570)
ollama  |       runtime/mgc.go:1423 +0xe9 fp=0xc0004877c8 sp=0xc000487738 pc=0x5599209c5b09
ollama  | runtime.gcBgMarkStartWorkers.gowrap1()
ollama  |       runtime/mgc.go:1339 +0x25 fp=0xc0004877e0 sp=0xc0004877c8 pc=0x5599209c59e5
ollama  | runtime.goexit({})
ollama  |       runtime/asm_amd64.s:1700 +0x1 fp=0xc0004877e8 sp=0xc0004877e0 pc=0x559920a1fc61
ollama  | created by runtime.gcBgMarkStartWorkers in goroutine 1
ollama  |       runtime/mgc.go:1339 +0x105
ollama  |
ollama  | goroutine 29 gp=0xc000454540 m=nil [GC worker (idle)]:
ollama  | runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?)
ollama  |       runtime/proc.go:435 +0xce fp=0xc0000ef738 sp=0xc0000ef718 pc=0x559920a17dce
ollama  | runtime.gcBgMarkWorker(0xc000111570)
ollama  |       runtime/mgc.go:1423 +0xe9 fp=0xc0000ef7c8 sp=0xc0000ef738 pc=0x5599209c5b09
ollama  | runtime.gcBgMarkStartWorkers.gowrap1()
ollama  |       runtime/mgc.go:1339 +0x25 fp=0xc0000ef7e0 sp=0xc0000ef7c8 pc=0x5599209c59e5
ollama  | runtime.goexit({})
ollama  |       runtime/asm_amd64.s:1700 +0x1 fp=0xc0000ef7e8 sp=0xc0000ef7e0 pc=0x559920a1fc61
ollama  | created by runtime.gcBgMarkStartWorkers in goroutine 1
ollama  |       runtime/mgc.go:1339 +0x105
ollama  |
ollama  | goroutine 35 gp=0xc0004841c0 m=nil [GC worker (idle)]:
ollama  | runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?)
ollama  |       runtime/proc.go:435 +0xce fp=0xc00048af38 sp=0xc00048af18 pc=0x559920a17dce
ollama  | runtime.gcBgMarkWorker(0xc000111570)
ollama  |       runtime/mgc.go:1423 +0xe9 fp=0xc00048afc8 sp=0xc00048af38 pc=0x5599209c5b09
ollama  | runtime.gcBgMarkStartWorkers.gowrap1()
ollama  |       runtime/mgc.go:1339 +0x25 fp=0xc00048afe0 sp=0xc00048afc8 pc=0x5599209c59e5
ollama  | runtime.goexit({})
ollama  |       runtime/asm_amd64.s:1700 +0x1 fp=0xc00048afe8 sp=0xc00048afe0 pc=0x559920a1fc61
ollama  | created by runtime.gcBgMarkStartWorkers in goroutine 1
ollama  |       runtime/mgc.go:1339 +0x105
ollama  |
ollama  | goroutine 36 gp=0xc000484380 m=nil [GC worker (idle)]:
ollama  | runtime.gopark(0x559922a10a80?, 0x1?, 0x3c?, 0x0?, 0x0?)
ollama  |       runtime/proc.go:435 +0xce fp=0xc00048b738 sp=0xc00048b718 pc=0x559920a17dce
ollama  | runtime.gcBgMarkWorker(0xc000111570)
ollama  |       runtime/mgc.go:1423 +0xe9 fp=0xc00048b7c8 sp=0xc00048b738 pc=0x5599209c5b09
ollama  | runtime.gcBgMarkStartWorkers.gowrap1()
ollama  |       runtime/mgc.go:1339 +0x25 fp=0xc00048b7e0 sp=0xc00048b7c8 pc=0x5599209c59e5
ollama  | runtime.goexit({})
ollama  |       runtime/asm_amd64.s:1700 +0x1 fp=0xc00048b7e8 sp=0xc00048b7e0 pc=0x559920a1fc61
ollama  | created by runtime.gcBgMarkStartWorkers in goroutine 1
ollama  |       runtime/mgc.go:1339 +0x105
ollama  |
ollama  | goroutine 37 gp=0xc000484540 m=nil [GC worker (idle)]:
ollama  | runtime.gopark(0x363a0ac139?, 0x1?, 0x77?, 0x83?, 0x0?)
ollama  |       runtime/proc.go:435 +0xce fp=0xc00048bf38 sp=0xc00048bf18 pc=0x559920a17dce
ollama  | runtime.gcBgMarkWorker(0xc000111570)
ollama  |       runtime/mgc.go:1423 +0xe9 fp=0xc00048bfc8 sp=0xc00048bf38 pc=0x5599209c5b09
ollama  | runtime.gcBgMarkStartWorkers.gowrap1()
ollama  |       runtime/mgc.go:1339 +0x25 fp=0xc00048bfe0 sp=0xc00048bfc8 pc=0x5599209c59e5
ollama  | runtime.goexit({})
ollama  |       runtime/asm_amd64.s:1700 +0x1 fp=0xc00048bfe8 sp=0xc00048bfe0 pc=0x559920a1fc61
ollama  | created by runtime.gcBgMarkStartWorkers in goroutine 1
ollama  |       runtime/mgc.go:1339 +0x105
ollama  |
ollama  | goroutine 38 gp=0xc000484700 m=nil [GC worker (idle)]:
ollama  | runtime.gopark(0x363a0ad71a?, 0x1?, 0x37?, 0x44?, 0x0?)
ollama  |       runtime/proc.go:435 +0xce fp=0xc00048c738 sp=0xc00048c718 pc=0x559920a17dce
ollama  | runtime.gcBgMarkWorker(0xc000111570)
ollama  |       runtime/mgc.go:1423 +0xe9 fp=0xc00048c7c8 sp=0xc00048c738 pc=0x5599209c5b09
ollama  | runtime.gcBgMarkStartWorkers.gowrap1()
ollama  |       runtime/mgc.go:1339 +0x25 fp=0xc00048c7e0 sp=0xc00048c7c8 pc=0x5599209c59e5
ollama  | runtime.goexit({})
ollama  |       runtime/asm_amd64.s:1700 +0x1 fp=0xc00048c7e8 sp=0xc00048c7e0 pc=0x559920a1fc61
ollama  | created by runtime.gcBgMarkStartWorkers in goroutine 1
ollama  |       runtime/mgc.go:1339 +0x105
ollama  |
ollama  | goroutine 39 gp=0xc0004848c0 m=nil [GC worker (idle)]:
ollama  | runtime.gopark(0x363a0adae6?, 0x1?, 0xa4?, 0x29?, 0x0?)
ollama  |       runtime/proc.go:435 +0xce fp=0xc00048cf38 sp=0xc00048cf18 pc=0x559920a17dce
ollama  | runtime.gcBgMarkWorker(0xc000111570)
ollama  |       runtime/mgc.go:1423 +0xe9 fp=0xc00048cfc8 sp=0xc00048cf38 pc=0x5599209c5b09
ollama  | runtime.gcBgMarkStartWorkers.gowrap1()
ollama  |       runtime/mgc.go:1339 +0x25 fp=0xc00048cfe0 sp=0xc00048cfc8 pc=0x5599209c59e5
ollama  | runtime.goexit({})
ollama  |       runtime/asm_amd64.s:1700 +0x1 fp=0xc00048cfe8 sp=0xc00048cfe0 pc=0x559920a1fc61
ollama  | created by runtime.gcBgMarkStartWorkers in goroutine 1
ollama  |       runtime/mgc.go:1339 +0x105
ollama  |
ollama  | goroutine 40 gp=0xc000484a80 m=nil [GC worker (idle)]:
ollama  | runtime.gopark(0x559922a10a80?, 0x1?, 0x3c?, 0x76?, 0x0?)
ollama  |       runtime/proc.go:435 +0xce fp=0xc00048d738 sp=0xc00048d718 pc=0x559920a17dce
ollama  | runtime.gcBgMarkWorker(0xc000111570)
ollama  |       runtime/mgc.go:1423 +0xe9 fp=0xc00048d7c8 sp=0xc00048d738 pc=0x5599209c5b09
ollama  | runtime.gcBgMarkStartWorkers.gowrap1()
ollama  |       runtime/mgc.go:1339 +0x25 fp=0xc00048d7e0 sp=0xc00048d7c8 pc=0x5599209c59e5
ollama  | runtime.goexit({})
ollama  |       runtime/asm_amd64.s:1700 +0x1 fp=0xc00048d7e8 sp=0xc00048d7e0 pc=0x559920a1fc61
ollama  | created by runtime.gcBgMarkStartWorkers in goroutine 1
ollama  |       runtime/mgc.go:1339 +0x105
ollama  |
ollama  | goroutine 53 gp=0xc000454fc0 m=nil [sync.WaitGroup.Wait]:
ollama  | runtime.gopark(0x0?, 0x0?, 0x0?, 0x46?, 0x0?)
ollama  |       runtime/proc.go:435 +0xce fp=0xc000489620 sp=0xc000489600 pc=0x559920a17dce
ollama  | runtime.goparkunlock(...)
ollama  |       runtime/proc.go:441
ollama  | runtime.semacquire1(0xc00063cca0, 0x0, 0x1, 0x0, 0x18)
ollama  |       runtime/sema.go:188 +0x229 fp=0xc000489688 sp=0xc000489620 pc=0x5599209f7d49
ollama  | sync.runtime_SemacquireWaitGroup(0x0?)
ollama  |       runtime/sema.go:110 +0x25 fp=0xc0004896c0 sp=0xc000489688 pc=0x559920a19705
ollama  | sync.(*WaitGroup).Wait(0x0?)
ollama  |       sync/waitgroup.go:118 +0x48 fp=0xc0004896e8 sp=0xc0004896c0 pc=0x559920a2b6c8
ollama  | github.com/ollama/ollama/runner/llamarunner.(*Server).run(0xc00063cc80, {0x55992203bee0, 0xc00069b900})
ollama  |       github.com/ollama/ollama/runner/llamarunner/runner.go:360 +0x4b fp=0xc0004897b8 sp=0xc0004896e8 pc=0x559920ecf72b
ollama  | github.com/ollama/ollama/runner/llamarunner.Execute.gowrap1()
ollama  |       github.com/ollama/ollama/runner/llamarunner/runner.go:981 +0x28 fp=0xc0004897e0 sp=0xc0004897b8 pc=0x559920ed4c88
ollama  | runtime.goexit({})
ollama  |       runtime/asm_amd64.s:1700 +0x1 fp=0xc0004897e8 sp=0xc0004897e0 pc=0x559920a1fc61
ollama  | created by github.com/ollama/ollama/runner/llamarunner.Execute in goroutine 1
ollama  |       github.com/ollama/ollama/runner/llamarunner/runner.go:981 +0x4c5
ollama  |
ollama  | goroutine 54 gp=0xc000455180 m=nil [IO wait]:
ollama  | runtime.gopark(0x559920aa2765?, 0xc000624d80?, 0x40?, 0x9a?, 0xb?)
ollama  |       runtime/proc.go:435 +0xce fp=0xc000049948 sp=0xc000049928 pc=0x559920a17dce
ollama  | runtime.netpollblock(0x559920a3b598?, 0x209b1506?, 0x99?)
ollama  |       runtime/netpoll.go:575 +0xf7 fp=0xc000049980 sp=0xc000049948 pc=0x5599209dd0f7
ollama  | internal/poll.runtime_pollWait(0x7f8419253cc8, 0x72)
ollama  |       runtime/netpoll.go:351 +0x85 fp=0xc0000499a0 sp=0xc000049980 pc=0x559920a16fe5
ollama  | internal/poll.(*pollDesc).wait(0xc000624d80?, 0xc000666000?, 0x0)
ollama  |       internal/poll/fd_poll_runtime.go:84 +0x27 fp=0xc0000499c8 sp=0xc0000499a0 pc=0x559920a9f167
ollama  | internal/poll.(*pollDesc).waitRead(...)
ollama  |       internal/poll/fd_poll_runtime.go:89
ollama  | internal/poll.(*FD).Read(0xc000624d80, {0xc000666000, 0x1000, 0x1000})
ollama  |       internal/poll/fd_unix.go:165 +0x27a fp=0xc000049a60 sp=0xc0000499c8 pc=0x559920aa045a
ollama  | net.(*netFD).Read(0xc000624d80, {0xc000666000?, 0xc000049ad0?, 0x559920a9f625?})
ollama  |       net/fd_posix.go:55 +0x25 fp=0xc000049aa8 sp=0xc000049a60 pc=0x559920b155a5
ollama  | net.(*conn).Read(0xc00011c970, {0xc000666000?, 0x0?, 0x0?})
ollama  |       net/net.go:194 +0x45 fp=0xc000049af0 sp=0xc000049aa8 pc=0x559920b23965
ollama  | net/http.(*connReader).Read(0xc000223a70, {0xc000666000, 0x1000, 0x1000})
ollama  |       net/http/server.go:798 +0x159 fp=0xc000049b40 sp=0xc000049af0 pc=0x559920d10619
ollama  | bufio.(*Reader).fill(0xc0000348a0)
ollama  |       bufio/bufio.go:113 +0x103 fp=0xc000049b78 sp=0xc000049b40 pc=0x559920b3ba23
ollama  | bufio.(*Reader).Peek(0xc0000348a0, 0x4)
ollama  |       bufio/bufio.go:152 +0x53 fp=0xc000049b98 sp=0xc000049b78 pc=0x559920b3bb53
ollama  | net/http.(*conn).serve(0xc00063a480, {0x55992203bea8, 0xc0002235f0})
ollama  |       net/http/server.go:2137 +0x785 fp=0xc000049fb8 sp=0xc000049b98 pc=0x559920d16405
ollama  | net/http.(*Server).Serve.gowrap3()
ollama  |       net/http/server.go:3454 +0x28 fp=0xc000049fe0 sp=0xc000049fb8 pc=0x559920d1bb68
ollama  | runtime.goexit({})
ollama  |       runtime/asm_amd64.s:1700 +0x1 fp=0xc000049fe8 sp=0xc000049fe0 pc=0x559920a1fc61
ollama  | created by net/http.(*Server).Serve in goroutine 1
ollama  |       net/http/server.go:3454 +0x485
ollama  |
ollama  | rax    0x7f8461d62698
ollama  | rbx    0x559932c29a90
ollama  | rcx    0x0
ollama  | rdx    0x55992a6d7100
ollama  | rdi    0x7f8400e58e48
ollama  | rsi    0x3
ollama  | rbp    0x7f8461d62698
ollama  | rsp    0x7fff73e013b0
ollama  | r8     0x0
ollama  | r9     0x0
ollama  | r10    0x559932c2b670
ollama  | r11    0x290
ollama  | r12    0x0
ollama  | r13    0x0
ollama  | r14    0x18
ollama  | r15    0x1
ollama  | rip    0x7f8400ac0bf5
ollama  | rflags 0x10246
ollama  | cs     0x33
ollama  | fs     0x0
ollama  | gs     0x0
ollama  | time=2026-01-21T09:17:10.784Z level=INFO source=sched.go:479 msg="Load failed" model=/root/.ollama/models/blobs/sha256-7fce9a253a25197d72ccc5898fe0789ac7fc7e17cbe7a62560b304163a894a92 error="llama runner process has terminated: exit status 2"
<!-- gh-comment-id:3777113385 --> @rickytato commented on GitHub (Jan 21, 2026): I think I have the same problem on the same architecture, using `ollama:rocm` in Docker: ``` ollama | SIGSEGV: segmentation violation ollama | PC=0x7f8400ac0bf5 m=0 sigcode=1 addr=0x18 ollama | signal arrived during cgo execution ollama | ollama | goroutine 57 gp=0xc000455500 m=0 mp=0x5599229453c0 [syscall]: ollama | runtime.cgocall(0x5599217b4c30, 0xc0000bab58) ollama | runtime/cgocall.go:167 +0x4b fp=0xc0000bab30 sp=0xc0000baaf8 pc=0x559920a1494b ollama | github.com/ollama/ollama/llama._Cfunc_llama_model_load_from_file(0x55992ae36e70, {0xc000398af0, 0x0, 0x23, 0x1, 0x0, 0xc0003987a8, 0x5599217b4420, 0xc000398ae8, 0x0, ...}) ollama | _cgo_gotypes.go:910 +0x4b fp=0xc0000bab58 sp=0xc0000bab30 pc=0x559920e15a4b ollama | github.com/ollama/ollama/llama.LoadModelFromFile.func1(...) ollama | github.com/ollama/ollama/llama/llama.go:308 ollama | github.com/ollama/ollama/llama.LoadModelFromFile({0x7fff73e03de1, 0x62}, {{0xc0003987e0, 0x1, 0x1}, 0x23, 0x0, 0x1, {0xc0003987a8, 0x1, ...}, ...}) ollama | github.com/ollama/ollama/llama/llama.go:308 +0x56d fp=0xc0000bada0 sp=0xc0000bab58 pc=0x559920e18dcd ollama | github.com/ollama/ollama/runner/llamarunner.(*Server).loadModel(0xc00063cc80, {{0xc0003987e0, 0x1, 0x1}, 0x23, 0x0, 0x1, {0xc0003987a8, 0x1, 0x2}, ...}, ...) 
ollama | github.com/ollama/ollama/runner/llamarunner/runner.go:841 +0x9e fp=0xc0000baee8 sp=0xc0000bada0 pc=0x559920ed2d5e ollama | github.com/ollama/ollama/runner/llamarunner.(*Server).load.gowrap2() ollama | github.com/ollama/ollama/runner/llamarunner/runner.go:934 +0x114 fp=0xc0000bafe0 sp=0xc0000baee8 pc=0x559920ed4054 ollama | runtime.goexit({}) ollama | runtime/asm_amd64.s:1700 +0x1 fp=0xc0000bafe8 sp=0xc0000bafe0 pc=0x559920a1fc61 ollama | created by github.com/ollama/ollama/runner/llamarunner.(*Server).load in goroutine 54 ollama | github.com/ollama/ollama/runner/llamarunner/runner.go:934 +0x889 ollama | ollama | goroutine 1 gp=0xc000002380 m=nil [IO wait]: ollama | runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?) ollama | runtime/proc.go:435 +0xce fp=0xc000587790 sp=0xc000587770 pc=0x559920a17dce ollama | runtime.netpollblock(0xc0005877e0?, 0x209b1506?, 0x99?) ollama | runtime/netpoll.go:575 +0xf7 fp=0xc0005877c8 sp=0xc000587790 pc=0x5599209dd0f7 ollama | internal/poll.runtime_pollWait(0x7f8419253de0, 0x72) ollama | runtime/netpoll.go:351 +0x85 fp=0xc0005877e8 sp=0xc0005877c8 pc=0x559920a16fe5 ollama | internal/poll.(*pollDesc).wait(0xc000624d00?, 0x900000036?, 0x0) ollama | internal/poll/fd_poll_runtime.go:84 +0x27 fp=0xc000587810 sp=0xc0005877e8 pc=0x559920a9f167 ollama | internal/poll.(*pollDesc).waitRead(...) ollama | internal/poll/fd_poll_runtime.go:89 ollama | internal/poll.(*FD).Accept(0xc000624d00) ollama | internal/poll/fd_unix.go:620 +0x295 fp=0xc0005878b8 sp=0xc000587810 pc=0x559920aa4535 ollama | net.(*netFD).accept(0xc000624d00) ollama | net/fd_unix.go:172 +0x29 fp=0xc000587970 sp=0xc0005878b8 pc=0x559920b17549 ollama | net.(*TCPListener).accept(0xc000695b40) ollama | net/tcpsock_posix.go:159 +0x1b fp=0xc0005879c0 sp=0xc000587970 pc=0x559920b2d45b ollama | net.(*TCPListener).Accept(0xc000695b40) ollama | net/tcpsock.go:380 +0x30 fp=0xc0005879f0 sp=0xc0005879c0 pc=0x559920b2c310 ollama | net/http.(*onceCloseListener).Accept(0xc00063a480?) 
ollama | <autogenerated>:1 +0x24 fp=0xc000587a08 sp=0xc0005879f0 pc=0x559920d43ea4 ollama | net/http.(*Server).Serve(0xc000051300, {0x5599220398c0, 0xc000695b40}) ollama | net/http/server.go:3424 +0x30c fp=0xc000587b38 sp=0xc000587a08 pc=0x559920d1b76c ollama | github.com/ollama/ollama/runner/llamarunner.Execute({0xc000130140, 0x4, 0x4}) ollama | github.com/ollama/ollama/runner/llamarunner/runner.go:1002 +0x8f5 fp=0xc000587d08 sp=0xc000587b38 pc=0x559920ed4a15 ollama | github.com/ollama/ollama/runner.Execute({0xc000130130?, 0x0?, 0x0?}) ollama | github.com/ollama/ollama/runner/runner.go:30 +0x13b fp=0xc000587d30 sp=0xc000587d08 pc=0x559920f8057b ollama | github.com/ollama/ollama/cmd.NewCLI.func2(0xc000051000?, {0x559921afd0d0?, 0x4?, 0x559921afd0d4?}) ollama | github.com/ollama/ollama/cmd/cmd.go:1881 +0x45 fp=0xc000587d58 sp=0xc000587d30 pc=0x559921744c65 ollama | github.com/spf13/cobra.(*Command).execute(0xc00063f508, {0xc000695940, 0x4, 0x4}) ollama | github.com/spf13/cobra@v1.7.0/command.go:940 +0x85c fp=0xc000587e78 sp=0xc000587d58 pc=0x559920b914bc ollama | github.com/spf13/cobra.(*Command).ExecuteC(0xc000610908) ollama | github.com/spf13/cobra@v1.7.0/command.go:1068 +0x3a5 fp=0xc000587f30 sp=0xc000587e78 pc=0x559920b91d05 ollama | github.com/spf13/cobra.(*Command).Execute(...) ollama | github.com/spf13/cobra@v1.7.0/command.go:992 ollama | github.com/spf13/cobra.(*Command).ExecuteContext(...) ollama | github.com/spf13/cobra@v1.7.0/command.go:985 ollama | main.main() ollama | github.com/ollama/ollama/main.go:12 +0x4d fp=0xc000587f50 sp=0xc000587f30 pc=0x55992174574d ollama | runtime.main() ollama | runtime/proc.go:283 +0x29d fp=0xc000587fe0 sp=0xc000587f50 pc=0x5599209e477d ollama | runtime.goexit({}) ollama | runtime/asm_amd64.s:1700 +0x1 fp=0xc000587fe8 sp=0xc000587fe0 pc=0x559920a1fc61 ollama | ollama | goroutine 2 gp=0xc000002e00 m=nil [force gc (idle)]: ollama | runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?) 
ollama | runtime/proc.go:435 +0xce fp=0xc0000aafa8 sp=0xc0000aaf88 pc=0x559920a17dce ollama | runtime.goparkunlock(...) ollama | runtime/proc.go:441 ollama | runtime.forcegchelper() ollama | runtime/proc.go:348 +0xb8 fp=0xc0000aafe0 sp=0xc0000aafa8 pc=0x5599209e4ab8 ollama | runtime.goexit({}) ollama | runtime/asm_amd64.s:1700 +0x1 fp=0xc0000aafe8 sp=0xc0000aafe0 pc=0x559920a1fc61 ollama | created by runtime.init.7 in goroutine 1 ollama | runtime/proc.go:336 +0x1a ollama | ollama | goroutine 3 gp=0xc000003340 m=nil [GC sweep wait]: ollama | runtime.gopark(0x1?, 0x0?, 0x0?, 0x0?, 0x0?) ollama | runtime/proc.go:435 +0xce fp=0xc0000ab780 sp=0xc0000ab760 pc=0x559920a17dce ollama | runtime.goparkunlock(...) ollama | runtime/proc.go:441 ollama | runtime.bgsweep(0xc0000d6000) ollama | runtime/mgcsweep.go:316 +0xdf fp=0xc0000ab7c8 sp=0xc0000ab780 pc=0x5599209cf25f ollama | runtime.gcenable.gowrap1() ollama | runtime/mgc.go:204 +0x25 fp=0xc0000ab7e0 sp=0xc0000ab7c8 pc=0x5599209c3645 ollama | runtime.goexit({}) ollama | runtime/asm_amd64.s:1700 +0x1 fp=0xc0000ab7e8 sp=0xc0000ab7e0 pc=0x559920a1fc61 ollama | created by runtime.gcenable in goroutine 1 ollama | runtime/mgc.go:204 +0x66 ollama | ollama | goroutine 4 gp=0xc000003500 m=nil [GC scavenge wait]: ollama | runtime.gopark(0x10000?, 0x559921cdb140?, 0x0?, 0x0?, 0x0?) ollama | runtime/proc.go:435 +0xce fp=0xc0000abf78 sp=0xc0000abf58 pc=0x559920a17dce ollama | runtime.goparkunlock(...) 
ollama | runtime/proc.go:441 ollama | runtime.(*scavengerState).park(0x5599229425a0) ollama | runtime/mgcscavenge.go:425 +0x49 fp=0xc0000abfa8 sp=0xc0000abf78 pc=0x5599209ccca9 ollama | runtime.bgscavenge(0xc0000d6000) ollama | runtime/mgcscavenge.go:658 +0x59 fp=0xc0000abfc8 sp=0xc0000abfa8 pc=0x5599209cd239 ollama | runtime.gcenable.gowrap2() ollama | runtime/mgc.go:205 +0x25 fp=0xc0000abfe0 sp=0xc0000abfc8 pc=0x5599209c35e5 ollama | runtime.goexit({}) ollama | runtime/asm_amd64.s:1700 +0x1 fp=0xc0000abfe8 sp=0xc0000abfe0 pc=0x559920a1fc61 ollama | created by runtime.gcenable in goroutine 1 ollama | runtime/mgc.go:205 +0xa5 ollama | ollama | goroutine 18 gp=0xc000102700 m=nil [finalizer wait]: ollama | runtime.gopark(0x1b8?, 0xc000002380?, 0x1?, 0x23?, 0xc0000aa688?) ollama | runtime/proc.go:435 +0xce fp=0xc0000aa630 sp=0xc0000aa610 pc=0x559920a17dce ollama | runtime.runfinq() ollama | runtime/mfinal.go:196 +0x107 fp=0xc0000aa7e0 sp=0xc0000aa630 pc=0x5599209c2607 ollama | runtime.goexit({}) ollama | runtime/asm_amd64.s:1700 +0x1 fp=0xc0000aa7e8 sp=0xc0000aa7e0 pc=0x559920a1fc61 ollama | created by runtime.createfing in goroutine 1 ollama | runtime/mfinal.go:166 +0x3d ollama | ollama | goroutine 19 gp=0xc000103180 m=nil [chan receive]: ollama | runtime.gopark(0xc00022d540?, 0xc000588018?, 0x60?, 0x67?, 0x559920afe188?) ollama | runtime/proc.go:435 +0xce fp=0xc0000a6718 sp=0xc0000a66f8 pc=0x559920a17dce ollama | runtime.chanrecv(0xc000110310, 0x0, 0x1) ollama | runtime/chan.go:664 +0x445 fp=0xc0000a6790 sp=0xc0000a6718 pc=0x5599209b40e5 ollama | runtime.chanrecv1(0x0?, 0x0?) ollama | runtime/chan.go:506 +0x12 fp=0xc0000a67b8 sp=0xc0000a6790 pc=0x5599209b3c72 ollama | runtime.unique_runtime_registerUniqueMapCleanup.func2(...) 
ollama | runtime/mgc.go:1796 ollama | runtime.unique_runtime_registerUniqueMapCleanup.gowrap1() ollama | runtime/mgc.go:1799 +0x2f fp=0xc0000a67e0 sp=0xc0000a67b8 pc=0x5599209c67ef ollama | runtime.goexit({}) ollama | runtime/asm_amd64.s:1700 +0x1 fp=0xc0000a67e8 sp=0xc0000a67e0 pc=0x559920a1fc61 ollama | created by unique.runtime_registerUniqueMapCleanup in goroutine 1 ollama | runtime/mgc.go:1794 +0x85 ollama | ollama | goroutine 20 gp=0xc000103500 m=nil [GC worker (idle)]: ollama | runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?) ollama | runtime/proc.go:435 +0xce fp=0xc0000a6f38 sp=0xc0000a6f18 pc=0x559920a17dce ollama | runtime.gcBgMarkWorker(0xc000111570) ollama | runtime/mgc.go:1423 +0xe9 fp=0xc0000a6fc8 sp=0xc0000a6f38 pc=0x5599209c5b09 ollama | runtime.gcBgMarkStartWorkers.gowrap1() ollama | runtime/mgc.go:1339 +0x25 fp=0xc0000a6fe0 sp=0xc0000a6fc8 pc=0x5599209c59e5 ollama | runtime.goexit({}) ollama | runtime/asm_amd64.s:1700 +0x1 fp=0xc0000a6fe8 sp=0xc0000a6fe0 pc=0x559920a1fc61 ollama | created by runtime.gcBgMarkStartWorkers in goroutine 1 ollama | runtime/mgc.go:1339 +0x105 ollama | ollama | goroutine 21 gp=0xc0001036c0 m=nil [GC worker (idle)]: ollama | runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?) ollama | runtime/proc.go:435 +0xce fp=0xc0000a7738 sp=0xc0000a7718 pc=0x559920a17dce ollama | runtime.gcBgMarkWorker(0xc000111570) ollama | runtime/mgc.go:1423 +0xe9 fp=0xc0000a77c8 sp=0xc0000a7738 pc=0x5599209c5b09 ollama | runtime.gcBgMarkStartWorkers.gowrap1() ollama | runtime/mgc.go:1339 +0x25 fp=0xc0000a77e0 sp=0xc0000a77c8 pc=0x5599209c59e5 ollama | runtime.goexit({}) ollama | runtime/asm_amd64.s:1700 +0x1 fp=0xc0000a77e8 sp=0xc0000a77e0 pc=0x559920a1fc61 ollama | created by runtime.gcBgMarkStartWorkers in goroutine 1 ollama | runtime/mgc.go:1339 +0x105 ollama | ollama | goroutine 22 gp=0xc000103880 m=nil [GC worker (idle)]: ollama | runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?) 
ollama | runtime/proc.go:435 +0xce fp=0xc0000a7f38 sp=0xc0000a7f18 pc=0x559920a17dce ollama | runtime.gcBgMarkWorker(0xc000111570) ollama | runtime/mgc.go:1423 +0xe9 fp=0xc0000a7fc8 sp=0xc0000a7f38 pc=0x5599209c5b09 ollama | runtime.gcBgMarkStartWorkers.gowrap1() ollama | runtime/mgc.go:1339 +0x25 fp=0xc0000a7fe0 sp=0xc0000a7fc8 pc=0x5599209c59e5 ollama | runtime.goexit({}) ollama | runtime/asm_amd64.s:1700 +0x1 fp=0xc0000a7fe8 sp=0xc0000a7fe0 pc=0x559920a1fc61 ollama | created by runtime.gcBgMarkStartWorkers in goroutine 1 ollama | runtime/mgc.go:1339 +0x105 ollama | ollama | goroutine 5 gp=0xc000003dc0 m=nil [GC worker (idle)]: ollama | runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?) ollama | runtime/proc.go:435 +0xce fp=0xc0000ac738 sp=0xc0000ac718 pc=0x559920a17dce ollama | runtime.gcBgMarkWorker(0xc000111570) ollama | runtime/mgc.go:1423 +0xe9 fp=0xc0000ac7c8 sp=0xc0000ac738 pc=0x5599209c5b09 ollama | runtime.gcBgMarkStartWorkers.gowrap1() ollama | runtime/mgc.go:1339 +0x25 fp=0xc0000ac7e0 sp=0xc0000ac7c8 pc=0x5599209c59e5 ollama | runtime.goexit({}) ollama | runtime/asm_amd64.s:1700 +0x1 fp=0xc0000ac7e8 sp=0xc0000ac7e0 pc=0x559920a1fc61 ollama | created by runtime.gcBgMarkStartWorkers in goroutine 1 ollama | runtime/mgc.go:1339 +0x105 ollama | ollama | goroutine 6 gp=0xc0000ec000 m=nil [GC worker (idle)]: ollama | runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?) 
ollama | runtime/proc.go:435 +0xce fp=0xc0000acf38 sp=0xc0000acf18 pc=0x559920a17dce ollama | runtime.gcBgMarkWorker(0xc000111570) ollama | runtime/mgc.go:1423 +0xe9 fp=0xc0000acfc8 sp=0xc0000acf38 pc=0x5599209c5b09 ollama | runtime.gcBgMarkStartWorkers.gowrap1() ollama | runtime/mgc.go:1339 +0x25 fp=0xc0000acfe0 sp=0xc0000acfc8 pc=0x5599209c59e5 ollama | runtime.goexit({}) ollama | runtime/asm_amd64.s:1700 +0x1 fp=0xc0000acfe8 sp=0xc0000acfe0 pc=0x559920a1fc61 ollama | created by runtime.gcBgMarkStartWorkers in goroutine 1 ollama | runtime/mgc.go:1339 +0x105 ollama | ollama | goroutine 7 gp=0xc0000ec1c0 m=nil [GC worker (idle)]: ollama | runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?) ollama | runtime/proc.go:435 +0xce fp=0xc0000ad738 sp=0xc0000ad718 pc=0x559920a17dce ollama | runtime.gcBgMarkWorker(0xc000111570) ollama | runtime/mgc.go:1423 +0xe9 fp=0xc0000ad7c8 sp=0xc0000ad738 pc=0x5599209c5b09 ollama | runtime.gcBgMarkStartWorkers.gowrap1() ollama | runtime/mgc.go:1339 +0x25 fp=0xc0000ad7e0 sp=0xc0000ad7c8 pc=0x5599209c59e5 ollama | runtime.goexit({}) ollama | runtime/asm_amd64.s:1700 +0x1 fp=0xc0000ad7e8 sp=0xc0000ad7e0 pc=0x559920a1fc61 ollama | created by runtime.gcBgMarkStartWorkers in goroutine 1 ollama | runtime/mgc.go:1339 +0x105 ollama | ollama | goroutine 8 gp=0xc0000ec380 m=nil [GC worker (idle)]: ollama | runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?) 
ollama | runtime/proc.go:435 +0xce fp=0xc0000adf38 sp=0xc0000adf18 pc=0x559920a17dce ollama | runtime.gcBgMarkWorker(0xc000111570) ollama | runtime/mgc.go:1423 +0xe9 fp=0xc0000adfc8 sp=0xc0000adf38 pc=0x5599209c5b09 ollama | runtime.gcBgMarkStartWorkers.gowrap1() ollama | runtime/mgc.go:1339 +0x25 fp=0xc0000adfe0 sp=0xc0000adfc8 pc=0x5599209c59e5 ollama | runtime.goexit({}) ollama | runtime/asm_amd64.s:1700 +0x1 fp=0xc0000adfe8 sp=0xc0000adfe0 pc=0x559920a1fc61 ollama | created by runtime.gcBgMarkStartWorkers in goroutine 1 ollama | runtime/mgc.go:1339 +0x105 ollama | ollama | goroutine 9 gp=0xc0000ec540 m=nil [GC worker (idle)]: ollama | runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?) ollama | runtime/proc.go:435 +0xce fp=0xc0000f2738 sp=0xc0000f2718 pc=0x559920a17dce ollama | runtime.gcBgMarkWorker(0xc000111570) ollama | runtime/mgc.go:1423 +0xe9 fp=0xc0000f27c8 sp=0xc0000f2738 pc=0x5599209c5b09 ollama | runtime.gcBgMarkStartWorkers.gowrap1() ollama | runtime/mgc.go:1339 +0x25 fp=0xc0000f27e0 sp=0xc0000f27c8 pc=0x5599209c59e5 ollama | runtime.goexit({}) ollama | runtime/asm_amd64.s:1700 +0x1 fp=0xc0000f27e8 sp=0xc0000f27e0 pc=0x559920a1fc61 ollama | created by runtime.gcBgMarkStartWorkers in goroutine 1 ollama | runtime/mgc.go:1339 +0x105 ollama | ollama | goroutine 10 gp=0xc0000ec700 m=nil [GC worker (idle)]: ollama | runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?) 
ollama | runtime/proc.go:435 +0xce fp=0xc0000f2f38 sp=0xc0000f2f18 pc=0x559920a17dce ollama | runtime.gcBgMarkWorker(0xc000111570) ollama | runtime/mgc.go:1423 +0xe9 fp=0xc0000f2fc8 sp=0xc0000f2f38 pc=0x5599209c5b09 ollama | runtime.gcBgMarkStartWorkers.gowrap1() ollama | runtime/mgc.go:1339 +0x25 fp=0xc0000f2fe0 sp=0xc0000f2fc8 pc=0x5599209c59e5 ollama | runtime.goexit({}) ollama | runtime/asm_amd64.s:1700 +0x1 fp=0xc0000f2fe8 sp=0xc0000f2fe0 pc=0x559920a1fc61 ollama | created by runtime.gcBgMarkStartWorkers in goroutine 1 ollama | runtime/mgc.go:1339 +0x105 ollama | ollama | goroutine 11 gp=0xc0000ec8c0 m=nil [GC worker (idle)]: ollama | runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?) ollama | runtime/proc.go:435 +0xce fp=0xc0000f3738 sp=0xc0000f3718 pc=0x559920a17dce ollama | runtime.gcBgMarkWorker(0xc000111570) ollama | runtime/mgc.go:1423 +0xe9 fp=0xc0000f37c8 sp=0xc0000f3738 pc=0x5599209c5b09 ollama | runtime.gcBgMarkStartWorkers.gowrap1() ollama | runtime/mgc.go:1339 +0x25 fp=0xc0000f37e0 sp=0xc0000f37c8 pc=0x5599209c59e5 ollama | runtime.goexit({}) ollama | runtime/asm_amd64.s:1700 +0x1 fp=0xc0000f37e8 sp=0xc0000f37e0 pc=0x559920a1fc61 ollama | created by runtime.gcBgMarkStartWorkers in goroutine 1 ollama | runtime/mgc.go:1339 +0x105 ollama | ollama | goroutine 12 gp=0xc0000eca80 m=nil [GC worker (idle)]: ollama | runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?) 
ollama | runtime/proc.go:435 +0xce fp=0xc0000f3f38 sp=0xc0000f3f18 pc=0x559920a17dce ollama | runtime.gcBgMarkWorker(0xc000111570) ollama | runtime/mgc.go:1423 +0xe9 fp=0xc0000f3fc8 sp=0xc0000f3f38 pc=0x5599209c5b09 ollama | runtime.gcBgMarkStartWorkers.gowrap1() ollama | runtime/mgc.go:1339 +0x25 fp=0xc0000f3fe0 sp=0xc0000f3fc8 pc=0x5599209c59e5 ollama | runtime.goexit({}) ollama | runtime/asm_amd64.s:1700 +0x1 fp=0xc0000f3fe8 sp=0xc0000f3fe0 pc=0x559920a1fc61 ollama | created by runtime.gcBgMarkStartWorkers in goroutine 1 ollama | runtime/mgc.go:1339 +0x105 ollama | ollama | goroutine 23 gp=0xc000103a40 m=nil [GC worker (idle)]: ollama | runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?) ollama | runtime/proc.go:435 +0xce fp=0xc0000a8738 sp=0xc0000a8718 pc=0x559920a17dce ollama | runtime.gcBgMarkWorker(0xc000111570) ollama | runtime/mgc.go:1423 +0xe9 fp=0xc0000a87c8 sp=0xc0000a8738 pc=0x5599209c5b09 ollama | runtime.gcBgMarkStartWorkers.gowrap1() ollama | runtime/mgc.go:1339 +0x25 fp=0xc0000a87e0 sp=0xc0000a87c8 pc=0x5599209c59e5 ollama | runtime.goexit({}) ollama | runtime/asm_amd64.s:1700 +0x1 fp=0xc0000a87e8 sp=0xc0000a87e0 pc=0x559920a1fc61 ollama | created by runtime.gcBgMarkStartWorkers in goroutine 1 ollama | runtime/mgc.go:1339 +0x105 ollama | ollama | goroutine 24 gp=0xc000103c00 m=nil [GC worker (idle)]: ollama | runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?) 
ollama | runtime/proc.go:435 +0xce fp=0xc0000a8f38 sp=0xc0000a8f18 pc=0x559920a17dce ollama | runtime.gcBgMarkWorker(0xc000111570) ollama | runtime/mgc.go:1423 +0xe9 fp=0xc0000a8fc8 sp=0xc0000a8f38 pc=0x5599209c5b09 ollama | runtime.gcBgMarkStartWorkers.gowrap1() ollama | runtime/mgc.go:1339 +0x25 fp=0xc0000a8fe0 sp=0xc0000a8fc8 pc=0x5599209c59e5 ollama | runtime.goexit({}) ollama | runtime/asm_amd64.s:1700 +0x1 fp=0xc0000a8fe8 sp=0xc0000a8fe0 pc=0x559920a1fc61 ollama | created by runtime.gcBgMarkStartWorkers in goroutine 1 ollama | runtime/mgc.go:1339 +0x105 ollama | ollama | goroutine 25 gp=0xc000103dc0 m=nil [GC worker (idle)]: ollama | runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?) ollama | runtime/proc.go:435 +0xce fp=0xc0000a9738 sp=0xc0000a9718 pc=0x559920a17dce ollama | runtime.gcBgMarkWorker(0xc000111570) ollama | runtime/mgc.go:1423 +0xe9 fp=0xc0000a97c8 sp=0xc0000a9738 pc=0x5599209c5b09 ollama | runtime.gcBgMarkStartWorkers.gowrap1() ollama | runtime/mgc.go:1339 +0x25 fp=0xc0000a97e0 sp=0xc0000a97c8 pc=0x5599209c59e5 ollama | runtime.goexit({}) ollama | runtime/asm_amd64.s:1700 +0x1 fp=0xc0000a97e8 sp=0xc0000a97e0 pc=0x559920a1fc61 ollama | created by runtime.gcBgMarkStartWorkers in goroutine 1 ollama | runtime/mgc.go:1339 +0x105 ollama | ollama | goroutine 26 gp=0xc000454000 m=nil [GC worker (idle)]: ollama | runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?) 
ollama | runtime/proc.go:435 +0xce fp=0xc0000a9f38 sp=0xc0000a9f18 pc=0x559920a17dce ollama | runtime.gcBgMarkWorker(0xc000111570) ollama | runtime/mgc.go:1423 +0xe9 fp=0xc0000a9fc8 sp=0xc0000a9f38 pc=0x5599209c5b09 ollama | runtime.gcBgMarkStartWorkers.gowrap1() ollama | runtime/mgc.go:1339 +0x25 fp=0xc0000a9fe0 sp=0xc0000a9fc8 pc=0x5599209c59e5 ollama | runtime.goexit({}) ollama | runtime/asm_amd64.s:1700 +0x1 fp=0xc0000a9fe8 sp=0xc0000a9fe0 pc=0x559920a1fc61 ollama | created by runtime.gcBgMarkStartWorkers in goroutine 1 ollama | runtime/mgc.go:1339 +0x105 ollama | ollama | goroutine 27 gp=0xc0004541c0 m=nil [GC worker (idle)]: ollama | runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?) ollama | runtime/proc.go:435 +0xce fp=0xc0000ee738 sp=0xc0000ee718 pc=0x559920a17dce ollama | runtime.gcBgMarkWorker(0xc000111570) ollama | runtime/mgc.go:1423 +0xe9 fp=0xc0000ee7c8 sp=0xc0000ee738 pc=0x5599209c5b09 ollama | runtime.gcBgMarkStartWorkers.gowrap1() ollama | runtime/mgc.go:1339 +0x25 fp=0xc0000ee7e0 sp=0xc0000ee7c8 pc=0x5599209c59e5 ollama | runtime.goexit({}) ollama | runtime/asm_amd64.s:1700 +0x1 fp=0xc0000ee7e8 sp=0xc0000ee7e0 pc=0x559920a1fc61 ollama | created by runtime.gcBgMarkStartWorkers in goroutine 1 ollama | runtime/mgc.go:1339 +0x105 ollama | ollama | goroutine 28 gp=0xc000454380 m=nil [GC worker (idle)]: ollama | runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?) 
ollama | runtime/proc.go:435 +0xce fp=0xc0000eef38 sp=0xc0000eef18 pc=0x559920a17dce ollama | runtime.gcBgMarkWorker(0xc000111570) ollama | runtime/mgc.go:1423 +0xe9 fp=0xc0000eefc8 sp=0xc0000eef38 pc=0x5599209c5b09 ollama | runtime.gcBgMarkStartWorkers.gowrap1() ollama | runtime/mgc.go:1339 +0x25 fp=0xc0000eefe0 sp=0xc0000eefc8 pc=0x5599209c59e5 ollama | runtime.goexit({}) ollama | runtime/asm_amd64.s:1700 +0x1 fp=0xc0000eefe8 sp=0xc0000eefe0 pc=0x559920a1fc61 ollama | created by runtime.gcBgMarkStartWorkers in goroutine 1 ollama | runtime/mgc.go:1339 +0x105 ollama | ollama | goroutine 34 gp=0xc000484000 m=nil [GC worker (idle)]: ollama | runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?) ollama | runtime/proc.go:435 +0xce fp=0xc00048a738 sp=0xc00048a718 pc=0x559920a17dce ollama | runtime.gcBgMarkWorker(0xc000111570) ollama | runtime/mgc.go:1423 +0xe9 fp=0xc00048a7c8 sp=0xc00048a738 pc=0x5599209c5b09 ollama | runtime.gcBgMarkStartWorkers.gowrap1() ollama | runtime/mgc.go:1339 +0x25 fp=0xc00048a7e0 sp=0xc00048a7c8 pc=0x5599209c59e5 ollama | runtime.goexit({}) ollama | runtime/asm_amd64.s:1700 +0x1 fp=0xc00048a7e8 sp=0xc00048a7e0 pc=0x559920a1fc61 ollama | created by runtime.gcBgMarkStartWorkers in goroutine 1 ollama | runtime/mgc.go:1339 +0x105 ollama | ollama | goroutine 13 gp=0xc0000ecc40 m=nil [GC worker (idle)]: ollama | runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?) 
ollama | runtime/proc.go:435 +0xce fp=0xc0000f4738 sp=0xc0000f4718 pc=0x559920a17dce ollama | runtime.gcBgMarkWorker(0xc000111570) ollama | runtime/mgc.go:1423 +0xe9 fp=0xc0000f47c8 sp=0xc0000f4738 pc=0x5599209c5b09 ollama | runtime.gcBgMarkStartWorkers.gowrap1() ollama | runtime/mgc.go:1339 +0x25 fp=0xc0000f47e0 sp=0xc0000f47c8 pc=0x5599209c59e5 ollama | runtime.goexit({}) ollama | runtime/asm_amd64.s:1700 +0x1 fp=0xc0000f47e8 sp=0xc0000f47e0 pc=0x559920a1fc61 ollama | created by runtime.gcBgMarkStartWorkers in goroutine 1 ollama | runtime/mgc.go:1339 +0x105 ollama | ollama | goroutine 14 gp=0xc0000ece00 m=nil [GC worker (idle)]: ollama | runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?) ollama | runtime/proc.go:435 +0xce fp=0xc0000f4f38 sp=0xc0000f4f18 pc=0x559920a17dce ollama | runtime.gcBgMarkWorker(0xc000111570) ollama | runtime/mgc.go:1423 +0xe9 fp=0xc0000f4fc8 sp=0xc0000f4f38 pc=0x5599209c5b09 ollama | runtime.gcBgMarkStartWorkers.gowrap1() ollama | runtime/mgc.go:1339 +0x25 fp=0xc0000f4fe0 sp=0xc0000f4fc8 pc=0x5599209c59e5 ollama | runtime.goexit({}) ollama | runtime/asm_amd64.s:1700 +0x1 fp=0xc0000f4fe8 sp=0xc0000f4fe0 pc=0x559920a1fc61 ollama | created by runtime.gcBgMarkStartWorkers in goroutine 1 ollama | runtime/mgc.go:1339 +0x105 ollama | ollama | goroutine 15 gp=0xc0000ecfc0 m=nil [GC worker (idle)]: ollama | runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?) 
ollama | runtime/proc.go:435 +0xce fp=0xc0000f5738 sp=0xc0000f5718 pc=0x559920a17dce ollama | runtime.gcBgMarkWorker(0xc000111570) ollama | runtime/mgc.go:1423 +0xe9 fp=0xc0000f57c8 sp=0xc0000f5738 pc=0x5599209c5b09 ollama | runtime.gcBgMarkStartWorkers.gowrap1() ollama | runtime/mgc.go:1339 +0x25 fp=0xc0000f57e0 sp=0xc0000f57c8 pc=0x5599209c59e5 ollama | runtime.goexit({}) ollama | runtime/asm_amd64.s:1700 +0x1 fp=0xc0000f57e8 sp=0xc0000f57e0 pc=0x559920a1fc61 ollama | created by runtime.gcBgMarkStartWorkers in goroutine 1 ollama | runtime/mgc.go:1339 +0x105 ollama | ollama | goroutine 16 gp=0xc0000ed180 m=nil [GC worker (idle)]: ollama | runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?) ollama | runtime/proc.go:435 +0xce fp=0xc0000f5f38 sp=0xc0000f5f18 pc=0x559920a17dce ollama | runtime.gcBgMarkWorker(0xc000111570) ollama | runtime/mgc.go:1423 +0xe9 fp=0xc0000f5fc8 sp=0xc0000f5f38 pc=0x5599209c5b09 ollama | runtime.gcBgMarkStartWorkers.gowrap1() ollama | runtime/mgc.go:1339 +0x25 fp=0xc0000f5fe0 sp=0xc0000f5fc8 pc=0x5599209c59e5 ollama | runtime.goexit({}) ollama | runtime/asm_amd64.s:1700 +0x1 fp=0xc0000f5fe8 sp=0xc0000f5fe0 pc=0x559920a1fc61 ollama | created by runtime.gcBgMarkStartWorkers in goroutine 1 ollama | runtime/mgc.go:1339 +0x105 ollama | ollama | goroutine 50 gp=0xc0000ed340 m=nil [GC worker (idle)]: ollama | runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?) 
ollama | runtime/proc.go:435 +0xce fp=0xc000486738 sp=0xc000486718 pc=0x559920a17dce ollama | runtime.gcBgMarkWorker(0xc000111570) ollama | runtime/mgc.go:1423 +0xe9 fp=0xc0004867c8 sp=0xc000486738 pc=0x5599209c5b09 ollama | runtime.gcBgMarkStartWorkers.gowrap1() ollama | runtime/mgc.go:1339 +0x25 fp=0xc0004867e0 sp=0xc0004867c8 pc=0x5599209c59e5 ollama | runtime.goexit({}) ollama | runtime/asm_amd64.s:1700 +0x1 fp=0xc0004867e8 sp=0xc0004867e0 pc=0x559920a1fc61 ollama | created by runtime.gcBgMarkStartWorkers in goroutine 1 ollama | runtime/mgc.go:1339 +0x105 ollama | ollama | goroutine 51 gp=0xc0000ed500 m=nil [GC worker (idle)]: ollama | runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?) ollama | runtime/proc.go:435 +0xce fp=0xc000486f38 sp=0xc000486f18 pc=0x559920a17dce ollama | runtime.gcBgMarkWorker(0xc000111570) ollama | runtime/mgc.go:1423 +0xe9 fp=0xc000486fc8 sp=0xc000486f38 pc=0x5599209c5b09 ollama | runtime.gcBgMarkStartWorkers.gowrap1() ollama | runtime/mgc.go:1339 +0x25 fp=0xc000486fe0 sp=0xc000486fc8 pc=0x5599209c59e5 ollama | runtime.goexit({}) ollama | runtime/asm_amd64.s:1700 +0x1 fp=0xc000486fe8 sp=0xc000486fe0 pc=0x559920a1fc61 ollama | created by runtime.gcBgMarkStartWorkers in goroutine 1 ollama | runtime/mgc.go:1339 +0x105 ollama | ollama | goroutine 52 gp=0xc0000ed6c0 m=nil [GC worker (idle)]: ollama | runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?) 
ollama | runtime/proc.go:435 +0xce fp=0xc000487738 sp=0xc000487718 pc=0x559920a17dce ollama | runtime.gcBgMarkWorker(0xc000111570) ollama | runtime/mgc.go:1423 +0xe9 fp=0xc0004877c8 sp=0xc000487738 pc=0x5599209c5b09 ollama | runtime.gcBgMarkStartWorkers.gowrap1() ollama | runtime/mgc.go:1339 +0x25 fp=0xc0004877e0 sp=0xc0004877c8 pc=0x5599209c59e5 ollama | runtime.goexit({}) ollama | runtime/asm_amd64.s:1700 +0x1 fp=0xc0004877e8 sp=0xc0004877e0 pc=0x559920a1fc61 ollama | created by runtime.gcBgMarkStartWorkers in goroutine 1 ollama | runtime/mgc.go:1339 +0x105 ollama | ollama | goroutine 29 gp=0xc000454540 m=nil [GC worker (idle)]: ollama | runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?) ollama | runtime/proc.go:435 +0xce fp=0xc0000ef738 sp=0xc0000ef718 pc=0x559920a17dce ollama | runtime.gcBgMarkWorker(0xc000111570) ollama | runtime/mgc.go:1423 +0xe9 fp=0xc0000ef7c8 sp=0xc0000ef738 pc=0x5599209c5b09 ollama | runtime.gcBgMarkStartWorkers.gowrap1() ollama | runtime/mgc.go:1339 +0x25 fp=0xc0000ef7e0 sp=0xc0000ef7c8 pc=0x5599209c59e5 ollama | runtime.goexit({}) ollama | runtime/asm_amd64.s:1700 +0x1 fp=0xc0000ef7e8 sp=0xc0000ef7e0 pc=0x559920a1fc61 ollama | created by runtime.gcBgMarkStartWorkers in goroutine 1 ollama | runtime/mgc.go:1339 +0x105 ollama | ollama | goroutine 35 gp=0xc0004841c0 m=nil [GC worker (idle)]: ollama | runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?) 
ollama | runtime/proc.go:435 +0xce fp=0xc00048af38 sp=0xc00048af18 pc=0x559920a17dce ollama | runtime.gcBgMarkWorker(0xc000111570) ollama | runtime/mgc.go:1423 +0xe9 fp=0xc00048afc8 sp=0xc00048af38 pc=0x5599209c5b09 ollama | runtime.gcBgMarkStartWorkers.gowrap1() ollama | runtime/mgc.go:1339 +0x25 fp=0xc00048afe0 sp=0xc00048afc8 pc=0x5599209c59e5 ollama | runtime.goexit({}) ollama | runtime/asm_amd64.s:1700 +0x1 fp=0xc00048afe8 sp=0xc00048afe0 pc=0x559920a1fc61 ollama | created by runtime.gcBgMarkStartWorkers in goroutine 1 ollama | runtime/mgc.go:1339 +0x105 ollama | ollama | goroutine 36 gp=0xc000484380 m=nil [GC worker (idle)]: ollama | runtime.gopark(0x559922a10a80?, 0x1?, 0x3c?, 0x0?, 0x0?) ollama | runtime/proc.go:435 +0xce fp=0xc00048b738 sp=0xc00048b718 pc=0x559920a17dce ollama | runtime.gcBgMarkWorker(0xc000111570) ollama | runtime/mgc.go:1423 +0xe9 fp=0xc00048b7c8 sp=0xc00048b738 pc=0x5599209c5b09 ollama | runtime.gcBgMarkStartWorkers.gowrap1() ollama | runtime/mgc.go:1339 +0x25 fp=0xc00048b7e0 sp=0xc00048b7c8 pc=0x5599209c59e5 ollama | runtime.goexit({}) ollama | runtime/asm_amd64.s:1700 +0x1 fp=0xc00048b7e8 sp=0xc00048b7e0 pc=0x559920a1fc61 ollama | created by runtime.gcBgMarkStartWorkers in goroutine 1 ollama | runtime/mgc.go:1339 +0x105 ollama | ollama | goroutine 37 gp=0xc000484540 m=nil [GC worker (idle)]: ollama | runtime.gopark(0x363a0ac139?, 0x1?, 0x77?, 0x83?, 0x0?) 
ollama | runtime/proc.go:435 +0xce fp=0xc00048bf38 sp=0xc00048bf18 pc=0x559920a17dce ollama | runtime.gcBgMarkWorker(0xc000111570) ollama | runtime/mgc.go:1423 +0xe9 fp=0xc00048bfc8 sp=0xc00048bf38 pc=0x5599209c5b09 ollama | runtime.gcBgMarkStartWorkers.gowrap1() ollama | runtime/mgc.go:1339 +0x25 fp=0xc00048bfe0 sp=0xc00048bfc8 pc=0x5599209c59e5 ollama | runtime.goexit({}) ollama | runtime/asm_amd64.s:1700 +0x1 fp=0xc00048bfe8 sp=0xc00048bfe0 pc=0x559920a1fc61 ollama | created by runtime.gcBgMarkStartWorkers in goroutine 1 ollama | runtime/mgc.go:1339 +0x105 ollama | ollama | goroutine 38 gp=0xc000484700 m=nil [GC worker (idle)]: ollama | runtime.gopark(0x363a0ad71a?, 0x1?, 0x37?, 0x44?, 0x0?) ollama | runtime/proc.go:435 +0xce fp=0xc00048c738 sp=0xc00048c718 pc=0x559920a17dce ollama | runtime.gcBgMarkWorker(0xc000111570) ollama | runtime/mgc.go:1423 +0xe9 fp=0xc00048c7c8 sp=0xc00048c738 pc=0x5599209c5b09 ollama | runtime.gcBgMarkStartWorkers.gowrap1() ollama | runtime/mgc.go:1339 +0x25 fp=0xc00048c7e0 sp=0xc00048c7c8 pc=0x5599209c59e5 ollama | runtime.goexit({}) ollama | runtime/asm_amd64.s:1700 +0x1 fp=0xc00048c7e8 sp=0xc00048c7e0 pc=0x559920a1fc61 ollama | created by runtime.gcBgMarkStartWorkers in goroutine 1 ollama | runtime/mgc.go:1339 +0x105 ollama | ollama | goroutine 39 gp=0xc0004848c0 m=nil [GC worker (idle)]: ollama | runtime.gopark(0x363a0adae6?, 0x1?, 0xa4?, 0x29?, 0x0?) 
ollama | runtime/proc.go:435 +0xce fp=0xc00048cf38 sp=0xc00048cf18 pc=0x559920a17dce ollama | runtime.gcBgMarkWorker(0xc000111570) ollama | runtime/mgc.go:1423 +0xe9 fp=0xc00048cfc8 sp=0xc00048cf38 pc=0x5599209c5b09 ollama | runtime.gcBgMarkStartWorkers.gowrap1() ollama | runtime/mgc.go:1339 +0x25 fp=0xc00048cfe0 sp=0xc00048cfc8 pc=0x5599209c59e5 ollama | runtime.goexit({}) ollama | runtime/asm_amd64.s:1700 +0x1 fp=0xc00048cfe8 sp=0xc00048cfe0 pc=0x559920a1fc61 ollama | created by runtime.gcBgMarkStartWorkers in goroutine 1 ollama | runtime/mgc.go:1339 +0x105 ollama | ollama | goroutine 40 gp=0xc000484a80 m=nil [GC worker (idle)]: ollama | runtime.gopark(0x559922a10a80?, 0x1?, 0x3c?, 0x76?, 0x0?) ollama | runtime/proc.go:435 +0xce fp=0xc00048d738 sp=0xc00048d718 pc=0x559920a17dce ollama | runtime.gcBgMarkWorker(0xc000111570) ollama | runtime/mgc.go:1423 +0xe9 fp=0xc00048d7c8 sp=0xc00048d738 pc=0x5599209c5b09 ollama | runtime.gcBgMarkStartWorkers.gowrap1() ollama | runtime/mgc.go:1339 +0x25 fp=0xc00048d7e0 sp=0xc00048d7c8 pc=0x5599209c59e5 ollama | runtime.goexit({}) ollama | runtime/asm_amd64.s:1700 +0x1 fp=0xc00048d7e8 sp=0xc00048d7e0 pc=0x559920a1fc61 ollama | created by runtime.gcBgMarkStartWorkers in goroutine 1 ollama | runtime/mgc.go:1339 +0x105 ollama | ollama | goroutine 53 gp=0xc000454fc0 m=nil [sync.WaitGroup.Wait]: ollama | runtime.gopark(0x0?, 0x0?, 0x0?, 0x46?, 0x0?) ollama | runtime/proc.go:435 +0xce fp=0xc000489620 sp=0xc000489600 pc=0x559920a17dce ollama | runtime.goparkunlock(...) ollama | runtime/proc.go:441 ollama | runtime.semacquire1(0xc00063cca0, 0x0, 0x1, 0x0, 0x18) ollama | runtime/sema.go:188 +0x229 fp=0xc000489688 sp=0xc000489620 pc=0x5599209f7d49 ollama | sync.runtime_SemacquireWaitGroup(0x0?) ollama | runtime/sema.go:110 +0x25 fp=0xc0004896c0 sp=0xc000489688 pc=0x559920a19705 ollama | sync.(*WaitGroup).Wait(0x0?) 
ollama | sync/waitgroup.go:118 +0x48 fp=0xc0004896e8 sp=0xc0004896c0 pc=0x559920a2b6c8 ollama | github.com/ollama/ollama/runner/llamarunner.(*Server).run(0xc00063cc80, {0x55992203bee0, 0xc00069b900}) ollama | github.com/ollama/ollama/runner/llamarunner/runner.go:360 +0x4b fp=0xc0004897b8 sp=0xc0004896e8 pc=0x559920ecf72b ollama | github.com/ollama/ollama/runner/llamarunner.Execute.gowrap1() ollama | github.com/ollama/ollama/runner/llamarunner/runner.go:981 +0x28 fp=0xc0004897e0 sp=0xc0004897b8 pc=0x559920ed4c88 ollama | runtime.goexit({}) ollama | runtime/asm_amd64.s:1700 +0x1 fp=0xc0004897e8 sp=0xc0004897e0 pc=0x559920a1fc61 ollama | created by github.com/ollama/ollama/runner/llamarunner.Execute in goroutine 1 ollama | github.com/ollama/ollama/runner/llamarunner/runner.go:981 +0x4c5 ollama | ollama | goroutine 54 gp=0xc000455180 m=nil [IO wait]: ollama | runtime.gopark(0x559920aa2765?, 0xc000624d80?, 0x40?, 0x9a?, 0xb?) ollama | runtime/proc.go:435 +0xce fp=0xc000049948 sp=0xc000049928 pc=0x559920a17dce ollama | runtime.netpollblock(0x559920a3b598?, 0x209b1506?, 0x99?) ollama | runtime/netpoll.go:575 +0xf7 fp=0xc000049980 sp=0xc000049948 pc=0x5599209dd0f7 ollama | internal/poll.runtime_pollWait(0x7f8419253cc8, 0x72) ollama | runtime/netpoll.go:351 +0x85 fp=0xc0000499a0 sp=0xc000049980 pc=0x559920a16fe5 ollama | internal/poll.(*pollDesc).wait(0xc000624d80?, 0xc000666000?, 0x0) ollama | internal/poll/fd_poll_runtime.go:84 +0x27 fp=0xc0000499c8 sp=0xc0000499a0 pc=0x559920a9f167 ollama | internal/poll.(*pollDesc).waitRead(...) 
ollama | internal/poll/fd_poll_runtime.go:89 ollama | internal/poll.(*FD).Read(0xc000624d80, {0xc000666000, 0x1000, 0x1000}) ollama | internal/poll/fd_unix.go:165 +0x27a fp=0xc000049a60 sp=0xc0000499c8 pc=0x559920aa045a ollama | net.(*netFD).Read(0xc000624d80, {0xc000666000?, 0xc000049ad0?, 0x559920a9f625?}) ollama | net/fd_posix.go:55 +0x25 fp=0xc000049aa8 sp=0xc000049a60 pc=0x559920b155a5 ollama | net.(*conn).Read(0xc00011c970, {0xc000666000?, 0x0?, 0x0?}) ollama | net/net.go:194 +0x45 fp=0xc000049af0 sp=0xc000049aa8 pc=0x559920b23965 ollama | net/http.(*connReader).Read(0xc000223a70, {0xc000666000, 0x1000, 0x1000}) ollama | net/http/server.go:798 +0x159 fp=0xc000049b40 sp=0xc000049af0 pc=0x559920d10619 ollama | bufio.(*Reader).fill(0xc0000348a0) ollama | bufio/bufio.go:113 +0x103 fp=0xc000049b78 sp=0xc000049b40 pc=0x559920b3ba23 ollama | bufio.(*Reader).Peek(0xc0000348a0, 0x4) ollama | bufio/bufio.go:152 +0x53 fp=0xc000049b98 sp=0xc000049b78 pc=0x559920b3bb53 ollama | net/http.(*conn).serve(0xc00063a480, {0x55992203bea8, 0xc0002235f0}) ollama | net/http/server.go:2137 +0x785 fp=0xc000049fb8 sp=0xc000049b98 pc=0x559920d16405 ollama | net/http.(*Server).Serve.gowrap3() ollama | net/http/server.go:3454 +0x28 fp=0xc000049fe0 sp=0xc000049fb8 pc=0x559920d1bb68 ollama | runtime.goexit({}) ollama | runtime/asm_amd64.s:1700 +0x1 fp=0xc000049fe8 sp=0xc000049fe0 pc=0x559920a1fc61 ollama | created by net/http.(*Server).Serve in goroutine 1 ollama | net/http/server.go:3454 +0x485 ollama | ollama | rax 0x7f8461d62698 ollama | rbx 0x559932c29a90 ollama | rcx 0x0 ollama | rdx 0x55992a6d7100 ollama | rdi 0x7f8400e58e48 ollama | rsi 0x3 ollama | rbp 0x7f8461d62698 ollama | rsp 0x7fff73e013b0 ollama | r8 0x0 ollama | r9 0x0 ollama | r10 0x559932c2b670 ollama | r11 0x290 ollama | r12 0x0 ollama | r13 0x0 ollama | r14 0x18 ollama | r15 0x1 ollama | rip 0x7f8400ac0bf5 ollama | rflags 0x10246 ollama | cs 0x33 ollama | fs 0x0 ollama | gs 0x0 ollama | time=2026-01-21T09:17:10.784Z 
level=INFO source=sched.go:479 msg="Load failed" model=/root/.ollama/models/blobs/sha256-7fce9a253a25197d72ccc5898fe0789ac7fc7e17cbe7a62560b304163a894a92 error="llama runner process has terminated: exit status 2" ```

@rick-github commented on GitHub (Jan 21, 2026):

Please include the log before the crash.
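
A rough way to pull out the part of the server log that precedes the crash is sketched below in Python. It is only an illustration: the marker strings (`SIGSEGV`, `fatal error:`, `panic:`, `goroutine N`) are assumptions about how a Go runtime crash dump usually begins, and `log_before_crash` is a hypothetical helper, not part of Ollama.

```python
import re

# Assumed markers for the start of a Go runtime crash dump; adjust for your log.
CRASH_MARKERS = re.compile(r"(SIGSEGV|SIGABRT|fatal error:|panic:|\bgoroutine \d+ )")


def log_before_crash(log_text: str, context: int = 200) -> str:
    """Return up to `context` lines that precede the first crash marker.

    If no marker is found, return the last `context` lines of the log.
    """
    lines = log_text.splitlines()
    for i, line in enumerate(lines):
        if CRASH_MARKERS.search(line):
            # Keep only the lines leading up to the dump, not the dump itself.
            start = max(0, i - context)
            return "\n".join(lines[start:i])
    # No crash marker found: fall back to the tail of the log.
    return "\n".join(lines[max(0, len(lines) - context):])
```

Running it on a saved copy of the server log and posting the result keeps the report focused on the device discovery and model load messages, which are usually more useful than the goroutine dump.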


@esanchezm commented on GitHub (Jan 23, 2026):

I'm seeing the same issue. Here are my logs:

[GIN] 2026/01/23 - 09:57:48 | 200 |   28.693468ms |       127.0.0.1 | POST     "/api/show"
time=2026-01-23T09:57:49.010Z level=DEBUG source=runner.go:264 msg="refreshing free memory"
time=2026-01-23T09:57:49.010Z level=DEBUG source=runner.go:328 msg="unable to refresh all GPUs with existing runners, performing bootstrap discovery"
time=2026-01-23T09:57:49.010Z level=INFO source=server.go:429 msg="starting runner" cmd="/usr/bin/ollama runner --ollama-engine --port 35685"
time=2026-01-23T09:57:49.010Z level=DEBUG source=server.go:430 msg=subprocess LD_LIBRARY_PATH=/usr/lib/ollama:/usr/lib/ollama/rocm:/usr/local/nvidia/lib:/usr/local/nvidia/lib64 HSA_OVERRIDE_GFX_VERSION=11.0.0 OLLAMA_HOST=0.0.0.0:11434 PATH=/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin OLLAMA_DEBUG=1 OLLAMA_LIBRARY_PATH=/usr/lib/ollama:/usr/lib/ollama/rocm ROCR_VISIBLE_DEVICES=0
time=2026-01-23T09:57:49.694Z level=DEBUG source=runner.go:437 msg="bootstrap discovery took" duration=683.877455ms OLLAMA_LIBRARY_PATH="[/usr/lib/ollama /usr/lib/ollama/rocm]" extra_envs=map[ROCR_VISIBLE_DEVICES:0]
time=2026-01-23T09:57:49.694Z level=DEBUG source=runner.go:40 msg="overall device VRAM discovery took" duration=684.048359ms
time=2026-01-23T09:57:49.696Z level=WARN source=cpu_linux.go:130 msg="failed to parse CPU allowed micro secs" error="strconv.ParseInt: parsing \"max\": invalid syntax"
time=2026-01-23T09:57:49.696Z level=DEBUG source=sched.go:195 msg="updating default concurrency" OLLAMA_MAX_LOADED_MODELS=3 gpu_count=1
time=2026-01-23T09:57:49.699Z level=DEBUG source=ggml.go:298 msg="key with type not found" key=general.alignment default=32
time=2026-01-23T09:57:49.699Z level=DEBUG source=sched.go:220 msg="loading first model" model=/root/.ollama/models/blobs/sha256-2af3b81862c6be03c769683af18efdadb2c33f60ff32ab6f83e42c043d6c7816
llama_model_loader: loaded meta data with 23 key-value pairs and 201 tensors from /root/.ollama/models/blobs/sha256-2af3b81862c6be03c769683af18efdadb2c33f60ff32ab6f83e42c043d6c7816 (version GGUF V3 (latest))
llama_model_loader: Dumping metadata keys/values. Note: KV overrides do not apply in this output.
llama_model_loader: - kv   0:                       general.architecture str              = llama
llama_model_loader: - kv   1:                               general.name str              = TinyLlama
llama_model_loader: - kv   2:                       llama.context_length u32              = 2048
llama_model_loader: - kv   3:                     llama.embedding_length u32              = 2048
llama_model_loader: - kv   4:                          llama.block_count u32              = 22
llama_model_loader: - kv   5:                  llama.feed_forward_length u32              = 5632
llama_model_loader: - kv   6:                 llama.rope.dimension_count u32              = 64
llama_model_loader: - kv   7:                 llama.attention.head_count u32              = 32
llama_model_loader: - kv   8:              llama.attention.head_count_kv u32              = 4
llama_model_loader: - kv   9:     llama.attention.layer_norm_rms_epsilon f32              = 0.000010
llama_model_loader: - kv  10:                       llama.rope.freq_base f32              = 10000.000000
llama_model_loader: - kv  11:                          general.file_type u32              = 2
llama_model_loader: - kv  12:                       tokenizer.ggml.model str              = llama
llama_model_loader: - kv  13:                      tokenizer.ggml.tokens arr[str,32000]   = ["<unk>", "<s>", "</s>", "<0x00>", "<...
llama_model_loader: - kv  14:                      tokenizer.ggml.scores arr[f32,32000]   = [0.000000, 0.000000, 0.000000, 0.0000...
llama_model_loader: - kv  15:                  tokenizer.ggml.token_type arr[i32,32000]   = [2, 3, 3, 6, 6, 6, 6, 6, 6, 6, 6, 6, ...
llama_model_loader: - kv  16:                      tokenizer.ggml.merges arr[str,61249]   = ["▁ t", "e r", "i n", "▁ a", "e n...
llama_model_loader: - kv  17:                tokenizer.ggml.bos_token_id u32              = 1
llama_model_loader: - kv  18:                tokenizer.ggml.eos_token_id u32              = 2
llama_model_loader: - kv  19:            tokenizer.ggml.unknown_token_id u32              = 0
llama_model_loader: - kv  20:            tokenizer.ggml.padding_token_id u32              = 2
llama_model_loader: - kv  21:                    tokenizer.chat_template str              = {% for message in messages %}\n{% if m...
llama_model_loader: - kv  22:               general.quantization_version u32              = 2
llama_model_loader: - type  f32:   45 tensors
llama_model_loader: - type q4_0:  155 tensors
llama_model_loader: - type q6_K:    1 tensors
print_info: file format = GGUF V3 (latest)
print_info: file type   = Q4_0
print_info: file size   = 606.53 MiB (4.63 BPW) 
init_tokenizer: initializing tokenizer for type 1
load: control token:      2 '</s>' is not marked as EOG
load: control token:      1 '<s>' is not marked as EOG
load: special_eos_id is not in special_eog_ids - the tokenizer config may be incorrect
load: printing all EOG tokens:
load:   - 2 ('</s>')
load: special tokens cache size = 3
load: token to piece cache size = 0.1684 MB
print_info: arch             = llama
print_info: vocab_only       = 1
print_info: no_alloc         = 0
print_info: model type       = ?B
print_info: model params     = 1.10 B
print_info: general.name     = TinyLlama
print_info: vocab type       = SPM
print_info: n_vocab          = 32000
print_info: n_merges         = 0
print_info: BOS token        = 1 '<s>'
print_info: EOS token        = 2 '</s>'
print_info: UNK token        = 0 '<unk>'
print_info: PAD token        = 2 '</s>'
print_info: LF token         = 13 '<0x0A>'
print_info: EOG token        = 2 '</s>'
print_info: max token length = 48
llama_model_load: vocab only - skipping tensors
time=2026-01-23T09:57:49.741Z level=WARN source=server.go:167 msg="requested context size too large for model" num_ctx=4096 n_ctx_train=2048
time=2026-01-23T09:57:49.741Z level=INFO source=server.go:429 msg="starting runner" cmd="/usr/bin/ollama runner --model /root/.ollama/models/blobs/sha256-2af3b81862c6be03c769683af18efdadb2c33f60ff32ab6f83e42c043d6c7816 --port 34475"
time=2026-01-23T09:57:49.741Z level=DEBUG source=server.go:430 msg=subprocess LD_LIBRARY_PATH=/usr/lib/ollama:/usr/lib/ollama/rocm:/usr/local/nvidia/lib:/usr/local/nvidia/lib64 HSA_OVERRIDE_GFX_VERSION=11.0.0 OLLAMA_HOST=0.0.0.0:11434 PATH=/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin OLLAMA_DEBUG=1 OLLAMA_LIBRARY_PATH=/usr/lib/ollama:/usr/lib/ollama/rocm ROCR_VISIBLE_DEVICES=0
time=2026-01-23T09:57:49.741Z level=INFO source=sched.go:452 msg="system memory" total="93.9 GiB" free="93.7 GiB" free_swap="8.0 GiB"
time=2026-01-23T09:57:49.741Z level=INFO source=sched.go:459 msg="gpu memory" id=0 library=ROCm available="78.4 GiB" free="78.8 GiB" minimum="457.0 MiB" overhead="0 B"
time=2026-01-23T09:57:49.741Z level=INFO source=server.go:496 msg="loading model" "model layers"=23 requested=-1
time=2026-01-23T09:57:49.741Z level=DEBUG source=ggml.go:298 msg="key with type not found" key=llama.attention.key_length default=64
time=2026-01-23T09:57:49.741Z level=DEBUG source=ggml.go:298 msg="key with type not found" key=llama.attention.value_length default=64
time=2026-01-23T09:57:49.741Z level=DEBUG source=ggml.go:633 msg="default cache size estimate" "attention MiB"=44 "attention bytes"=46137344 "recurrent MiB"=0 "recurrent bytes"=0
time=2026-01-23T09:57:49.741Z level=DEBUG source=server.go:974 msg="available gpu" id=0 library=ROCm "available layer vram"="78.3 GiB" backoff=0.00 minimum="457.0 MiB" overhead="0 B" graph="0 B"
time=2026-01-23T09:57:49.741Z level=DEBUG source=server.go:974 msg="available gpu" id=0 library=ROCm "available layer vram"="78.2 GiB" backoff=0.00 minimum="457.0 MiB" overhead="0 B" graph="148.0 MiB"
time=2026-01-23T09:57:49.741Z level=DEBUG source=server.go:667 msg=memory estimate.ROCm0.ID=0 estimate.ROCm0.Weights="[24788992 24788992 24788992 24788992 24788992 24788992 24788992 24788992 24788992 24788992 24788992 24788992 24788992 24788992 24788992 24788992 24788992 24788992 24788992 24788992 24788992 24788992 53768192]" estimate.ROCm0.Cache="[2097152 2097152 2097152 2097152 2097152 2097152 2097152 2097152 2097152 2097152 2097152 2097152 2097152 2097152 2097152 2097152 2097152 2097152 2097152 2097152 2097152 2097152 0]" estimate.ROCm0.Graph=155191296
time=2026-01-23T09:57:49.741Z level=INFO source=device.go:240 msg="model weights" device=ROCm0 size="571.4 MiB"
time=2026-01-23T09:57:49.741Z level=INFO source=device.go:251 msg="kv cache" device=ROCm0 size="44.0 MiB"
time=2026-01-23T09:57:49.741Z level=INFO source=device.go:262 msg="compute graph" device=ROCm0 size="148.0 MiB"
time=2026-01-23T09:57:49.741Z level=INFO source=device.go:272 msg="total memory" size="763.4 MiB"
time=2026-01-23T09:57:49.749Z level=INFO source=runner.go:965 msg="starting go runner"
time=2026-01-23T09:57:49.749Z level=DEBUG source=ggml.go:94 msg="ggml backend load all from path" path=/usr/lib/ollama
load_backend: loaded CPU backend from /usr/lib/ollama/libggml-cpu-icelake.so
time=2026-01-23T09:57:49.753Z level=DEBUG source=ggml.go:94 msg="ggml backend load all from path" path=/usr/lib/ollama/rocm
/opt/amdgpu/share/libdrm/amdgpu.ids: No such file or directory
ggml_cuda_init: GGML_CUDA_FORCE_MMQ:    no
ggml_cuda_init: GGML_CUDA_FORCE_CUBLAS: no
ggml_cuda_init: found 1 ROCm devices:
  Device 0: AMD Radeon Graphics, gfx1100 (0x1100), VMM: no, Wave Size: 32, ID: 0
load_backend: loaded ROCm backend from /usr/lib/ollama/rocm/libggml-hip.so
time=2026-01-23T09:57:50.280Z level=INFO source=ggml.go:104 msg=system CPU.0.SSE3=1 CPU.0.SSSE3=1 CPU.0.AVX=1 CPU.0.AVX2=1 CPU.0.F16C=1 CPU.0.FMA=1 CPU.0.BMI2=1 CPU.0.AVX512=1 CPU.0.AVX512_VBMI=1 CPU.0.AVX512_VNNI=1 CPU.0.LLAMAFILE=1 CPU.1.LLAMAFILE=1 ROCm.0.NO_VMM=1 ROCm.0.PEER_MAX_BATCH_SIZE=128 compiler=cgo(gcc)
time=2026-01-23T09:57:50.280Z level=INFO source=runner.go:1001 msg="Server listening on 127.0.0.1:34475"
time=2026-01-23T09:57:50.288Z level=INFO source=runner.go:895 msg=load request="{Operation:commit LoraPath:[] Parallel:1 BatchSize:512 FlashAttention:Auto KvSize:2048 KvCacheType: NumThreads:16 GPULayers:23[ID:0 Layers:23(0..22)] MultiUserCache:false ProjectorPath: MainGPU:0 UseMmap:true}"
ggml_hip_get_device_memory searching for device 0000:c5:00.0
time=2026-01-23T09:57:50.288Z level=INFO source=server.go:1347 msg="waiting for llama runner to start responding"
ggml_backend_cuda_device_get_memory device 0000:c5:00.0 utilizing AMD specific memory reporting free: 84612214784 total: 84785856512
llama_model_load_from_file_impl: using device ROCm0 (AMD Radeon Graphics) (0000:c5:00.0) - 80692 MiB free
time=2026-01-23T09:57:50.289Z level=INFO source=server.go:1381 msg="waiting for server to become available" status="llm server loading model"
llama_model_loader: loaded meta data with 23 key-value pairs and 201 tensors from /root/.ollama/models/blobs/sha256-2af3b81862c6be03c769683af18efdadb2c33f60ff32ab6f83e42c043d6c7816 (version GGUF V3 (latest))
llama_model_loader: Dumping metadata keys/values. Note: KV overrides do not apply in this output.
llama_model_loader: - kv   0:                       general.architecture str              = llama
llama_model_loader: - kv   1:                               general.name str              = TinyLlama
llama_model_loader: - kv   2:                       llama.context_length u32              = 2048
llama_model_loader: - kv   3:                     llama.embedding_length u32              = 2048
llama_model_loader: - kv   4:                          llama.block_count u32              = 22
llama_model_loader: - kv   5:                  llama.feed_forward_length u32              = 5632
llama_model_loader: - kv   6:                 llama.rope.dimension_count u32              = 64
llama_model_loader: - kv   7:                 llama.attention.head_count u32              = 32
llama_model_loader: - kv   8:              llama.attention.head_count_kv u32              = 4
llama_model_loader: - kv   9:     llama.attention.layer_norm_rms_epsilon f32              = 0.000010
llama_model_loader: - kv  10:                       llama.rope.freq_base f32              = 10000.000000
llama_model_loader: - kv  11:                          general.file_type u32              = 2
llama_model_loader: - kv  12:                       tokenizer.ggml.model str              = llama
llama_model_loader: - kv  13:                      tokenizer.ggml.tokens arr[str,32000]   = ["<unk>", "<s>", "</s>", "<0x00>", "<...
llama_model_loader: - kv  14:                      tokenizer.ggml.scores arr[f32,32000]   = [0.000000, 0.000000, 0.000000, 0.0000...
llama_model_loader: - kv  15:                  tokenizer.ggml.token_type arr[i32,32000]   = [2, 3, 3, 6, 6, 6, 6, 6, 6, 6, 6, 6, ...
llama_model_loader: - kv  16:                      tokenizer.ggml.merges arr[str,61249]   = ["▁ t", "e r", "i n", "▁ a", "e n...
llama_model_loader: - kv  17:                tokenizer.ggml.bos_token_id u32              = 1
llama_model_loader: - kv  18:                tokenizer.ggml.eos_token_id u32              = 2
llama_model_loader: - kv  19:            tokenizer.ggml.unknown_token_id u32              = 0
llama_model_loader: - kv  20:            tokenizer.ggml.padding_token_id u32              = 2
llama_model_loader: - kv  21:                    tokenizer.chat_template str              = {% for message in messages %}\n{% if m...
llama_model_loader: - kv  22:               general.quantization_version u32              = 2
llama_model_loader: - type  f32:   45 tensors
llama_model_loader: - type q4_0:  155 tensors
llama_model_loader: - type q6_K:    1 tensors
print_info: file format = GGUF V3 (latest)
print_info: file type   = Q4_0
print_info: file size   = 606.53 MiB (4.63 BPW) 
init_tokenizer: initializing tokenizer for type 1
load: control token:      2 '</s>' is not marked as EOG
load: control token:      1 '<s>' is not marked as EOG
load: special_eos_id is not in special_eog_ids - the tokenizer config may be incorrect
load: printing all EOG tokens:
load:   - 2 ('</s>')
load: special tokens cache size = 3
load: token to piece cache size = 0.1684 MB
print_info: arch             = llama
print_info: vocab_only       = 0
print_info: no_alloc         = 0
print_info: n_ctx_train      = 2048
print_info: n_embd           = 2048
print_info: n_embd_inp       = 2048
print_info: n_layer          = 22
print_info: n_head           = 32
print_info: n_head_kv        = 4
print_info: n_rot            = 64
print_info: n_swa            = 0
print_info: is_swa_any       = 0
print_info: n_embd_head_k    = 64
print_info: n_embd_head_v    = 64
print_info: n_gqa            = 8
print_info: n_embd_k_gqa     = 256
print_info: n_embd_v_gqa     = 256
print_info: f_norm_eps       = 0.0e+00
print_info: f_norm_rms_eps   = 1.0e-05
print_info: f_clamp_kqv      = 0.0e+00
print_info: f_max_alibi_bias = 0.0e+00
print_info: f_logit_scale    = 0.0e+00
print_info: f_attn_scale     = 0.0e+00
print_info: n_ff             = 5632
print_info: n_expert         = 0
print_info: n_expert_used    = 0
print_info: n_expert_groups  = 0
print_info: n_group_used     = 0
print_info: causal attn      = 1
print_info: pooling type     = 0
print_info: rope type        = 0
print_info: rope scaling     = linear
print_info: freq_base_train  = 10000.0
print_info: freq_scale_train = 1
print_info: n_ctx_orig_yarn  = 2048
print_info: rope_yarn_log_mul= 0.0000
print_info: rope_finetuned   = unknown
print_info: model type       = 1B
print_info: model params     = 1.10 B
print_info: general.name     = TinyLlama
print_info: vocab type       = SPM
print_info: n_vocab          = 32000
print_info: n_merges         = 0
print_info: BOS token        = 1 '<s>'
print_info: EOS token        = 2 '</s>'
print_info: UNK token        = 0 '<unk>'
print_info: PAD token        = 2 '</s>'
print_info: LF token         = 13 '<0x0A>'
print_info: EOG token        = 2 '</s>'
print_info: max token length = 48
load_tensors: loading model tensors, this can take a while... (mmap = true)
load_tensors: layer   0 assigned to device ROCm0, is_swa = 0
load_tensors: layer   1 assigned to device ROCm0, is_swa = 0
load_tensors: layer   2 assigned to device ROCm0, is_swa = 0
load_tensors: layer   3 assigned to device ROCm0, is_swa = 0
load_tensors: layer   4 assigned to device ROCm0, is_swa = 0
load_tensors: layer   5 assigned to device ROCm0, is_swa = 0
load_tensors: layer   6 assigned to device ROCm0, is_swa = 0
load_tensors: layer   7 assigned to device ROCm0, is_swa = 0
load_tensors: layer   8 assigned to device ROCm0, is_swa = 0
load_tensors: layer   9 assigned to device ROCm0, is_swa = 0
load_tensors: layer  10 assigned to device ROCm0, is_swa = 0
load_tensors: layer  11 assigned to device ROCm0, is_swa = 0
load_tensors: layer  12 assigned to device ROCm0, is_swa = 0
load_tensors: layer  13 assigned to device ROCm0, is_swa = 0
load_tensors: layer  14 assigned to device ROCm0, is_swa = 0
load_tensors: layer  15 assigned to device ROCm0, is_swa = 0
load_tensors: layer  16 assigned to device ROCm0, is_swa = 0
load_tensors: layer  17 assigned to device ROCm0, is_swa = 0
load_tensors: layer  18 assigned to device ROCm0, is_swa = 0
load_tensors: layer  19 assigned to device ROCm0, is_swa = 0
load_tensors: layer  20 assigned to device ROCm0, is_swa = 0
load_tensors: layer  21 assigned to device ROCm0, is_swa = 0
load_tensors: layer  22 assigned to device ROCm0, is_swa = 0
create_tensor: loading tensor token_embd.weight
create_tensor: loading tensor output_norm.weight
create_tensor: loading tensor output.weight
create_tensor: loading tensor blk.0.attn_norm.weight
create_tensor: loading tensor blk.0.attn_q.weight
create_tensor: loading tensor blk.0.attn_k.weight
create_tensor: loading tensor blk.0.attn_v.weight
create_tensor: loading tensor blk.0.attn_output.weight
create_tensor: loading tensor blk.0.ffn_norm.weight
create_tensor: loading tensor blk.0.ffn_gate.weight
create_tensor: loading tensor blk.0.ffn_down.weight
create_tensor: loading tensor blk.0.ffn_up.weight
create_tensor: loading tensor blk.1.attn_norm.weight
create_tensor: loading tensor blk.1.attn_q.weight
create_tensor: loading tensor blk.1.attn_k.weight
create_tensor: loading tensor blk.1.attn_v.weight
create_tensor: loading tensor blk.1.attn_output.weight
create_tensor: loading tensor blk.1.ffn_norm.weight
create_tensor: loading tensor blk.1.ffn_gate.weight
create_tensor: loading tensor blk.1.ffn_down.weight
create_tensor: loading tensor blk.1.ffn_up.weight
create_tensor: loading tensor blk.2.attn_norm.weight
create_tensor: loading tensor blk.2.attn_q.weight
create_tensor: loading tensor blk.2.attn_k.weight
create_tensor: loading tensor blk.2.attn_v.weight
create_tensor: loading tensor blk.2.attn_output.weight
create_tensor: loading tensor blk.2.ffn_norm.weight
create_tensor: loading tensor blk.2.ffn_gate.weight
create_tensor: loading tensor blk.2.ffn_down.weight
create_tensor: loading tensor blk.2.ffn_up.weight
create_tensor: loading tensor blk.3.attn_norm.weight
create_tensor: loading tensor blk.3.attn_q.weight
create_tensor: loading tensor blk.3.attn_k.weight
create_tensor: loading tensor blk.3.attn_v.weight
create_tensor: loading tensor blk.3.attn_output.weight
create_tensor: loading tensor blk.3.ffn_norm.weight
create_tensor: loading tensor blk.3.ffn_gate.weight
create_tensor: loading tensor blk.3.ffn_down.weight
create_tensor: loading tensor blk.3.ffn_up.weight
create_tensor: loading tensor blk.4.attn_norm.weight
create_tensor: loading tensor blk.4.attn_q.weight
create_tensor: loading tensor blk.4.attn_k.weight
create_tensor: loading tensor blk.4.attn_v.weight
create_tensor: loading tensor blk.4.attn_output.weight
create_tensor: loading tensor blk.4.ffn_norm.weight
create_tensor: loading tensor blk.4.ffn_gate.weight
create_tensor: loading tensor blk.4.ffn_down.weight
create_tensor: loading tensor blk.4.ffn_up.weight
create_tensor: loading tensor blk.5.attn_norm.weight
create_tensor: loading tensor blk.5.attn_q.weight
create_tensor: loading tensor blk.5.attn_k.weight
create_tensor: loading tensor blk.5.attn_v.weight
create_tensor: loading tensor blk.5.attn_output.weight
create_tensor: loading tensor blk.5.ffn_norm.weight
create_tensor: loading tensor blk.5.ffn_gate.weight
create_tensor: loading tensor blk.5.ffn_down.weight
create_tensor: loading tensor blk.5.ffn_up.weight
create_tensor: loading tensor blk.6.attn_norm.weight
create_tensor: loading tensor blk.6.attn_q.weight
create_tensor: loading tensor blk.6.attn_k.weight
create_tensor: loading tensor blk.6.attn_v.weight
create_tensor: loading tensor blk.6.attn_output.weight
create_tensor: loading tensor blk.6.ffn_norm.weight
create_tensor: loading tensor blk.6.ffn_gate.weight
create_tensor: loading tensor blk.6.ffn_down.weight
create_tensor: loading tensor blk.6.ffn_up.weight
create_tensor: loading tensor blk.7.attn_norm.weight
create_tensor: loading tensor blk.7.attn_q.weight
create_tensor: loading tensor blk.7.attn_k.weight
create_tensor: loading tensor blk.7.attn_v.weight
create_tensor: loading tensor blk.7.attn_output.weight
create_tensor: loading tensor blk.7.ffn_norm.weight
create_tensor: loading tensor blk.7.ffn_gate.weight
create_tensor: loading tensor blk.7.ffn_down.weight
create_tensor: loading tensor blk.7.ffn_up.weight
create_tensor: loading tensor blk.8.attn_norm.weight
create_tensor: loading tensor blk.8.attn_q.weight
create_tensor: loading tensor blk.8.attn_k.weight
create_tensor: loading tensor blk.8.attn_v.weight
create_tensor: loading tensor blk.8.attn_output.weight
create_tensor: loading tensor blk.8.ffn_norm.weight
create_tensor: loading tensor blk.8.ffn_gate.weight
create_tensor: loading tensor blk.8.ffn_down.weight
create_tensor: loading tensor blk.8.ffn_up.weight
create_tensor: loading tensor blk.9.attn_norm.weight
create_tensor: loading tensor blk.9.attn_q.weight
create_tensor: loading tensor blk.9.attn_k.weight
create_tensor: loading tensor blk.9.attn_v.weight
create_tensor: loading tensor blk.9.attn_output.weight
create_tensor: loading tensor blk.9.ffn_norm.weight
create_tensor: loading tensor blk.9.ffn_gate.weight
create_tensor: loading tensor blk.9.ffn_down.weight
create_tensor: loading tensor blk.9.ffn_up.weight
create_tensor: loading tensor blk.10.attn_norm.weight
create_tensor: loading tensor blk.10.attn_q.weight
create_tensor: loading tensor blk.10.attn_k.weight
create_tensor: loading tensor blk.10.attn_v.weight
create_tensor: loading tensor blk.10.attn_output.weight
create_tensor: loading tensor blk.10.ffn_norm.weight
create_tensor: loading tensor blk.10.ffn_gate.weight
create_tensor: loading tensor blk.10.ffn_down.weight
create_tensor: loading tensor blk.10.ffn_up.weight
create_tensor: loading tensor blk.11.attn_norm.weight
create_tensor: loading tensor blk.11.attn_q.weight
create_tensor: loading tensor blk.11.attn_k.weight
create_tensor: loading tensor blk.11.attn_v.weight
create_tensor: loading tensor blk.11.attn_output.weight
create_tensor: loading tensor blk.11.ffn_norm.weight
create_tensor: loading tensor blk.11.ffn_gate.weight
create_tensor: loading tensor blk.11.ffn_down.weight
create_tensor: loading tensor blk.11.ffn_up.weight
create_tensor: loading tensor blk.12.attn_norm.weight
create_tensor: loading tensor blk.12.attn_q.weight
create_tensor: loading tensor blk.12.attn_k.weight
create_tensor: loading tensor blk.12.attn_v.weight
create_tensor: loading tensor blk.12.attn_output.weight
create_tensor: loading tensor blk.12.ffn_norm.weight
create_tensor: loading tensor blk.12.ffn_gate.weight
create_tensor: loading tensor blk.12.ffn_down.weight
create_tensor: loading tensor blk.12.ffn_up.weight
create_tensor: loading tensor blk.13.attn_norm.weight
create_tensor: loading tensor blk.13.attn_q.weight
create_tensor: loading tensor blk.13.attn_k.weight
create_tensor: loading tensor blk.13.attn_v.weight
create_tensor: loading tensor blk.13.attn_output.weight
create_tensor: loading tensor blk.13.ffn_norm.weight
create_tensor: loading tensor blk.13.ffn_gate.weight
create_tensor: loading tensor blk.13.ffn_down.weight
create_tensor: loading tensor blk.13.ffn_up.weight
create_tensor: loading tensor blk.14.attn_norm.weight
create_tensor: loading tensor blk.14.attn_q.weight
create_tensor: loading tensor blk.14.attn_k.weight
create_tensor: loading tensor blk.14.attn_v.weight
create_tensor: loading tensor blk.14.attn_output.weight
create_tensor: loading tensor blk.14.ffn_norm.weight
create_tensor: loading tensor blk.14.ffn_gate.weight
create_tensor: loading tensor blk.14.ffn_down.weight
create_tensor: loading tensor blk.14.ffn_up.weight
create_tensor: loading tensor blk.15.attn_norm.weight
create_tensor: loading tensor blk.15.attn_q.weight
create_tensor: loading tensor blk.15.attn_k.weight
create_tensor: loading tensor blk.15.attn_v.weight
create_tensor: loading tensor blk.15.attn_output.weight
create_tensor: loading tensor blk.15.ffn_norm.weight
create_tensor: loading tensor blk.15.ffn_gate.weight
create_tensor: loading tensor blk.15.ffn_down.weight
create_tensor: loading tensor blk.15.ffn_up.weight
create_tensor: loading tensor blk.16.attn_norm.weight
create_tensor: loading tensor blk.16.attn_q.weight
create_tensor: loading tensor blk.16.attn_k.weight
create_tensor: loading tensor blk.16.attn_v.weight
create_tensor: loading tensor blk.16.attn_output.weight
create_tensor: loading tensor blk.16.ffn_norm.weight
create_tensor: loading tensor blk.16.ffn_gate.weight
create_tensor: loading tensor blk.16.ffn_down.weight
create_tensor: loading tensor blk.16.ffn_up.weight
create_tensor: loading tensor blk.17.attn_norm.weight
create_tensor: loading tensor blk.17.attn_q.weight
create_tensor: loading tensor blk.17.attn_k.weight
create_tensor: loading tensor blk.17.attn_v.weight
create_tensor: loading tensor blk.17.attn_output.weight
create_tensor: loading tensor blk.17.ffn_norm.weight
create_tensor: loading tensor blk.17.ffn_gate.weight
create_tensor: loading tensor blk.17.ffn_down.weight
create_tensor: loading tensor blk.17.ffn_up.weight
create_tensor: loading tensor blk.18.attn_norm.weight
create_tensor: loading tensor blk.18.attn_q.weight
create_tensor: loading tensor blk.18.attn_k.weight
create_tensor: loading tensor blk.18.attn_v.weight
create_tensor: loading tensor blk.18.attn_output.weight
create_tensor: loading tensor blk.18.ffn_norm.weight
create_tensor: loading tensor blk.18.ffn_gate.weight
create_tensor: loading tensor blk.18.ffn_down.weight
create_tensor: loading tensor blk.18.ffn_up.weight
create_tensor: loading tensor blk.19.attn_norm.weight
create_tensor: loading tensor blk.19.attn_q.weight
create_tensor: loading tensor blk.19.attn_k.weight
create_tensor: loading tensor blk.19.attn_v.weight
create_tensor: loading tensor blk.19.attn_output.weight
create_tensor: loading tensor blk.19.ffn_norm.weight
create_tensor: loading tensor blk.19.ffn_gate.weight
create_tensor: loading tensor blk.19.ffn_down.weight
create_tensor: loading tensor blk.19.ffn_up.weight
create_tensor: loading tensor blk.20.attn_norm.weight
create_tensor: loading tensor blk.20.attn_q.weight
create_tensor: loading tensor blk.20.attn_k.weight
create_tensor: loading tensor blk.20.attn_v.weight
create_tensor: loading tensor blk.20.attn_output.weight
create_tensor: loading tensor blk.20.ffn_norm.weight
create_tensor: loading tensor blk.20.ffn_gate.weight
create_tensor: loading tensor blk.20.ffn_down.weight
create_tensor: loading tensor blk.20.ffn_up.weight
create_tensor: loading tensor blk.21.attn_norm.weight
create_tensor: loading tensor blk.21.attn_q.weight
create_tensor: loading tensor blk.21.attn_k.weight
create_tensor: loading tensor blk.21.attn_v.weight
create_tensor: loading tensor blk.21.attn_output.weight
create_tensor: loading tensor blk.21.ffn_norm.weight
create_tensor: loading tensor blk.21.ffn_gate.weight
create_tensor: loading tensor blk.21.ffn_down.weight
create_tensor: loading tensor blk.21.ffn_up.weight
load_tensors: tensor 'token_embd.weight' (q4_0) (and 0 others) cannot be used with preferred buffer type ROCm_Host, using CPU instead
load_tensors: offloading 22 repeating layers to GPU
load_tensors: offloading output layer to GPU
load_tensors: offloaded 23/23 layers to GPU
load_tensors:   CPU_Mapped model buffer size =    35.16 MiB
load_tensors:        ROCm0 model buffer size =   571.37 MiB
SIGSEGV: segmentation violation
PC=0x7fab758c0bf5 m=0 sigcode=1 addr=0x18
signal arrived during cgo execution

goroutine 29 gp=0xc000602540 m=0 mp=0x5579c5ee5080 [syscall]:
runtime.cgocall(0x5579c4d070d2, 0xc000487b58)
	runtime/cgocall.go:167 +0x4b fp=0xc000487b30 sp=0xc000487af8 pc=0x5579c3f038ab
github.com/ollama/ollama/llama._Cfunc_llama_model_load_from_file(0x5579e14c1d20, {0xc0003ee890, 0x0, 0x17, 0x1, 0x0, 0xc0003ee5b8, 0x5579c4d06315, 0xc0003ee888, 0x0, ...})
	_cgo_gotypes.go:910 +0x4b fp=0xc000487b58 sp=0xc000487b30 pc=0x5579c4304d6b
github.com/ollama/ollama/llama.LoadModelFromFile.func1(...)
	github.com/ollama/ollama/llama/llama.go:308
github.com/ollama/ollama/llama.LoadModelFromFile({0x7ffd8fb14db7, 0x62}, {{0xc0003ee5c0, 0x1, 0x1}, 0x17, 0x0, 0x1, {0xc0003ee5b8, 0x1, ...}, ...})
	github.com/ollama/ollama/llama/llama.go:308 +0x56d fp=0xc000487da0 sp=0xc000487b58 pc=0x5579c43080ed
github.com/ollama/ollama/runner/llamarunner.(*Server).loadModel(0xc000270a00, {{0xc0003ee5c0, 0x1, 0x1}, 0x17, 0x0, 0x1, {0xc0003ee5b8, 0x1, 0x2}, ...}, ...)
	github.com/ollama/ollama/runner/llamarunner/runner.go:841 +0x9e fp=0xc000487ee8 sp=0xc000487da0 pc=0x5579c43c64be
github.com/ollama/ollama/runner/llamarunner.(*Server).load.gowrap2()
	github.com/ollama/ollama/runner/llamarunner/runner.go:934 +0x114 fp=0xc000487fe0 sp=0xc000487ee8 pc=0x5579c43c77b4
runtime.goexit({})
	runtime/asm_amd64.s:1700 +0x1 fp=0xc000487fe8 sp=0xc000487fe0 pc=0x5579c3f0ebc1
created by github.com/ollama/ollama/runner/llamarunner.(*Server).load in goroutine 50
	github.com/ollama/ollama/runner/llamarunner/runner.go:934 +0x889

goroutine 1 gp=0xc000002380 m=nil [IO wait]:
runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?)
	runtime/proc.go:435 +0xce fp=0xc0004af790 sp=0xc0004af770 pc=0x5579c3f06d2e
runtime.netpollblock(0xc0004af7e0?, 0xc3ea0466?, 0x79?)
	runtime/netpoll.go:575 +0xf7 fp=0xc0004af7c8 sp=0xc0004af790 pc=0x5579c3ecc057
internal/poll.runtime_pollWait(0x7fab847c6eb0, 0x72)
	runtime/netpoll.go:351 +0x85 fp=0xc0004af7e8 sp=0xc0004af7c8 pc=0x5579c3f05f45
internal/poll.(*pollDesc).wait(0xc000128780?, 0x900000036?, 0x0)
	internal/poll/fd_poll_runtime.go:84 +0x27 fp=0xc0004af810 sp=0xc0004af7e8 pc=0x5579c3f8e0c7
internal/poll.(*pollDesc).waitRead(...)
	internal/poll/fd_poll_runtime.go:89
internal/poll.(*FD).Accept(0xc000128780)
	internal/poll/fd_unix.go:620 +0x295 fp=0xc0004af8b8 sp=0xc0004af810 pc=0x5579c3f93495
net.(*netFD).accept(0xc000128780)
	net/fd_unix.go:172 +0x29 fp=0xc0004af970 sp=0xc0004af8b8 pc=0x5579c4006549
net.(*TCPListener).accept(0xc000319240)
	net/tcpsock_posix.go:159 +0x1b fp=0xc0004af9c0 sp=0xc0004af970 pc=0x5579c401c45b
net.(*TCPListener).Accept(0xc000319240)
	net/tcpsock.go:380 +0x30 fp=0xc0004af9f0 sp=0xc0004af9c0 pc=0x5579c401b310
net/http.(*onceCloseListener).Accept(0xc00026e480?)
	<autogenerated>:1 +0x24 fp=0xc0004afa08 sp=0xc0004af9f0 pc=0x5579c42331c4
net/http.(*Server).Serve(0xc000051900, {0x5579c55abde0, 0xc000319240})
	net/http/server.go:3424 +0x30c fp=0xc0004afb38 sp=0xc0004afa08 pc=0x5579c420aa8c
github.com/ollama/ollama/runner/llamarunner.Execute({0xc000034260, 0x4, 0x4})
	github.com/ollama/ollama/runner/llamarunner/runner.go:1002 +0x8f5 fp=0xc0004afd08 sp=0xc0004afb38 pc=0x5579c43c8175
github.com/ollama/ollama/runner.Execute({0xc000034250?, 0x0?, 0x0?})
	github.com/ollama/ollama/runner/runner.go:30 +0x130 fp=0xc0004afd30 sp=0xc0004afd08 pc=0x5579c44b55d0
github.com/ollama/ollama/cmd.NewCLI.func3(0xc000051600?, {0x5579c503a0df?, 0x4?, 0x5579c503a0e3?})
	github.com/ollama/ollama/cmd/cmd.go:1961 +0x45 fp=0xc0004afd58 sp=0xc0004afd30 pc=0x5579c4c86025
github.com/spf13/cobra.(*Command).execute(0xc000275508, {0xc000319040, 0x4, 0x4})
	github.com/spf13/cobra@v1.7.0/command.go:940 +0x85c fp=0xc0004afe78 sp=0xc0004afd58 pc=0x5579c40804bc
github.com/spf13/cobra.(*Command).ExecuteC(0xc0004e4908)
	github.com/spf13/cobra@v1.7.0/command.go:1068 +0x3a5 fp=0xc0004aff30 sp=0xc0004afe78 pc=0x5579c4080d05
github.com/spf13/cobra.(*Command).Execute(...)
	github.com/spf13/cobra@v1.7.0/command.go:992
github.com/spf13/cobra.(*Command).ExecuteContext(...)
	github.com/spf13/cobra@v1.7.0/command.go:985
main.main()
	github.com/ollama/ollama/main.go:12 +0x4d fp=0xc0004aff50 sp=0xc0004aff30 pc=0x5579c4c86b0d
runtime.main()
	runtime/proc.go:283 +0x29d fp=0xc0004affe0 sp=0xc0004aff50 pc=0x5579c3ed36dd
runtime.goexit({})
	runtime/asm_amd64.s:1700 +0x1 fp=0xc0004affe8 sp=0xc0004affe0 pc=0x5579c3f0ebc1

goroutine 2 gp=0xc000002e00 m=nil [force gc (idle)]:
runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?)
	runtime/proc.go:435 +0xce fp=0xc0000aafa8 sp=0xc0000aaf88 pc=0x5579c3f06d2e
runtime.goparkunlock(...)
	runtime/proc.go:441
runtime.forcegchelper()
	runtime/proc.go:348 +0xb8 fp=0xc0000aafe0 sp=0xc0000aafa8 pc=0x5579c3ed3a18
runtime.goexit({})
	runtime/asm_amd64.s:1700 +0x1 fp=0xc0000aafe8 sp=0xc0000aafe0 pc=0x5579c3f0ebc1
created by runtime.init.7 in goroutine 1
	runtime/proc.go:336 +0x1a

goroutine 3 gp=0xc000003340 m=nil [GC sweep wait]:
runtime.gopark(0x1?, 0x0?, 0x0?, 0x0?, 0x0?)
	runtime/proc.go:435 +0xce fp=0xc0000ab780 sp=0xc0000ab760 pc=0x5579c3f06d2e
runtime.goparkunlock(...)
	runtime/proc.go:441
runtime.bgsweep(0xc0000d6000)
	runtime/mgcsweep.go:316 +0xdf fp=0xc0000ab7c8 sp=0xc0000ab780 pc=0x5579c3ebe1bf
runtime.gcenable.gowrap1()
	runtime/mgc.go:204 +0x25 fp=0xc0000ab7e0 sp=0xc0000ab7c8 pc=0x5579c3eb25a5
runtime.goexit({})
	runtime/asm_amd64.s:1700 +0x1 fp=0xc0000ab7e8 sp=0xc0000ab7e0 pc=0x5579c3f0ebc1
created by runtime.gcenable in goroutine 1
	runtime/mgc.go:204 +0x66

goroutine 4 gp=0xc000003500 m=nil [GC scavenge wait]:
runtime.gopark(0x10000?, 0x5579c5221730?, 0x0?, 0x0?, 0x0?)
	runtime/proc.go:435 +0xce fp=0xc0000abf78 sp=0xc0000abf58 pc=0x5579c3f06d2e
runtime.goparkunlock(...)
	runtime/proc.go:441
runtime.(*scavengerState).park(0x5579c5ee2260)
	runtime/mgcscavenge.go:425 +0x49 fp=0xc0000abfa8 sp=0xc0000abf78 pc=0x5579c3ebbc09
runtime.bgscavenge(0xc0000d6000)
	runtime/mgcscavenge.go:658 +0x59 fp=0xc0000abfc8 sp=0xc0000abfa8 pc=0x5579c3ebc199
runtime.gcenable.gowrap2()
	runtime/mgc.go:205 +0x25 fp=0xc0000abfe0 sp=0xc0000abfc8 pc=0x5579c3eb2545
runtime.goexit({})
	runtime/asm_amd64.s:1700 +0x1 fp=0xc0000abfe8 sp=0xc0000abfe0 pc=0x5579c3f0ebc1
created by runtime.gcenable in goroutine 1
	runtime/mgc.go:205 +0xa5

goroutine 5 gp=0xc000003dc0 m=nil [finalizer wait]:
runtime.gopark(0x1b8?, 0x5579c3ed5c89?, 0x1?, 0x23?, 0xc0000aa688?)
	runtime/proc.go:435 +0xce fp=0xc0000aa630 sp=0xc0000aa610 pc=0x5579c3f06d2e
runtime.runfinq()
	runtime/mfinal.go:196 +0x107 fp=0xc0000aa7e0 sp=0xc0000aa630 pc=0x5579c3eb1567
runtime.goexit({})
	runtime/asm_amd64.s:1700 +0x1 fp=0xc0000aa7e8 sp=0xc0000aa7e0 pc=0x5579c3f0ebc1
created by runtime.createfing in goroutine 1
	runtime/mfinal.go:166 +0x3d

goroutine 6 gp=0xc00021c8c0 m=nil [chan receive]:
runtime.gopark(0xc0001c1860?, 0xc0000110e0?, 0x60?, 0xc7?, 0x5579c3fed188?)
	runtime/proc.go:435 +0xce fp=0xc0000ac718 sp=0xc0000ac6f8 pc=0x5579c3f06d2e
runtime.chanrecv(0xc000038380, 0x0, 0x1)
	runtime/chan.go:664 +0x445 fp=0xc0000ac790 sp=0xc0000ac718 pc=0x5579c3ea3045
runtime.chanrecv1(0x0?, 0x0?)
	runtime/chan.go:506 +0x12 fp=0xc0000ac7b8 sp=0xc0000ac790 pc=0x5579c3ea2bd2
runtime.unique_runtime_registerUniqueMapCleanup.func2(...)
	runtime/mgc.go:1796
runtime.unique_runtime_registerUniqueMapCleanup.gowrap1()
	runtime/mgc.go:1799 +0x2f fp=0xc0000ac7e0 sp=0xc0000ac7b8 pc=0x5579c3eb574f
runtime.goexit({})
	runtime/asm_amd64.s:1700 +0x1 fp=0xc0000ac7e8 sp=0xc0000ac7e0 pc=0x5579c3f0ebc1
created by unique.runtime_registerUniqueMapCleanup in goroutine 1
	runtime/mgc.go:1794 +0x85

goroutine 7 gp=0xc00021cc40 m=nil [GC worker (idle)]:
runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?)
	runtime/proc.go:435 +0xce fp=0xc0000acf38 sp=0xc0000acf18 pc=0x5579c3f06d2e
runtime.gcBgMarkWorker(0xc0000397a0)
	runtime/mgc.go:1423 +0xe9 fp=0xc0000acfc8 sp=0xc0000acf38 pc=0x5579c3eb4a69
runtime.gcBgMarkStartWorkers.gowrap1()
	runtime/mgc.go:1339 +0x25 fp=0xc0000acfe0 sp=0xc0000acfc8 pc=0x5579c3eb4945
runtime.goexit({})
	runtime/asm_amd64.s:1700 +0x1 fp=0xc0000acfe8 sp=0xc0000acfe0 pc=0x5579c3f0ebc1
created by runtime.gcBgMarkStartWorkers in goroutine 1
	runtime/mgc.go:1339 +0x105

goroutine 8 gp=0xc00021ce00 m=nil [GC worker (idle)]:
runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?)
	runtime/proc.go:435 +0xce fp=0xc0000ad738 sp=0xc0000ad718 pc=0x5579c3f06d2e
runtime.gcBgMarkWorker(0xc0000397a0)
	runtime/mgc.go:1423 +0xe9 fp=0xc0000ad7c8 sp=0xc0000ad738 pc=0x5579c3eb4a69
runtime.gcBgMarkStartWorkers.gowrap1()
	runtime/mgc.go:1339 +0x25 fp=0xc0000ad7e0 sp=0xc0000ad7c8 pc=0x5579c3eb4945
runtime.goexit({})
	runtime/asm_amd64.s:1700 +0x1 fp=0xc0000ad7e8 sp=0xc0000ad7e0 pc=0x5579c3f0ebc1
created by runtime.gcBgMarkStartWorkers in goroutine 1
	runtime/mgc.go:1339 +0x105

goroutine 9 gp=0xc00021cfc0 m=nil [GC worker (idle)]:
runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?)
	runtime/proc.go:435 +0xce fp=0xc0000adf38 sp=0xc0000adf18 pc=0x5579c3f06d2e
runtime.gcBgMarkWorker(0xc0000397a0)
	runtime/mgc.go:1423 +0xe9 fp=0xc0000adfc8 sp=0xc0000adf38 pc=0x5579c3eb4a69
runtime.gcBgMarkStartWorkers.gowrap1()
	runtime/mgc.go:1339 +0x25 fp=0xc0000adfe0 sp=0xc0000adfc8 pc=0x5579c3eb4945
runtime.goexit({})
	runtime/asm_amd64.s:1700 +0x1 fp=0xc0000adfe8 sp=0xc0000adfe0 pc=0x5579c3f0ebc1
created by runtime.gcBgMarkStartWorkers in goroutine 1
	runtime/mgc.go:1339 +0x105

goroutine 10 gp=0xc00021d180 m=nil [GC worker (idle)]:
runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?)
	runtime/proc.go:435 +0xce fp=0xc0000a6738 sp=0xc0000a6718 pc=0x5579c3f06d2e
runtime.gcBgMarkWorker(0xc0000397a0)
	runtime/mgc.go:1423 +0xe9 fp=0xc0000a67c8 sp=0xc0000a6738 pc=0x5579c3eb4a69
runtime.gcBgMarkStartWorkers.gowrap1()
	runtime/mgc.go:1339 +0x25 fp=0xc0000a67e0 sp=0xc0000a67c8 pc=0x5579c3eb4945
runtime.goexit({})
	runtime/asm_amd64.s:1700 +0x1 fp=0xc0000a67e8 sp=0xc0000a67e0 pc=0x5579c3f0ebc1
created by runtime.gcBgMarkStartWorkers in goroutine 1
	runtime/mgc.go:1339 +0x105

goroutine 18 gp=0xc000504000 m=nil [GC worker (idle)]:
runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?)
	runtime/proc.go:435 +0xce fp=0xc00050a738 sp=0xc00050a718 pc=0x5579c3f06d2e
runtime.gcBgMarkWorker(0xc0000397a0)
	runtime/mgc.go:1423 +0xe9 fp=0xc00050a7c8 sp=0xc00050a738 pc=0x5579c3eb4a69
runtime.gcBgMarkStartWorkers.gowrap1()
	runtime/mgc.go:1339 +0x25 fp=0xc00050a7e0 sp=0xc00050a7c8 pc=0x5579c3eb4945
runtime.goexit({})
	runtime/asm_amd64.s:1700 +0x1 fp=0xc00050a7e8 sp=0xc00050a7e0 pc=0x5579c3f0ebc1
created by runtime.gcBgMarkStartWorkers in goroutine 1
	runtime/mgc.go:1339 +0x105

goroutine 19 gp=0xc0005041c0 m=nil [GC worker (idle)]:
runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?)
	runtime/proc.go:435 +0xce fp=0xc00050af38 sp=0xc00050af18 pc=0x5579c3f06d2e
runtime.gcBgMarkWorker(0xc0000397a0)
	runtime/mgc.go:1423 +0xe9 fp=0xc00050afc8 sp=0xc00050af38 pc=0x5579c3eb4a69
runtime.gcBgMarkStartWorkers.gowrap1()
	runtime/mgc.go:1339 +0x25 fp=0xc00050afe0 sp=0xc00050afc8 pc=0x5579c3eb4945
runtime.goexit({})
	runtime/asm_amd64.s:1700 +0x1 fp=0xc00050afe8 sp=0xc00050afe0 pc=0x5579c3f0ebc1
created by runtime.gcBgMarkStartWorkers in goroutine 1
	runtime/mgc.go:1339 +0x105

goroutine 20 gp=0xc000504380 m=nil [GC worker (idle)]:
runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?)
	runtime/proc.go:435 +0xce fp=0xc00050b738 sp=0xc00050b718 pc=0x5579c3f06d2e
runtime.gcBgMarkWorker(0xc0000397a0)
	runtime/mgc.go:1423 +0xe9 fp=0xc00050b7c8 sp=0xc00050b738 pc=0x5579c3eb4a69
runtime.gcBgMarkStartWorkers.gowrap1()
	runtime/mgc.go:1339 +0x25 fp=0xc00050b7e0 sp=0xc00050b7c8 pc=0x5579c3eb4945
runtime.goexit({})
	runtime/asm_amd64.s:1700 +0x1 fp=0xc00050b7e8 sp=0xc00050b7e0 pc=0x5579c3f0ebc1
created by runtime.gcBgMarkStartWorkers in goroutine 1
	runtime/mgc.go:1339 +0x105

goroutine 21 gp=0xc000504540 m=nil [GC worker (idle)]:
runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?)
	runtime/proc.go:435 +0xce fp=0xc00050bf38 sp=0xc00050bf18 pc=0x5579c3f06d2e
runtime.gcBgMarkWorker(0xc0000397a0)
	runtime/mgc.go:1423 +0xe9 fp=0xc00050bfc8 sp=0xc00050bf38 pc=0x5579c3eb4a69
runtime.gcBgMarkStartWorkers.gowrap1()
	runtime/mgc.go:1339 +0x25 fp=0xc00050bfe0 sp=0xc00050bfc8 pc=0x5579c3eb4945
runtime.goexit({})
	runtime/asm_amd64.s:1700 +0x1 fp=0xc00050bfe8 sp=0xc00050bfe0 pc=0x5579c3f0ebc1
created by runtime.gcBgMarkStartWorkers in goroutine 1
	runtime/mgc.go:1339 +0x105

goroutine 22 gp=0xc000504700 m=nil [GC worker (idle)]:
runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?)
	runtime/proc.go:435 +0xce fp=0xc00050c738 sp=0xc00050c718 pc=0x5579c3f06d2e
runtime.gcBgMarkWorker(0xc0000397a0)
	runtime/mgc.go:1423 +0xe9 fp=0xc00050c7c8 sp=0xc00050c738 pc=0x5579c3eb4a69
runtime.gcBgMarkStartWorkers.gowrap1()
	runtime/mgc.go:1339 +0x25 fp=0xc00050c7e0 sp=0xc00050c7c8 pc=0x5579c3eb4945
runtime.goexit({})
	runtime/asm_amd64.s:1700 +0x1 fp=0xc00050c7e8 sp=0xc00050c7e0 pc=0x5579c3f0ebc1
created by runtime.gcBgMarkStartWorkers in goroutine 1
	runtime/mgc.go:1339 +0x105

goroutine 34 gp=0xc000102380 m=nil [GC worker (idle)]:
runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?)
	runtime/proc.go:435 +0xce fp=0xc000506738 sp=0xc000506718 pc=0x5579c3f06d2e
runtime.gcBgMarkWorker(0xc0000397a0)
	runtime/mgc.go:1423 +0xe9 fp=0xc0005067c8 sp=0xc000506738 pc=0x5579c3eb4a69
runtime.gcBgMarkStartWorkers.gowrap1()
	runtime/mgc.go:1339 +0x25 fp=0xc0005067e0 sp=0xc0005067c8 pc=0x5579c3eb4945
runtime.goexit({})
	runtime/asm_amd64.s:1700 +0x1 fp=0xc0005067e8 sp=0xc0005067e0 pc=0x5579c3f0ebc1
created by runtime.gcBgMarkStartWorkers in goroutine 1
	runtime/mgc.go:1339 +0x105

goroutine 11 gp=0xc00021d340 m=nil [GC worker (idle)]:
runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?)
	runtime/proc.go:435 +0xce fp=0xc0000a6f38 sp=0xc0000a6f18 pc=0x5579c3f06d2e
runtime.gcBgMarkWorker(0xc0000397a0)
	runtime/mgc.go:1423 +0xe9 fp=0xc0000a6fc8 sp=0xc0000a6f38 pc=0x5579c3eb4a69
runtime.gcBgMarkStartWorkers.gowrap1()
	runtime/mgc.go:1339 +0x25 fp=0xc0000a6fe0 sp=0xc0000a6fc8 pc=0x5579c3eb4945
runtime.goexit({})
	runtime/asm_amd64.s:1700 +0x1 fp=0xc0000a6fe8 sp=0xc0000a6fe0 pc=0x5579c3f0ebc1
created by runtime.gcBgMarkStartWorkers in goroutine 1
	runtime/mgc.go:1339 +0x105

goroutine 35 gp=0xc000102540 m=nil [GC worker (idle)]:
runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?)
	runtime/proc.go:435 +0xce fp=0xc000506f38 sp=0xc000506f18 pc=0x5579c3f06d2e
runtime.gcBgMarkWorker(0xc0000397a0)
	runtime/mgc.go:1423 +0xe9 fp=0xc000506fc8 sp=0xc000506f38 pc=0x5579c3eb4a69
runtime.gcBgMarkStartWorkers.gowrap1()
	runtime/mgc.go:1339 +0x25 fp=0xc000506fe0 sp=0xc000506fc8 pc=0x5579c3eb4945
runtime.goexit({})
	runtime/asm_amd64.s:1700 +0x1 fp=0xc000506fe8 sp=0xc000506fe0 pc=0x5579c3f0ebc1
created by runtime.gcBgMarkStartWorkers in goroutine 1
	runtime/mgc.go:1339 +0x105

goroutine 12 gp=0xc00021d500 m=nil [GC worker (idle)]:
runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?)
	runtime/proc.go:435 +0xce fp=0xc0000a7738 sp=0xc0000a7718 pc=0x5579c3f06d2e
runtime.gcBgMarkWorker(0xc0000397a0)
	runtime/mgc.go:1423 +0xe9 fp=0xc0000a77c8 sp=0xc0000a7738 pc=0x5579c3eb4a69
runtime.gcBgMarkStartWorkers.gowrap1()
	runtime/mgc.go:1339 +0x25 fp=0xc0000a77e0 sp=0xc0000a77c8 pc=0x5579c3eb4945
runtime.goexit({})
	runtime/asm_amd64.s:1700 +0x1 fp=0xc0000a77e8 sp=0xc0000a77e0 pc=0x5579c3f0ebc1
created by runtime.gcBgMarkStartWorkers in goroutine 1
	runtime/mgc.go:1339 +0x105

goroutine 36 gp=0xc000102700 m=nil [GC worker (idle)]:
runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?)
	runtime/proc.go:435 +0xce fp=0xc000507738 sp=0xc000507718 pc=0x5579c3f06d2e
runtime.gcBgMarkWorker(0xc0000397a0)
	runtime/mgc.go:1423 +0xe9 fp=0xc0005077c8 sp=0xc000507738 pc=0x5579c3eb4a69
runtime.gcBgMarkStartWorkers.gowrap1()
	runtime/mgc.go:1339 +0x25 fp=0xc0005077e0 sp=0xc0005077c8 pc=0x5579c3eb4945
runtime.goexit({})
	runtime/asm_amd64.s:1700 +0x1 fp=0xc0005077e8 sp=0xc0005077e0 pc=0x5579c3f0ebc1
created by runtime.gcBgMarkStartWorkers in goroutine 1
	runtime/mgc.go:1339 +0x105

goroutine 23 gp=0xc0005048c0 m=nil [GC worker (idle)]:
runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?)
	runtime/proc.go:435 +0xce fp=0xc00050cf38 sp=0xc00050cf18 pc=0x5579c3f06d2e
runtime.gcBgMarkWorker(0xc0000397a0)
	runtime/mgc.go:1423 +0xe9 fp=0xc00050cfc8 sp=0xc00050cf38 pc=0x5579c3eb4a69
runtime.gcBgMarkStartWorkers.gowrap1()
	runtime/mgc.go:1339 +0x25 fp=0xc00050cfe0 sp=0xc00050cfc8 pc=0x5579c3eb4945
runtime.goexit({})
	runtime/asm_amd64.s:1700 +0x1 fp=0xc00050cfe8 sp=0xc00050cfe0 pc=0x5579c3f0ebc1
created by runtime.gcBgMarkStartWorkers in goroutine 1
	runtime/mgc.go:1339 +0x105

goroutine 24 gp=0xc000504a80 m=nil [GC worker (idle)]:
runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?)
	runtime/proc.go:435 +0xce fp=0xc00050d738 sp=0xc00050d718 pc=0x5579c3f06d2e
runtime.gcBgMarkWorker(0xc0000397a0)
	runtime/mgc.go:1423 +0xe9 fp=0xc00050d7c8 sp=0xc00050d738 pc=0x5579c3eb4a69
runtime.gcBgMarkStartWorkers.gowrap1()
	runtime/mgc.go:1339 +0x25 fp=0xc00050d7e0 sp=0xc00050d7c8 pc=0x5579c3eb4945
runtime.goexit({})
	runtime/asm_amd64.s:1700 +0x1 fp=0xc00050d7e8 sp=0xc00050d7e0 pc=0x5579c3f0ebc1
created by runtime.gcBgMarkStartWorkers in goroutine 1
	runtime/mgc.go:1339 +0x105

goroutine 37 gp=0xc0001028c0 m=nil [GC worker (idle)]:
runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?)
	runtime/proc.go:435 +0xce fp=0xc000507f38 sp=0xc000507f18 pc=0x5579c3f06d2e
runtime.gcBgMarkWorker(0xc0000397a0)
	runtime/mgc.go:1423 +0xe9 fp=0xc000507fc8 sp=0xc000507f38 pc=0x5579c3eb4a69
runtime.gcBgMarkStartWorkers.gowrap1()
	runtime/mgc.go:1339 +0x25 fp=0xc000507fe0 sp=0xc000507fc8 pc=0x5579c3eb4945
runtime.goexit({})
	runtime/asm_amd64.s:1700 +0x1 fp=0xc000507fe8 sp=0xc000507fe0 pc=0x5579c3f0ebc1
created by runtime.gcBgMarkStartWorkers in goroutine 1
	runtime/mgc.go:1339 +0x105

goroutine 38 gp=0xc000102a80 m=nil [GC worker (idle)]:
runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?)
	runtime/proc.go:435 +0xce fp=0xc000508738 sp=0xc000508718 pc=0x5579c3f06d2e
runtime.gcBgMarkWorker(0xc0000397a0)
	runtime/mgc.go:1423 +0xe9 fp=0xc0005087c8 sp=0xc000508738 pc=0x5579c3eb4a69
runtime.gcBgMarkStartWorkers.gowrap1()
	runtime/mgc.go:1339 +0x25 fp=0xc0005087e0 sp=0xc0005087c8 pc=0x5579c3eb4945
runtime.goexit({})
	runtime/asm_amd64.s:1700 +0x1 fp=0xc0005087e8 sp=0xc0005087e0 pc=0x5579c3f0ebc1
created by runtime.gcBgMarkStartWorkers in goroutine 1
	runtime/mgc.go:1339 +0x105

goroutine 39 gp=0xc000102c40 m=nil [GC worker (idle)]:
runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?)
	runtime/proc.go:435 +0xce fp=0xc000508f38 sp=0xc000508f18 pc=0x5579c3f06d2e
runtime.gcBgMarkWorker(0xc0000397a0)
	runtime/mgc.go:1423 +0xe9 fp=0xc000508fc8 sp=0xc000508f38 pc=0x5579c3eb4a69
runtime.gcBgMarkStartWorkers.gowrap1()
	runtime/mgc.go:1339 +0x25 fp=0xc000508fe0 sp=0xc000508fc8 pc=0x5579c3eb4945
runtime.goexit({})
	runtime/asm_amd64.s:1700 +0x1 fp=0xc000508fe8 sp=0xc000508fe0 pc=0x5579c3f0ebc1
created by runtime.gcBgMarkStartWorkers in goroutine 1
	runtime/mgc.go:1339 +0x105

goroutine 13 gp=0xc00021d6c0 m=nil [GC worker (idle)]:
runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?)
	runtime/proc.go:435 +0xce fp=0xc0000a7f38 sp=0xc0000a7f18 pc=0x5579c3f06d2e
runtime.gcBgMarkWorker(0xc0000397a0)
	runtime/mgc.go:1423 +0xe9 fp=0xc0000a7fc8 sp=0xc0000a7f38 pc=0x5579c3eb4a69
runtime.gcBgMarkStartWorkers.gowrap1()
	runtime/mgc.go:1339 +0x25 fp=0xc0000a7fe0 sp=0xc0000a7fc8 pc=0x5579c3eb4945
runtime.goexit({})
	runtime/asm_amd64.s:1700 +0x1 fp=0xc0000a7fe8 sp=0xc0000a7fe0 pc=0x5579c3f0ebc1
created by runtime.gcBgMarkStartWorkers in goroutine 1
	runtime/mgc.go:1339 +0x105

goroutine 14 gp=0xc00021d880 m=nil [GC worker (idle)]:
runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?)
	runtime/proc.go:435 +0xce fp=0xc0000a8738 sp=0xc0000a8718 pc=0x5579c3f06d2e
runtime.gcBgMarkWorker(0xc0000397a0)
	runtime/mgc.go:1423 +0xe9 fp=0xc0000a87c8 sp=0xc0000a8738 pc=0x5579c3eb4a69
runtime.gcBgMarkStartWorkers.gowrap1()
	runtime/mgc.go:1339 +0x25 fp=0xc0000a87e0 sp=0xc0000a87c8 pc=0x5579c3eb4945
runtime.goexit({})
	runtime/asm_amd64.s:1700 +0x1 fp=0xc0000a87e8 sp=0xc0000a87e0 pc=0x5579c3f0ebc1
created by runtime.gcBgMarkStartWorkers in goroutine 1
	runtime/mgc.go:1339 +0x105

goroutine 40 gp=0xc000102e00 m=nil [GC worker (idle)]:
runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?)
	runtime/proc.go:435 +0xce fp=0xc000509738 sp=0xc000509718 pc=0x5579c3f06d2e
runtime.gcBgMarkWorker(0xc0000397a0)
	runtime/mgc.go:1423 +0xe9 fp=0xc0005097c8 sp=0xc000509738 pc=0x5579c3eb4a69
runtime.gcBgMarkStartWorkers.gowrap1()
	runtime/mgc.go:1339 +0x25 fp=0xc0005097e0 sp=0xc0005097c8 pc=0x5579c3eb4945
runtime.goexit({})
	runtime/asm_amd64.s:1700 +0x1 fp=0xc0005097e8 sp=0xc0005097e0 pc=0x5579c3f0ebc1
created by runtime.gcBgMarkStartWorkers in goroutine 1
	runtime/mgc.go:1339 +0x105

goroutine 25 gp=0xc000504c40 m=nil [GC worker (idle)]:
runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?)
	runtime/proc.go:435 +0xce fp=0xc00050df38 sp=0xc00050df18 pc=0x5579c3f06d2e
runtime.gcBgMarkWorker(0xc0000397a0)
	runtime/mgc.go:1423 +0xe9 fp=0xc00050dfc8 sp=0xc00050df38 pc=0x5579c3eb4a69
runtime.gcBgMarkStartWorkers.gowrap1()
	runtime/mgc.go:1339 +0x25 fp=0xc00050dfe0 sp=0xc00050dfc8 pc=0x5579c3eb4945
runtime.goexit({})
	runtime/asm_amd64.s:1700 +0x1 fp=0xc00050dfe8 sp=0xc00050dfe0 pc=0x5579c3f0ebc1
created by runtime.gcBgMarkStartWorkers in goroutine 1
	runtime/mgc.go:1339 +0x105

goroutine 15 gp=0xc00021da40 m=nil [GC worker (idle)]:
runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?)
	runtime/proc.go:435 +0xce fp=0xc0000a8f38 sp=0xc0000a8f18 pc=0x5579c3f06d2e
runtime.gcBgMarkWorker(0xc0000397a0)
	runtime/mgc.go:1423 +0xe9 fp=0xc0000a8fc8 sp=0xc0000a8f38 pc=0x5579c3eb4a69
runtime.gcBgMarkStartWorkers.gowrap1()
	runtime/mgc.go:1339 +0x25 fp=0xc0000a8fe0 sp=0xc0000a8fc8 pc=0x5579c3eb4945
runtime.goexit({})
	runtime/asm_amd64.s:1700 +0x1 fp=0xc0000a8fe8 sp=0xc0000a8fe0 pc=0x5579c3f0ebc1
created by runtime.gcBgMarkStartWorkers in goroutine 1
	runtime/mgc.go:1339 +0x105

goroutine 41 gp=0xc000102fc0 m=nil [GC worker (idle)]:
runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?)
	runtime/proc.go:435 +0xce fp=0xc000509f38 sp=0xc000509f18 pc=0x5579c3f06d2e
runtime.gcBgMarkWorker(0xc0000397a0)
	runtime/mgc.go:1423 +0xe9 fp=0xc000509fc8 sp=0xc000509f38 pc=0x5579c3eb4a69
runtime.gcBgMarkStartWorkers.gowrap1()
	runtime/mgc.go:1339 +0x25 fp=0xc000509fe0 sp=0xc000509fc8 pc=0x5579c3eb4945
runtime.goexit({})
	runtime/asm_amd64.s:1700 +0x1 fp=0xc000509fe8 sp=0xc000509fe0 pc=0x5579c3f0ebc1
created by runtime.gcBgMarkStartWorkers in goroutine 1
	runtime/mgc.go:1339 +0x105

goroutine 42 gp=0xc000103180 m=nil [GC worker (idle)]:
runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?)
	runtime/proc.go:435 +0xce fp=0xc00011c738 sp=0xc00011c718 pc=0x5579c3f06d2e
runtime.gcBgMarkWorker(0xc0000397a0)
	runtime/mgc.go:1423 +0xe9 fp=0xc00011c7c8 sp=0xc00011c738 pc=0x5579c3eb4a69
runtime.gcBgMarkStartWorkers.gowrap1()
	runtime/mgc.go:1339 +0x25 fp=0xc00011c7e0 sp=0xc00011c7c8 pc=0x5579c3eb4945
runtime.goexit({})
	runtime/asm_amd64.s:1700 +0x1 fp=0xc00011c7e8 sp=0xc00011c7e0 pc=0x5579c3f0ebc1
created by runtime.gcBgMarkStartWorkers in goroutine 1
	runtime/mgc.go:1339 +0x105

goroutine 43 gp=0xc000103340 m=nil [GC worker (idle)]:
runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?)
	runtime/proc.go:435 +0xce fp=0xc00011cf38 sp=0xc00011cf18 pc=0x5579c3f06d2e
runtime.gcBgMarkWorker(0xc0000397a0)
	runtime/mgc.go:1423 +0xe9 fp=0xc00011cfc8 sp=0xc00011cf38 pc=0x5579c3eb4a69
runtime.gcBgMarkStartWorkers.gowrap1()
	runtime/mgc.go:1339 +0x25 fp=0xc00011cfe0 sp=0xc00011cfc8 pc=0x5579c3eb4945
runtime.goexit({})
	runtime/asm_amd64.s:1700 +0x1 fp=0xc00011cfe8 sp=0xc00011cfe0 pc=0x5579c3f0ebc1
created by runtime.gcBgMarkStartWorkers in goroutine 1
	runtime/mgc.go:1339 +0x105

goroutine 44 gp=0xc000103500 m=nil [GC worker (idle)]:
runtime.gopark(0x5579c5fb1ba0?, 0x1?, 0x7e?, 0x12?, 0x0?)
	runtime/proc.go:435 +0xce fp=0xc00011d738 sp=0xc00011d718 pc=0x5579c3f06d2e
runtime.gcBgMarkWorker(0xc0000397a0)
	runtime/mgc.go:1423 +0xe9 fp=0xc00011d7c8 sp=0xc00011d738 pc=0x5579c3eb4a69
runtime.gcBgMarkStartWorkers.gowrap1()
	runtime/mgc.go:1339 +0x25 fp=0xc00011d7e0 sp=0xc00011d7c8 pc=0x5579c3eb4945
runtime.goexit({})
	runtime/asm_amd64.s:1700 +0x1 fp=0xc00011d7e8 sp=0xc00011d7e0 pc=0x5579c3f0ebc1
created by runtime.gcBgMarkStartWorkers in goroutine 1
	runtime/mgc.go:1339 +0x105

goroutine 45 gp=0xc0001036c0 m=nil [GC worker (idle)]:
runtime.gopark(0xb8bb7ab8e9?, 0x1?, 0x43?, 0x9d?, 0x0?)
	runtime/proc.go:435 +0xce fp=0xc00011df38 sp=0xc00011df18 pc=0x5579c3f06d2e
runtime.gcBgMarkWorker(0xc0000397a0)
	runtime/mgc.go:1423 +0xe9 fp=0xc00011dfc8 sp=0xc00011df38 pc=0x5579c3eb4a69
runtime.gcBgMarkStartWorkers.gowrap1()
	runtime/mgc.go:1339 +0x25 fp=0xc00011dfe0 sp=0xc00011dfc8 pc=0x5579c3eb4945
runtime.goexit({})
	runtime/asm_amd64.s:1700 +0x1 fp=0xc00011dfe8 sp=0xc00011dfe0 pc=0x5579c3f0ebc1
created by runtime.gcBgMarkStartWorkers in goroutine 1
	runtime/mgc.go:1339 +0x105

goroutine 26 gp=0xc000504e00 m=nil [GC worker (idle)]:
runtime.gopark(0x5579c5fb1ba0?, 0x1?, 0xa5?, 0x5f?, 0x0?)
	runtime/proc.go:435 +0xce fp=0xc000118738 sp=0xc000118718 pc=0x5579c3f06d2e
runtime.gcBgMarkWorker(0xc0000397a0)
	runtime/mgc.go:1423 +0xe9 fp=0xc0001187c8 sp=0xc000118738 pc=0x5579c3eb4a69
runtime.gcBgMarkStartWorkers.gowrap1()
	runtime/mgc.go:1339 +0x25 fp=0xc0001187e0 sp=0xc0001187c8 pc=0x5579c3eb4945
runtime.goexit({})
	runtime/asm_amd64.s:1700 +0x1 fp=0xc0001187e8 sp=0xc0001187e0 pc=0x5579c3f0ebc1
created by runtime.gcBgMarkStartWorkers in goroutine 1
	runtime/mgc.go:1339 +0x105

goroutine 27 gp=0xc000504fc0 m=nil [GC worker (idle)]:
runtime.gopark(0x5579c5fb1ba0?, 0x1?, 0x94?, 0x2a?, 0x0?)
	runtime/proc.go:435 +0xce fp=0xc000118f38 sp=0xc000118f18 pc=0x5579c3f06d2e
runtime.gcBgMarkWorker(0xc0000397a0)
	runtime/mgc.go:1423 +0xe9 fp=0xc000118fc8 sp=0xc000118f38 pc=0x5579c3eb4a69
runtime.gcBgMarkStartWorkers.gowrap1()
	runtime/mgc.go:1339 +0x25 fp=0xc000118fe0 sp=0xc000118fc8 pc=0x5579c3eb4945
runtime.goexit({})
	runtime/asm_amd64.s:1700 +0x1 fp=0xc000118fe8 sp=0xc000118fe0 pc=0x5579c3f0ebc1
created by runtime.gcBgMarkStartWorkers in goroutine 1
	runtime/mgc.go:1339 +0x105

goroutine 46 gp=0xc000103880 m=nil [GC worker (idle)]:
runtime.gopark(0xb8bb7a9c01?, 0x1?, 0x63?, 0xef?, 0x0?)
	runtime/proc.go:435 +0xce fp=0xc00011e738 sp=0xc00011e718 pc=0x5579c3f06d2e
runtime.gcBgMarkWorker(0xc0000397a0)
	runtime/mgc.go:1423 +0xe9 fp=0xc00011e7c8 sp=0xc00011e738 pc=0x5579c3eb4a69
runtime.gcBgMarkStartWorkers.gowrap1()
	runtime/mgc.go:1339 +0x25 fp=0xc00011e7e0 sp=0xc00011e7c8 pc=0x5579c3eb4945
runtime.goexit({})
	runtime/asm_amd64.s:1700 +0x1 fp=0xc00011e7e8 sp=0xc00011e7e0 pc=0x5579c3f0ebc1
created by runtime.gcBgMarkStartWorkers in goroutine 1
	runtime/mgc.go:1339 +0x105

goroutine 16 gp=0xc000505c00 m=nil [sync.WaitGroup.Wait]:
runtime.gopark(0x0?, 0x0?, 0x60?, 0x40?, 0x0?)
	runtime/proc.go:435 +0xce fp=0xc000119620 sp=0xc000119600 pc=0x5579c3f06d2e
runtime.goparkunlock(...)
	runtime/proc.go:441
runtime.semacquire1(0xc000270a20, 0x0, 0x1, 0x0, 0x18)
	runtime/sema.go:188 +0x229 fp=0xc000119688 sp=0xc000119620 pc=0x5579c3ee6ca9
sync.runtime_SemacquireWaitGroup(0x0?)
	runtime/sema.go:110 +0x25 fp=0xc0001196c0 sp=0xc000119688 pc=0x5579c3f08665
sync.(*WaitGroup).Wait(0x0?)
	sync/waitgroup.go:118 +0x48 fp=0xc0001196e8 sp=0xc0001196c0 pc=0x5579c3f1a628
github.com/ollama/ollama/runner/llamarunner.(*Server).run(0xc000270a00, {0x5579c55ae470, 0xc0004fa5f0})
	github.com/ollama/ollama/runner/llamarunner/runner.go:360 +0x4b fp=0xc0001197b8 sp=0xc0001196e8 pc=0x5579c43c2e8b
github.com/ollama/ollama/runner/llamarunner.Execute.gowrap1()
	github.com/ollama/ollama/runner/llamarunner/runner.go:981 +0x28 fp=0xc0001197e0 sp=0xc0001197b8 pc=0x5579c43c83e8
runtime.goexit({})
	runtime/asm_amd64.s:1700 +0x1 fp=0xc0001197e8 sp=0xc0001197e0 pc=0x5579c3f0ebc1
created by github.com/ollama/ollama/runner/llamarunner.Execute in goroutine 1
	github.com/ollama/ollama/runner/llamarunner/runner.go:981 +0x4c5

goroutine 50 gp=0xc000505dc0 m=nil [IO wait]:
runtime.gopark(0x7fabcba993f0?, 0xc000128800?, 0x70?, 0x99?, 0xb?)
	runtime/proc.go:435 +0xce fp=0xc000049948 sp=0xc000049928 pc=0x5579c3f06d2e
runtime.netpollblock(0x5579c3f2a4f8?, 0xc3ea0466?, 0x79?)
	runtime/netpoll.go:575 +0xf7 fp=0xc000049980 sp=0xc000049948 pc=0x5579c3ecc057
internal/poll.runtime_pollWait(0x7fab847c6d98, 0x72)
	runtime/netpoll.go:351 +0x85 fp=0xc0000499a0 sp=0xc000049980 pc=0x5579c3f05f45
internal/poll.(*pollDesc).wait(0xc000128800?, 0xc000340000?, 0x0)
	internal/poll/fd_poll_runtime.go:84 +0x27 fp=0xc0000499c8 sp=0xc0000499a0 pc=0x5579c3f8e0c7
internal/poll.(*pollDesc).waitRead(...)
	internal/poll/fd_poll_runtime.go:89
internal/poll.(*FD).Read(0xc000128800, {0xc000340000, 0x1000, 0x1000})
	internal/poll/fd_unix.go:165 +0x27a fp=0xc000049a60 sp=0xc0000499c8 pc=0x5579c3f8f3ba
net.(*netFD).Read(0xc000128800, {0xc000340000?, 0xc000049ad0?, 0x5579c3f8e585?})
	net/fd_posix.go:55 +0x25 fp=0xc000049aa8 sp=0xc000049a60 pc=0x5579c40045a5
net.(*conn).Read(0xc0000aea00, {0xc000340000?, 0x0?, 0x0?})
	net/net.go:194 +0x45 fp=0xc000049af0 sp=0xc000049aa8 pc=0x5579c4012965
net/http.(*connReader).Read(0xc0002670e0, {0xc000340000, 0x1000, 0x1000})
	net/http/server.go:798 +0x159 fp=0xc000049b40 sp=0xc000049af0 pc=0x5579c41ff939
bufio.(*Reader).fill(0xc000110720)
	bufio/bufio.go:113 +0x103 fp=0xc000049b78 sp=0xc000049b40 pc=0x5579c402aa23
bufio.(*Reader).Peek(0xc000110720, 0x4)
	bufio/bufio.go:152 +0x53 fp=0xc000049b98 sp=0xc000049b78 pc=0x5579c402ab53
net/http.(*conn).serve(0xc00026e480, {0x5579c55ae438, 0xc000266ff0})
	net/http/server.go:2137 +0x785 fp=0xc000049fb8 sp=0xc000049b98 pc=0x5579c4205725
net/http.(*Server).Serve.gowrap3()
	net/http/server.go:3454 +0x28 fp=0xc000049fe0 sp=0xc000049fb8 pc=0x5579c420ae88
runtime.goexit({})
	runtime/asm_amd64.s:1700 +0x1 fp=0xc000049fe8 sp=0xc000049fe0 pc=0x5579c3f0ebc1
created by net/http.(*Server).Serve in goroutine 1
	net/http/server.go:3454 +0x485

rax    0x7fabcbad0698
rbx    0x5579e24f4ff0
rcx    0x0
rdx    0x5579e0d62100
rdi    0x7fab75c58e48
rsi    0x3
rbp    0x7fabcbad0698
rsp    0x7ffd8fb11f00
r8     0x0
r9     0x0
r10    0x5579e252b950
r11    0x290
r12    0x0
r13    0x0
r14    0x18
r15    0x1
rip    0x7fab758c0bf5
rflags 0x10246
cs     0x33
fs     0x0
gs     0x0
time=2026-01-23T09:57:50.539Z level=INFO source=sched.go:479 msg="Load failed" model=/root/.ollama/models/blobs/sha256-2af3b81862c6be03c769683af18efdadb2c33f60ff32ab6f83e42c043d6c7816 error="llama runner process has terminated: exit status 2"
time=2026-01-23T09:57:50.542Z level=DEBUG source=server.go:1827 msg="stopping llama server" pid=72
[GIN] 2026/01/23 - 09:57:50 | 500 |  1.562787192s |       127.0.0.1 | POST     "/api/generate"

My command is:

podman container run --name ollama-rocm --detach --tty \
                      -e HSA_OVERRIDE_GFX_VERSION=11.0.0 \
                      -e OLLAMA_DEBUG=1 \
                      --device /dev/kfd --device /dev/dri \
                      --security-opt label=disable \
                      --volume $HOME/ollama:/root/.ollama \
                      --publish 11434:11434 \
                      docker.io/ollama/ollama:rocm

Hope it helps.

runtime/proc.go:441 runtime.forcegchelper() runtime/proc.go:348 +0xb8 fp=0xc0000aafe0 sp=0xc0000aafa8 pc=0x5579c3ed3a18 runtime.goexit({}) runtime/asm_amd64.s:1700 +0x1 fp=0xc0000aafe8 sp=0xc0000aafe0 pc=0x5579c3f0ebc1 created by runtime.init.7 in goroutine 1 runtime/proc.go:336 +0x1a goroutine 3 gp=0xc000003340 m=nil [GC sweep wait]: runtime.gopark(0x1?, 0x0?, 0x0?, 0x0?, 0x0?) runtime/proc.go:435 +0xce fp=0xc0000ab780 sp=0xc0000ab760 pc=0x5579c3f06d2e runtime.goparkunlock(...) runtime/proc.go:441 runtime.bgsweep(0xc0000d6000) runtime/mgcsweep.go:316 +0xdf fp=0xc0000ab7c8 sp=0xc0000ab780 pc=0x5579c3ebe1bf runtime.gcenable.gowrap1() runtime/mgc.go:204 +0x25 fp=0xc0000ab7e0 sp=0xc0000ab7c8 pc=0x5579c3eb25a5 runtime.goexit({}) runtime/asm_amd64.s:1700 +0x1 fp=0xc0000ab7e8 sp=0xc0000ab7e0 pc=0x5579c3f0ebc1 created by runtime.gcenable in goroutine 1 runtime/mgc.go:204 +0x66 goroutine 4 gp=0xc000003500 m=nil [GC scavenge wait]: runtime.gopark(0x10000?, 0x5579c5221730?, 0x0?, 0x0?, 0x0?) runtime/proc.go:435 +0xce fp=0xc0000abf78 sp=0xc0000abf58 pc=0x5579c3f06d2e runtime.goparkunlock(...) runtime/proc.go:441 runtime.(*scavengerState).park(0x5579c5ee2260) runtime/mgcscavenge.go:425 +0x49 fp=0xc0000abfa8 sp=0xc0000abf78 pc=0x5579c3ebbc09 runtime.bgscavenge(0xc0000d6000) runtime/mgcscavenge.go:658 +0x59 fp=0xc0000abfc8 sp=0xc0000abfa8 pc=0x5579c3ebc199 runtime.gcenable.gowrap2() runtime/mgc.go:205 +0x25 fp=0xc0000abfe0 sp=0xc0000abfc8 pc=0x5579c3eb2545 runtime.goexit({}) runtime/asm_amd64.s:1700 +0x1 fp=0xc0000abfe8 sp=0xc0000abfe0 pc=0x5579c3f0ebc1 created by runtime.gcenable in goroutine 1 runtime/mgc.go:205 +0xa5 goroutine 5 gp=0xc000003dc0 m=nil [finalizer wait]: runtime.gopark(0x1b8?, 0x5579c3ed5c89?, 0x1?, 0x23?, 0xc0000aa688?) 
runtime/proc.go:435 +0xce fp=0xc0000aa630 sp=0xc0000aa610 pc=0x5579c3f06d2e runtime.runfinq() runtime/mfinal.go:196 +0x107 fp=0xc0000aa7e0 sp=0xc0000aa630 pc=0x5579c3eb1567 runtime.goexit({}) runtime/asm_amd64.s:1700 +0x1 fp=0xc0000aa7e8 sp=0xc0000aa7e0 pc=0x5579c3f0ebc1 created by runtime.createfing in goroutine 1 runtime/mfinal.go:166 +0x3d goroutine 6 gp=0xc00021c8c0 m=nil [chan receive]: runtime.gopark(0xc0001c1860?, 0xc0000110e0?, 0x60?, 0xc7?, 0x5579c3fed188?) runtime/proc.go:435 +0xce fp=0xc0000ac718 sp=0xc0000ac6f8 pc=0x5579c3f06d2e runtime.chanrecv(0xc000038380, 0x0, 0x1) runtime/chan.go:664 +0x445 fp=0xc0000ac790 sp=0xc0000ac718 pc=0x5579c3ea3045 runtime.chanrecv1(0x0?, 0x0?) runtime/chan.go:506 +0x12 fp=0xc0000ac7b8 sp=0xc0000ac790 pc=0x5579c3ea2bd2 runtime.unique_runtime_registerUniqueMapCleanup.func2(...) runtime/mgc.go:1796 runtime.unique_runtime_registerUniqueMapCleanup.gowrap1() runtime/mgc.go:1799 +0x2f fp=0xc0000ac7e0 sp=0xc0000ac7b8 pc=0x5579c3eb574f runtime.goexit({}) runtime/asm_amd64.s:1700 +0x1 fp=0xc0000ac7e8 sp=0xc0000ac7e0 pc=0x5579c3f0ebc1 created by unique.runtime_registerUniqueMapCleanup in goroutine 1 runtime/mgc.go:1794 +0x85 goroutine 7 gp=0xc00021cc40 m=nil [GC worker (idle)]: runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?) runtime/proc.go:435 +0xce fp=0xc0000acf38 sp=0xc0000acf18 pc=0x5579c3f06d2e runtime.gcBgMarkWorker(0xc0000397a0) runtime/mgc.go:1423 +0xe9 fp=0xc0000acfc8 sp=0xc0000acf38 pc=0x5579c3eb4a69 runtime.gcBgMarkStartWorkers.gowrap1() runtime/mgc.go:1339 +0x25 fp=0xc0000acfe0 sp=0xc0000acfc8 pc=0x5579c3eb4945 runtime.goexit({}) runtime/asm_amd64.s:1700 +0x1 fp=0xc0000acfe8 sp=0xc0000acfe0 pc=0x5579c3f0ebc1 created by runtime.gcBgMarkStartWorkers in goroutine 1 runtime/mgc.go:1339 +0x105 goroutine 8 gp=0xc00021ce00 m=nil [GC worker (idle)]: runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?) 
runtime/proc.go:435 +0xce fp=0xc0000ad738 sp=0xc0000ad718 pc=0x5579c3f06d2e runtime.gcBgMarkWorker(0xc0000397a0) runtime/mgc.go:1423 +0xe9 fp=0xc0000ad7c8 sp=0xc0000ad738 pc=0x5579c3eb4a69 runtime.gcBgMarkStartWorkers.gowrap1() runtime/mgc.go:1339 +0x25 fp=0xc0000ad7e0 sp=0xc0000ad7c8 pc=0x5579c3eb4945 runtime.goexit({}) runtime/asm_amd64.s:1700 +0x1 fp=0xc0000ad7e8 sp=0xc0000ad7e0 pc=0x5579c3f0ebc1 created by runtime.gcBgMarkStartWorkers in goroutine 1 runtime/mgc.go:1339 +0x105 goroutine 9 gp=0xc00021cfc0 m=nil [GC worker (idle)]: runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?) runtime/proc.go:435 +0xce fp=0xc0000adf38 sp=0xc0000adf18 pc=0x5579c3f06d2e runtime.gcBgMarkWorker(0xc0000397a0) runtime/mgc.go:1423 +0xe9 fp=0xc0000adfc8 sp=0xc0000adf38 pc=0x5579c3eb4a69 runtime.gcBgMarkStartWorkers.gowrap1() runtime/mgc.go:1339 +0x25 fp=0xc0000adfe0 sp=0xc0000adfc8 pc=0x5579c3eb4945 runtime.goexit({}) runtime/asm_amd64.s:1700 +0x1 fp=0xc0000adfe8 sp=0xc0000adfe0 pc=0x5579c3f0ebc1 created by runtime.gcBgMarkStartWorkers in goroutine 1 runtime/mgc.go:1339 +0x105 goroutine 10 gp=0xc00021d180 m=nil [GC worker (idle)]: runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?) runtime/proc.go:435 +0xce fp=0xc0000a6738 sp=0xc0000a6718 pc=0x5579c3f06d2e runtime.gcBgMarkWorker(0xc0000397a0) runtime/mgc.go:1423 +0xe9 fp=0xc0000a67c8 sp=0xc0000a6738 pc=0x5579c3eb4a69 runtime.gcBgMarkStartWorkers.gowrap1() runtime/mgc.go:1339 +0x25 fp=0xc0000a67e0 sp=0xc0000a67c8 pc=0x5579c3eb4945 runtime.goexit({}) runtime/asm_amd64.s:1700 +0x1 fp=0xc0000a67e8 sp=0xc0000a67e0 pc=0x5579c3f0ebc1 created by runtime.gcBgMarkStartWorkers in goroutine 1 runtime/mgc.go:1339 +0x105 goroutine 18 gp=0xc000504000 m=nil [GC worker (idle)]: runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?) 
runtime/proc.go:435 +0xce fp=0xc00050a738 sp=0xc00050a718 pc=0x5579c3f06d2e runtime.gcBgMarkWorker(0xc0000397a0) runtime/mgc.go:1423 +0xe9 fp=0xc00050a7c8 sp=0xc00050a738 pc=0x5579c3eb4a69 runtime.gcBgMarkStartWorkers.gowrap1() runtime/mgc.go:1339 +0x25 fp=0xc00050a7e0 sp=0xc00050a7c8 pc=0x5579c3eb4945 runtime.goexit({}) runtime/asm_amd64.s:1700 +0x1 fp=0xc00050a7e8 sp=0xc00050a7e0 pc=0x5579c3f0ebc1 created by runtime.gcBgMarkStartWorkers in goroutine 1 runtime/mgc.go:1339 +0x105 goroutine 19 gp=0xc0005041c0 m=nil [GC worker (idle)]: runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?) runtime/proc.go:435 +0xce fp=0xc00050af38 sp=0xc00050af18 pc=0x5579c3f06d2e runtime.gcBgMarkWorker(0xc0000397a0) runtime/mgc.go:1423 +0xe9 fp=0xc00050afc8 sp=0xc00050af38 pc=0x5579c3eb4a69 runtime.gcBgMarkStartWorkers.gowrap1() runtime/mgc.go:1339 +0x25 fp=0xc00050afe0 sp=0xc00050afc8 pc=0x5579c3eb4945 runtime.goexit({}) runtime/asm_amd64.s:1700 +0x1 fp=0xc00050afe8 sp=0xc00050afe0 pc=0x5579c3f0ebc1 created by runtime.gcBgMarkStartWorkers in goroutine 1 runtime/mgc.go:1339 +0x105 goroutine 20 gp=0xc000504380 m=nil [GC worker (idle)]: runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?) runtime/proc.go:435 +0xce fp=0xc00050b738 sp=0xc00050b718 pc=0x5579c3f06d2e runtime.gcBgMarkWorker(0xc0000397a0) runtime/mgc.go:1423 +0xe9 fp=0xc00050b7c8 sp=0xc00050b738 pc=0x5579c3eb4a69 runtime.gcBgMarkStartWorkers.gowrap1() runtime/mgc.go:1339 +0x25 fp=0xc00050b7e0 sp=0xc00050b7c8 pc=0x5579c3eb4945 runtime.goexit({}) runtime/asm_amd64.s:1700 +0x1 fp=0xc00050b7e8 sp=0xc00050b7e0 pc=0x5579c3f0ebc1 created by runtime.gcBgMarkStartWorkers in goroutine 1 runtime/mgc.go:1339 +0x105 goroutine 21 gp=0xc000504540 m=nil [GC worker (idle)]: runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?) 
runtime/proc.go:435 +0xce fp=0xc00050bf38 sp=0xc00050bf18 pc=0x5579c3f06d2e runtime.gcBgMarkWorker(0xc0000397a0) runtime/mgc.go:1423 +0xe9 fp=0xc00050bfc8 sp=0xc00050bf38 pc=0x5579c3eb4a69 runtime.gcBgMarkStartWorkers.gowrap1() runtime/mgc.go:1339 +0x25 fp=0xc00050bfe0 sp=0xc00050bfc8 pc=0x5579c3eb4945 runtime.goexit({}) runtime/asm_amd64.s:1700 +0x1 fp=0xc00050bfe8 sp=0xc00050bfe0 pc=0x5579c3f0ebc1 created by runtime.gcBgMarkStartWorkers in goroutine 1 runtime/mgc.go:1339 +0x105 goroutine 22 gp=0xc000504700 m=nil [GC worker (idle)]: runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?) runtime/proc.go:435 +0xce fp=0xc00050c738 sp=0xc00050c718 pc=0x5579c3f06d2e runtime.gcBgMarkWorker(0xc0000397a0) runtime/mgc.go:1423 +0xe9 fp=0xc00050c7c8 sp=0xc00050c738 pc=0x5579c3eb4a69 runtime.gcBgMarkStartWorkers.gowrap1() runtime/mgc.go:1339 +0x25 fp=0xc00050c7e0 sp=0xc00050c7c8 pc=0x5579c3eb4945 runtime.goexit({}) runtime/asm_amd64.s:1700 +0x1 fp=0xc00050c7e8 sp=0xc00050c7e0 pc=0x5579c3f0ebc1 created by runtime.gcBgMarkStartWorkers in goroutine 1 runtime/mgc.go:1339 +0x105 goroutine 34 gp=0xc000102380 m=nil [GC worker (idle)]: runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?) runtime/proc.go:435 +0xce fp=0xc000506738 sp=0xc000506718 pc=0x5579c3f06d2e runtime.gcBgMarkWorker(0xc0000397a0) runtime/mgc.go:1423 +0xe9 fp=0xc0005067c8 sp=0xc000506738 pc=0x5579c3eb4a69 runtime.gcBgMarkStartWorkers.gowrap1() runtime/mgc.go:1339 +0x25 fp=0xc0005067e0 sp=0xc0005067c8 pc=0x5579c3eb4945 runtime.goexit({}) runtime/asm_amd64.s:1700 +0x1 fp=0xc0005067e8 sp=0xc0005067e0 pc=0x5579c3f0ebc1 created by runtime.gcBgMarkStartWorkers in goroutine 1 runtime/mgc.go:1339 +0x105 goroutine 11 gp=0xc00021d340 m=nil [GC worker (idle)]: runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?) 
runtime/proc.go:435 +0xce fp=0xc0000a6f38 sp=0xc0000a6f18 pc=0x5579c3f06d2e runtime.gcBgMarkWorker(0xc0000397a0) runtime/mgc.go:1423 +0xe9 fp=0xc0000a6fc8 sp=0xc0000a6f38 pc=0x5579c3eb4a69 runtime.gcBgMarkStartWorkers.gowrap1() runtime/mgc.go:1339 +0x25 fp=0xc0000a6fe0 sp=0xc0000a6fc8 pc=0x5579c3eb4945 runtime.goexit({}) runtime/asm_amd64.s:1700 +0x1 fp=0xc0000a6fe8 sp=0xc0000a6fe0 pc=0x5579c3f0ebc1 created by runtime.gcBgMarkStartWorkers in goroutine 1 runtime/mgc.go:1339 +0x105 goroutine 35 gp=0xc000102540 m=nil [GC worker (idle)]: runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?) runtime/proc.go:435 +0xce fp=0xc000506f38 sp=0xc000506f18 pc=0x5579c3f06d2e runtime.gcBgMarkWorker(0xc0000397a0) runtime/mgc.go:1423 +0xe9 fp=0xc000506fc8 sp=0xc000506f38 pc=0x5579c3eb4a69 runtime.gcBgMarkStartWorkers.gowrap1() runtime/mgc.go:1339 +0x25 fp=0xc000506fe0 sp=0xc000506fc8 pc=0x5579c3eb4945 runtime.goexit({}) runtime/asm_amd64.s:1700 +0x1 fp=0xc000506fe8 sp=0xc000506fe0 pc=0x5579c3f0ebc1 created by runtime.gcBgMarkStartWorkers in goroutine 1 runtime/mgc.go:1339 +0x105 goroutine 12 gp=0xc00021d500 m=nil [GC worker (idle)]: runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?) runtime/proc.go:435 +0xce fp=0xc0000a7738 sp=0xc0000a7718 pc=0x5579c3f06d2e runtime.gcBgMarkWorker(0xc0000397a0) runtime/mgc.go:1423 +0xe9 fp=0xc0000a77c8 sp=0xc0000a7738 pc=0x5579c3eb4a69 runtime.gcBgMarkStartWorkers.gowrap1() runtime/mgc.go:1339 +0x25 fp=0xc0000a77e0 sp=0xc0000a77c8 pc=0x5579c3eb4945 runtime.goexit({}) runtime/asm_amd64.s:1700 +0x1 fp=0xc0000a77e8 sp=0xc0000a77e0 pc=0x5579c3f0ebc1 created by runtime.gcBgMarkStartWorkers in goroutine 1 runtime/mgc.go:1339 +0x105 goroutine 36 gp=0xc000102700 m=nil [GC worker (idle)]: runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?) 
runtime/proc.go:435 +0xce fp=0xc000507738 sp=0xc000507718 pc=0x5579c3f06d2e runtime.gcBgMarkWorker(0xc0000397a0) runtime/mgc.go:1423 +0xe9 fp=0xc0005077c8 sp=0xc000507738 pc=0x5579c3eb4a69 runtime.gcBgMarkStartWorkers.gowrap1() runtime/mgc.go:1339 +0x25 fp=0xc0005077e0 sp=0xc0005077c8 pc=0x5579c3eb4945 runtime.goexit({}) runtime/asm_amd64.s:1700 +0x1 fp=0xc0005077e8 sp=0xc0005077e0 pc=0x5579c3f0ebc1 created by runtime.gcBgMarkStartWorkers in goroutine 1 runtime/mgc.go:1339 +0x105 goroutine 23 gp=0xc0005048c0 m=nil [GC worker (idle)]: runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?) runtime/proc.go:435 +0xce fp=0xc00050cf38 sp=0xc00050cf18 pc=0x5579c3f06d2e runtime.gcBgMarkWorker(0xc0000397a0) runtime/mgc.go:1423 +0xe9 fp=0xc00050cfc8 sp=0xc00050cf38 pc=0x5579c3eb4a69 runtime.gcBgMarkStartWorkers.gowrap1() runtime/mgc.go:1339 +0x25 fp=0xc00050cfe0 sp=0xc00050cfc8 pc=0x5579c3eb4945 runtime.goexit({}) runtime/asm_amd64.s:1700 +0x1 fp=0xc00050cfe8 sp=0xc00050cfe0 pc=0x5579c3f0ebc1 created by runtime.gcBgMarkStartWorkers in goroutine 1 runtime/mgc.go:1339 +0x105 goroutine 24 gp=0xc000504a80 m=nil [GC worker (idle)]: runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?) runtime/proc.go:435 +0xce fp=0xc00050d738 sp=0xc00050d718 pc=0x5579c3f06d2e runtime.gcBgMarkWorker(0xc0000397a0) runtime/mgc.go:1423 +0xe9 fp=0xc00050d7c8 sp=0xc00050d738 pc=0x5579c3eb4a69 runtime.gcBgMarkStartWorkers.gowrap1() runtime/mgc.go:1339 +0x25 fp=0xc00050d7e0 sp=0xc00050d7c8 pc=0x5579c3eb4945 runtime.goexit({}) runtime/asm_amd64.s:1700 +0x1 fp=0xc00050d7e8 sp=0xc00050d7e0 pc=0x5579c3f0ebc1 created by runtime.gcBgMarkStartWorkers in goroutine 1 runtime/mgc.go:1339 +0x105 goroutine 37 gp=0xc0001028c0 m=nil [GC worker (idle)]: runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?) 
runtime/proc.go:435 +0xce fp=0xc000507f38 sp=0xc000507f18 pc=0x5579c3f06d2e runtime.gcBgMarkWorker(0xc0000397a0) runtime/mgc.go:1423 +0xe9 fp=0xc000507fc8 sp=0xc000507f38 pc=0x5579c3eb4a69 runtime.gcBgMarkStartWorkers.gowrap1() runtime/mgc.go:1339 +0x25 fp=0xc000507fe0 sp=0xc000507fc8 pc=0x5579c3eb4945 runtime.goexit({}) runtime/asm_amd64.s:1700 +0x1 fp=0xc000507fe8 sp=0xc000507fe0 pc=0x5579c3f0ebc1 created by runtime.gcBgMarkStartWorkers in goroutine 1 runtime/mgc.go:1339 +0x105 goroutine 38 gp=0xc000102a80 m=nil [GC worker (idle)]: runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?) runtime/proc.go:435 +0xce fp=0xc000508738 sp=0xc000508718 pc=0x5579c3f06d2e runtime.gcBgMarkWorker(0xc0000397a0) runtime/mgc.go:1423 +0xe9 fp=0xc0005087c8 sp=0xc000508738 pc=0x5579c3eb4a69 runtime.gcBgMarkStartWorkers.gowrap1() runtime/mgc.go:1339 +0x25 fp=0xc0005087e0 sp=0xc0005087c8 pc=0x5579c3eb4945 runtime.goexit({}) runtime/asm_amd64.s:1700 +0x1 fp=0xc0005087e8 sp=0xc0005087e0 pc=0x5579c3f0ebc1 created by runtime.gcBgMarkStartWorkers in goroutine 1 runtime/mgc.go:1339 +0x105 goroutine 39 gp=0xc000102c40 m=nil [GC worker (idle)]: runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?) runtime/proc.go:435 +0xce fp=0xc000508f38 sp=0xc000508f18 pc=0x5579c3f06d2e runtime.gcBgMarkWorker(0xc0000397a0) runtime/mgc.go:1423 +0xe9 fp=0xc000508fc8 sp=0xc000508f38 pc=0x5579c3eb4a69 runtime.gcBgMarkStartWorkers.gowrap1() runtime/mgc.go:1339 +0x25 fp=0xc000508fe0 sp=0xc000508fc8 pc=0x5579c3eb4945 runtime.goexit({}) runtime/asm_amd64.s:1700 +0x1 fp=0xc000508fe8 sp=0xc000508fe0 pc=0x5579c3f0ebc1 created by runtime.gcBgMarkStartWorkers in goroutine 1 runtime/mgc.go:1339 +0x105 goroutine 13 gp=0xc00021d6c0 m=nil [GC worker (idle)]: runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?) 
runtime/proc.go:435 +0xce fp=0xc0000a7f38 sp=0xc0000a7f18 pc=0x5579c3f06d2e runtime.gcBgMarkWorker(0xc0000397a0) runtime/mgc.go:1423 +0xe9 fp=0xc0000a7fc8 sp=0xc0000a7f38 pc=0x5579c3eb4a69 runtime.gcBgMarkStartWorkers.gowrap1() runtime/mgc.go:1339 +0x25 fp=0xc0000a7fe0 sp=0xc0000a7fc8 pc=0x5579c3eb4945 runtime.goexit({}) runtime/asm_amd64.s:1700 +0x1 fp=0xc0000a7fe8 sp=0xc0000a7fe0 pc=0x5579c3f0ebc1 created by runtime.gcBgMarkStartWorkers in goroutine 1 runtime/mgc.go:1339 +0x105 goroutine 14 gp=0xc00021d880 m=nil [GC worker (idle)]: runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?) runtime/proc.go:435 +0xce fp=0xc0000a8738 sp=0xc0000a8718 pc=0x5579c3f06d2e runtime.gcBgMarkWorker(0xc0000397a0) runtime/mgc.go:1423 +0xe9 fp=0xc0000a87c8 sp=0xc0000a8738 pc=0x5579c3eb4a69 runtime.gcBgMarkStartWorkers.gowrap1() runtime/mgc.go:1339 +0x25 fp=0xc0000a87e0 sp=0xc0000a87c8 pc=0x5579c3eb4945 runtime.goexit({}) runtime/asm_amd64.s:1700 +0x1 fp=0xc0000a87e8 sp=0xc0000a87e0 pc=0x5579c3f0ebc1 created by runtime.gcBgMarkStartWorkers in goroutine 1 runtime/mgc.go:1339 +0x105 goroutine 40 gp=0xc000102e00 m=nil [GC worker (idle)]: runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?) runtime/proc.go:435 +0xce fp=0xc000509738 sp=0xc000509718 pc=0x5579c3f06d2e runtime.gcBgMarkWorker(0xc0000397a0) runtime/mgc.go:1423 +0xe9 fp=0xc0005097c8 sp=0xc000509738 pc=0x5579c3eb4a69 runtime.gcBgMarkStartWorkers.gowrap1() runtime/mgc.go:1339 +0x25 fp=0xc0005097e0 sp=0xc0005097c8 pc=0x5579c3eb4945 runtime.goexit({}) runtime/asm_amd64.s:1700 +0x1 fp=0xc0005097e8 sp=0xc0005097e0 pc=0x5579c3f0ebc1 created by runtime.gcBgMarkStartWorkers in goroutine 1 runtime/mgc.go:1339 +0x105 goroutine 25 gp=0xc000504c40 m=nil [GC worker (idle)]: runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?) 
runtime/proc.go:435 +0xce fp=0xc00050df38 sp=0xc00050df18 pc=0x5579c3f06d2e runtime.gcBgMarkWorker(0xc0000397a0) runtime/mgc.go:1423 +0xe9 fp=0xc00050dfc8 sp=0xc00050df38 pc=0x5579c3eb4a69 runtime.gcBgMarkStartWorkers.gowrap1() runtime/mgc.go:1339 +0x25 fp=0xc00050dfe0 sp=0xc00050dfc8 pc=0x5579c3eb4945 runtime.goexit({}) runtime/asm_amd64.s:1700 +0x1 fp=0xc00050dfe8 sp=0xc00050dfe0 pc=0x5579c3f0ebc1 created by runtime.gcBgMarkStartWorkers in goroutine 1 runtime/mgc.go:1339 +0x105 goroutine 15 gp=0xc00021da40 m=nil [GC worker (idle)]: runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?) runtime/proc.go:435 +0xce fp=0xc0000a8f38 sp=0xc0000a8f18 pc=0x5579c3f06d2e runtime.gcBgMarkWorker(0xc0000397a0) runtime/mgc.go:1423 +0xe9 fp=0xc0000a8fc8 sp=0xc0000a8f38 pc=0x5579c3eb4a69 runtime.gcBgMarkStartWorkers.gowrap1() runtime/mgc.go:1339 +0x25 fp=0xc0000a8fe0 sp=0xc0000a8fc8 pc=0x5579c3eb4945 runtime.goexit({}) runtime/asm_amd64.s:1700 +0x1 fp=0xc0000a8fe8 sp=0xc0000a8fe0 pc=0x5579c3f0ebc1 created by runtime.gcBgMarkStartWorkers in goroutine 1 runtime/mgc.go:1339 +0x105 goroutine 41 gp=0xc000102fc0 m=nil [GC worker (idle)]: runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?) runtime/proc.go:435 +0xce fp=0xc000509f38 sp=0xc000509f18 pc=0x5579c3f06d2e runtime.gcBgMarkWorker(0xc0000397a0) runtime/mgc.go:1423 +0xe9 fp=0xc000509fc8 sp=0xc000509f38 pc=0x5579c3eb4a69 runtime.gcBgMarkStartWorkers.gowrap1() runtime/mgc.go:1339 +0x25 fp=0xc000509fe0 sp=0xc000509fc8 pc=0x5579c3eb4945 runtime.goexit({}) runtime/asm_amd64.s:1700 +0x1 fp=0xc000509fe8 sp=0xc000509fe0 pc=0x5579c3f0ebc1 created by runtime.gcBgMarkStartWorkers in goroutine 1 runtime/mgc.go:1339 +0x105 goroutine 42 gp=0xc000103180 m=nil [GC worker (idle)]: runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?) 
runtime/proc.go:435 +0xce fp=0xc00011c738 sp=0xc00011c718 pc=0x5579c3f06d2e runtime.gcBgMarkWorker(0xc0000397a0) runtime/mgc.go:1423 +0xe9 fp=0xc00011c7c8 sp=0xc00011c738 pc=0x5579c3eb4a69 runtime.gcBgMarkStartWorkers.gowrap1() runtime/mgc.go:1339 +0x25 fp=0xc00011c7e0 sp=0xc00011c7c8 pc=0x5579c3eb4945 runtime.goexit({}) runtime/asm_amd64.s:1700 +0x1 fp=0xc00011c7e8 sp=0xc00011c7e0 pc=0x5579c3f0ebc1 created by runtime.gcBgMarkStartWorkers in goroutine 1 runtime/mgc.go:1339 +0x105 goroutine 43 gp=0xc000103340 m=nil [GC worker (idle)]: runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?) runtime/proc.go:435 +0xce fp=0xc00011cf38 sp=0xc00011cf18 pc=0x5579c3f06d2e runtime.gcBgMarkWorker(0xc0000397a0) runtime/mgc.go:1423 +0xe9 fp=0xc00011cfc8 sp=0xc00011cf38 pc=0x5579c3eb4a69 runtime.gcBgMarkStartWorkers.gowrap1() runtime/mgc.go:1339 +0x25 fp=0xc00011cfe0 sp=0xc00011cfc8 pc=0x5579c3eb4945 runtime.goexit({}) runtime/asm_amd64.s:1700 +0x1 fp=0xc00011cfe8 sp=0xc00011cfe0 pc=0x5579c3f0ebc1 created by runtime.gcBgMarkStartWorkers in goroutine 1 runtime/mgc.go:1339 +0x105 goroutine 44 gp=0xc000103500 m=nil [GC worker (idle)]: runtime.gopark(0x5579c5fb1ba0?, 0x1?, 0x7e?, 0x12?, 0x0?) runtime/proc.go:435 +0xce fp=0xc00011d738 sp=0xc00011d718 pc=0x5579c3f06d2e runtime.gcBgMarkWorker(0xc0000397a0) runtime/mgc.go:1423 +0xe9 fp=0xc00011d7c8 sp=0xc00011d738 pc=0x5579c3eb4a69 runtime.gcBgMarkStartWorkers.gowrap1() runtime/mgc.go:1339 +0x25 fp=0xc00011d7e0 sp=0xc00011d7c8 pc=0x5579c3eb4945 runtime.goexit({}) runtime/asm_amd64.s:1700 +0x1 fp=0xc00011d7e8 sp=0xc00011d7e0 pc=0x5579c3f0ebc1 created by runtime.gcBgMarkStartWorkers in goroutine 1 runtime/mgc.go:1339 +0x105 goroutine 45 gp=0xc0001036c0 m=nil [GC worker (idle)]: runtime.gopark(0xb8bb7ab8e9?, 0x1?, 0x43?, 0x9d?, 0x0?) 
runtime/proc.go:435 +0xce fp=0xc00011df38 sp=0xc00011df18 pc=0x5579c3f06d2e runtime.gcBgMarkWorker(0xc0000397a0) runtime/mgc.go:1423 +0xe9 fp=0xc00011dfc8 sp=0xc00011df38 pc=0x5579c3eb4a69 runtime.gcBgMarkStartWorkers.gowrap1() runtime/mgc.go:1339 +0x25 fp=0xc00011dfe0 sp=0xc00011dfc8 pc=0x5579c3eb4945 runtime.goexit({}) runtime/asm_amd64.s:1700 +0x1 fp=0xc00011dfe8 sp=0xc00011dfe0 pc=0x5579c3f0ebc1 created by runtime.gcBgMarkStartWorkers in goroutine 1 runtime/mgc.go:1339 +0x105 goroutine 26 gp=0xc000504e00 m=nil [GC worker (idle)]: runtime.gopark(0x5579c5fb1ba0?, 0x1?, 0xa5?, 0x5f?, 0x0?) runtime/proc.go:435 +0xce fp=0xc000118738 sp=0xc000118718 pc=0x5579c3f06d2e runtime.gcBgMarkWorker(0xc0000397a0) runtime/mgc.go:1423 +0xe9 fp=0xc0001187c8 sp=0xc000118738 pc=0x5579c3eb4a69 runtime.gcBgMarkStartWorkers.gowrap1() runtime/mgc.go:1339 +0x25 fp=0xc0001187e0 sp=0xc0001187c8 pc=0x5579c3eb4945 runtime.goexit({}) runtime/asm_amd64.s:1700 +0x1 fp=0xc0001187e8 sp=0xc0001187e0 pc=0x5579c3f0ebc1 created by runtime.gcBgMarkStartWorkers in goroutine 1 runtime/mgc.go:1339 +0x105 goroutine 27 gp=0xc000504fc0 m=nil [GC worker (idle)]: runtime.gopark(0x5579c5fb1ba0?, 0x1?, 0x94?, 0x2a?, 0x0?) runtime/proc.go:435 +0xce fp=0xc000118f38 sp=0xc000118f18 pc=0x5579c3f06d2e runtime.gcBgMarkWorker(0xc0000397a0) runtime/mgc.go:1423 +0xe9 fp=0xc000118fc8 sp=0xc000118f38 pc=0x5579c3eb4a69 runtime.gcBgMarkStartWorkers.gowrap1() runtime/mgc.go:1339 +0x25 fp=0xc000118fe0 sp=0xc000118fc8 pc=0x5579c3eb4945 runtime.goexit({}) runtime/asm_amd64.s:1700 +0x1 fp=0xc000118fe8 sp=0xc000118fe0 pc=0x5579c3f0ebc1 created by runtime.gcBgMarkStartWorkers in goroutine 1 runtime/mgc.go:1339 +0x105 goroutine 46 gp=0xc000103880 m=nil [GC worker (idle)]: runtime.gopark(0xb8bb7a9c01?, 0x1?, 0x63?, 0xef?, 0x0?) 
runtime/proc.go:435 +0xce fp=0xc00011e738 sp=0xc00011e718 pc=0x5579c3f06d2e runtime.gcBgMarkWorker(0xc0000397a0) runtime/mgc.go:1423 +0xe9 fp=0xc00011e7c8 sp=0xc00011e738 pc=0x5579c3eb4a69 runtime.gcBgMarkStartWorkers.gowrap1() runtime/mgc.go:1339 +0x25 fp=0xc00011e7e0 sp=0xc00011e7c8 pc=0x5579c3eb4945 runtime.goexit({}) runtime/asm_amd64.s:1700 +0x1 fp=0xc00011e7e8 sp=0xc00011e7e0 pc=0x5579c3f0ebc1 created by runtime.gcBgMarkStartWorkers in goroutine 1 runtime/mgc.go:1339 +0x105 goroutine 16 gp=0xc000505c00 m=nil [sync.WaitGroup.Wait]: runtime.gopark(0x0?, 0x0?, 0x60?, 0x40?, 0x0?) runtime/proc.go:435 +0xce fp=0xc000119620 sp=0xc000119600 pc=0x5579c3f06d2e runtime.goparkunlock(...) runtime/proc.go:441 runtime.semacquire1(0xc000270a20, 0x0, 0x1, 0x0, 0x18) runtime/sema.go:188 +0x229 fp=0xc000119688 sp=0xc000119620 pc=0x5579c3ee6ca9 sync.runtime_SemacquireWaitGroup(0x0?) runtime/sema.go:110 +0x25 fp=0xc0001196c0 sp=0xc000119688 pc=0x5579c3f08665 sync.(*WaitGroup).Wait(0x0?) sync/waitgroup.go:118 +0x48 fp=0xc0001196e8 sp=0xc0001196c0 pc=0x5579c3f1a628 github.com/ollama/ollama/runner/llamarunner.(*Server).run(0xc000270a00, {0x5579c55ae470, 0xc0004fa5f0}) github.com/ollama/ollama/runner/llamarunner/runner.go:360 +0x4b fp=0xc0001197b8 sp=0xc0001196e8 pc=0x5579c43c2e8b github.com/ollama/ollama/runner/llamarunner.Execute.gowrap1() github.com/ollama/ollama/runner/llamarunner/runner.go:981 +0x28 fp=0xc0001197e0 sp=0xc0001197b8 pc=0x5579c43c83e8 runtime.goexit({}) runtime/asm_amd64.s:1700 +0x1 fp=0xc0001197e8 sp=0xc0001197e0 pc=0x5579c3f0ebc1 created by github.com/ollama/ollama/runner/llamarunner.Execute in goroutine 1 github.com/ollama/ollama/runner/llamarunner/runner.go:981 +0x4c5 goroutine 50 gp=0xc000505dc0 m=nil [IO wait]: runtime.gopark(0x7fabcba993f0?, 0xc000128800?, 0x70?, 0x99?, 0xb?) runtime/proc.go:435 +0xce fp=0xc000049948 sp=0xc000049928 pc=0x5579c3f06d2e runtime.netpollblock(0x5579c3f2a4f8?, 0xc3ea0466?, 0x79?) 
runtime/netpoll.go:575 +0xf7 fp=0xc000049980 sp=0xc000049948 pc=0x5579c3ecc057 internal/poll.runtime_pollWait(0x7fab847c6d98, 0x72) runtime/netpoll.go:351 +0x85 fp=0xc0000499a0 sp=0xc000049980 pc=0x5579c3f05f45 internal/poll.(*pollDesc).wait(0xc000128800?, 0xc000340000?, 0x0) internal/poll/fd_poll_runtime.go:84 +0x27 fp=0xc0000499c8 sp=0xc0000499a0 pc=0x5579c3f8e0c7 internal/poll.(*pollDesc).waitRead(...) internal/poll/fd_poll_runtime.go:89 internal/poll.(*FD).Read(0xc000128800, {0xc000340000, 0x1000, 0x1000}) internal/poll/fd_unix.go:165 +0x27a fp=0xc000049a60 sp=0xc0000499c8 pc=0x5579c3f8f3ba net.(*netFD).Read(0xc000128800, {0xc000340000?, 0xc000049ad0?, 0x5579c3f8e585?}) net/fd_posix.go:55 +0x25 fp=0xc000049aa8 sp=0xc000049a60 pc=0x5579c40045a5 net.(*conn).Read(0xc0000aea00, {0xc000340000?, 0x0?, 0x0?}) net/net.go:194 +0x45 fp=0xc000049af0 sp=0xc000049aa8 pc=0x5579c4012965 net/http.(*connReader).Read(0xc0002670e0, {0xc000340000, 0x1000, 0x1000}) net/http/server.go:798 +0x159 fp=0xc000049b40 sp=0xc000049af0 pc=0x5579c41ff939 bufio.(*Reader).fill(0xc000110720) bufio/bufio.go:113 +0x103 fp=0xc000049b78 sp=0xc000049b40 pc=0x5579c402aa23 bufio.(*Reader).Peek(0xc000110720, 0x4) bufio/bufio.go:152 +0x53 fp=0xc000049b98 sp=0xc000049b78 pc=0x5579c402ab53 net/http.(*conn).serve(0xc00026e480, {0x5579c55ae438, 0xc000266ff0}) net/http/server.go:2137 +0x785 fp=0xc000049fb8 sp=0xc000049b98 pc=0x5579c4205725 net/http.(*Server).Serve.gowrap3() net/http/server.go:3454 +0x28 fp=0xc000049fe0 sp=0xc000049fb8 pc=0x5579c420ae88 runtime.goexit({}) runtime/asm_amd64.s:1700 +0x1 fp=0xc000049fe8 sp=0xc000049fe0 pc=0x5579c3f0ebc1 created by net/http.(*Server).Serve in goroutine 1 net/http/server.go:3454 +0x485 rax 0x7fabcbad0698 rbx 0x5579e24f4ff0 rcx 0x0 rdx 0x5579e0d62100 rdi 0x7fab75c58e48 rsi 0x3 rbp 0x7fabcbad0698 rsp 0x7ffd8fb11f00 r8 0x0 r9 0x0 r10 0x5579e252b950 r11 0x290 r12 0x0 r13 0x0 r14 0x18 r15 0x1 rip 0x7fab758c0bf5 rflags 0x10246 cs 0x33 fs 0x0 gs 0x0 
time=2026-01-23T09:57:50.539Z level=INFO source=sched.go:479 msg="Load failed" model=/root/.ollama/models/blobs/sha256-2af3b81862c6be03c769683af18efdadb2c33f60ff32ab6f83e42c043d6c7816 error="llama runner process has terminated: exit status 2" time=2026-01-23T09:57:50.542Z level=DEBUG source=server.go:1827 msg="stopping llama server" pid=72 [GIN] 2026/01/23 - 09:57:50 | 500 | 1.562787192s | 127.0.0.1 | POST "/api/generate" ``` My command is ``` podman container run --name ollama-rocm --detach --tty \ -e HSA_OVERRIDE_GFX_VERSION=11.0.0 \ -e OLLAMA_DEBUG=1 \ --device /dev/kfd --device /dev/dri \ --security-opt label=disable \ --volume $HOME/ollama:/root/.ollama \ --publish 11434:11434 \ docker.io/ollama/ollama:rocm ``` Hope it helps

@esanchezm commented on GitHub (Jan 23, 2026):

After reviewing other GitHub issues, I was able to make it work by setting:

OLLAMA_VULKAN=1
OLLAMA_LLM_LIBRARY=vulkan 

So my podman command looks like this:

podman container run --name ollama-rocm --detach --tty \
                      -e HSA_OVERRIDE_GFX_VERSION=11.0.0 \
                      -e OLLAMA_VULKAN=1 \
                      -e OLLAMA_LLM_LIBRARY=vulkan \
                      --device /dev/kfd --device /dev/dri \
                      --security-opt label=disable \
                      --volume $HOME/ollama:/root/.ollama \
                      --publish 11434:11434 \
                      docker.io/ollama/ollama:rocm
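
For an install that runs under systemd rather than in a container, the same two variables could probably be set through a drop-in for the service. This is a minimal sketch, assuming the unit is named `ollama.service` (as in the `systemctl status` output elsewhere in this thread). The helper name `write_vulkan_dropin` and the file name `vulkan.conf` are hypothetical choices for illustration, and the variables themselves are taken from the comment above rather than verified independently.

```shell
# Write a systemd drop-in that sets the two variables from the comment above.
# $1 is the drop-in directory; for a real install this would typically be
# /etc/systemd/system/ollama.service.d (writing there needs root).
write_vulkan_dropin() {
    dropin_dir=$1
    mkdir -p "$dropin_dir"
    # "vulkan.conf" is an arbitrary name; systemd reads any *.conf in the directory.
    cat > "$dropin_dir/vulkan.conf" <<'EOF'
[Service]
Environment="OLLAMA_VULKAN=1"
Environment="OLLAMA_LLM_LIBRARY=vulkan"
EOF
}

# Usage on the host (as root), followed by a reload and restart:
#   write_vulkan_dropin /etc/systemd/system/ollama.service.d
#   systemctl daemon-reload
#   systemctl restart ollama
```

Keeping the directory as a parameter lets the file be generated and inspected somewhere harmless before it is placed under `/etc`.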

@yogeshbabu commented on GitHub (Jan 24, 2026):

Thanks. Using Vulkan is not really a GPU solution, but I will give it a try.


@yogeshbabu commented on GitHub (Jan 24, 2026):

I have installed the latest today: v0.15.0

> ollama -v
ollama version is 0.15.0

systemctl status

> systemctl status ollama
● ollama.service - Ollama Service
     Loaded: loaded (/etc/systemd/system/ollama.service; enabled; preset: enabled)
     Active: active (running) since Sat 2026-01-24 09:21:37 PST; 1min 28s ago
   Main PID: 99258 (ollama)
      Tasks: 22 (limit: 153478)
     Memory: 45.6M (peak: 536.8M)
        CPU: 7.690s
     CGroup: /system.slice/ollama.service
             └─99258 /usr/local/bin/ollama serve

Jan 24 09:22:15 AI-Desktop ollama[99258]: r14    0x16
Jan 24 09:22:15 AI-Desktop ollama[99258]: r15    0x7808bac43ba0
Jan 24 09:22:15 AI-Desktop ollama[99258]: rip    0x78095dc9eb2c
Jan 24 09:22:15 AI-Desktop ollama[99258]: rflags 0x246
Jan 24 09:22:15 AI-Desktop ollama[99258]: cs     0x33
Jan 24 09:22:15 AI-Desktop ollama[99258]: fs     0x0
Jan 24 09:22:15 AI-Desktop ollama[99258]: gs     0x0
Jan 24 09:22:15 AI-Desktop ollama[99258]: time=2026-01-24T09:22:15.570-08:00 level=ERROR source=server>
Jan 24 09:22:15 AI-Desktop ollama[99258]: time=2026-01-24T09:22:15.571-08:00 level=INFO source=sched.g>
Jan 24 09:22:15 AI-Desktop ollama[99258]: [GIN] 2026/01/24 - 09:22:15 | 500 |  2.054661713s |       12>

journalctl

> journalctl -e -u ollama
Jan 24 09:22:15 AI-Desktop ollama[99258]: rax    0x0
Jan 24 09:22:15 AI-Desktop ollama[99258]: rbx    0x184d7
Jan 24 09:22:15 AI-Desktop ollama[99258]: rcx    0x78095dc9eb2c
Jan 24 09:22:15 AI-Desktop ollama[99258]: rdx    0x6
Jan 24 09:22:15 AI-Desktop ollama[99258]: rdi    0x184d0
Jan 24 09:22:15 AI-Desktop ollama[99258]: rsi    0x184d7
Jan 24 09:22:15 AI-Desktop ollama[99258]: rbp    0x7809151ba300
Jan 24 09:22:15 AI-Desktop ollama[99258]: rsp    0x7809151ba2c0
Jan 24 09:22:15 AI-Desktop ollama[99258]: r8     0x0
Jan 24 09:22:15 AI-Desktop ollama[99258]: r9     0x0
Jan 24 09:22:15 AI-Desktop ollama[99258]: r10    0x8
Jan 24 09:22:15 AI-Desktop ollama[99258]: r11    0x246
Jan 24 09:22:15 AI-Desktop ollama[99258]: r12    0x6
Jan 24 09:22:15 AI-Desktop ollama[99258]: r13    0x7808bac6d890
Jan 24 09:22:15 AI-Desktop ollama[99258]: r14    0x16
Jan 24 09:22:15 AI-Desktop ollama[99258]: r15    0x7808bac43ba0
Jan 24 09:22:15 AI-Desktop ollama[99258]: rip    0x78095dc9eb2c
Jan 24 09:22:15 AI-Desktop ollama[99258]: rflags 0x246
Jan 24 09:22:15 AI-Desktop ollama[99258]: cs     0x33
Jan 24 09:22:15 AI-Desktop ollama[99258]: fs     0x0
Jan 24 09:22:15 AI-Desktop ollama[99258]: gs     0x0
Jan 24 09:22:15 AI-Desktop ollama[99258]: time=2026-01-24T09:22:15.570-08:00 level=ERROR source=server>
Jan 24 09:22:15 AI-Desktop ollama[99258]: time=2026-01-24T09:22:15.571-08:00 level=INFO source=sched.g>
Jan 24 09:22:15 AI-Desktop ollama[99258]: [GIN] 2026/01/24 - 09:22:15 | 500 |  2.054661713s |       12>
Jan 24 09:27:13 AI-Desktop ollama[99258]: [GIN] 2026/01/24 - 09:27:13 | 200 |      46.417µs |       12

Full log for today
ollama_log.txt (https://github.com/user-attachments/files/24838905/ollama_log.txt)
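
The journal lines above end in `>`, most likely because the pager cut them at the terminal width, so the text of the ERROR message is not visible. A minimal sketch for printing the full lines and keeping only the likely failure markers, assuming the unit is named `ollama` as in the status output. The function names `dump_ollama_journal` and `filter_failures` are made up for illustration, and the marker list is only a guess based on strings seen in this thread:

```shell
# Print the unit's journal without a pager so long lines are not cut off.
# Assumes the systemd unit is called "ollama".
dump_ollama_journal() {
    since="${1:-today}"
    journalctl -u ollama --no-pager --since "$since"
}

# Read a log on stdin and keep only lines that usually mark the failure.
# "|| true" keeps the exit status at 0 when nothing matches.
filter_failures() {
    grep -E 'level=ERROR|ROCm error|GGML_ASSERT|SIGABRT' || true
}

# Example:
#   dump_ollama_journal today | filter_failures
```

The filter also works on a saved file, for example `filter_failures < ollama_log.txt`.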


@ecevinoth commented on GitHub (Feb 1, 2026):

Similar to the AMD ROCm/Vulkan workaround, I used the value below for NVIDIA/CUDA:

# For shell sessions, add to ~/.zshrc or ~/.bashrc:
export OLLAMA_LLM_LIBRARY=cuda_v13

You can add this to your .bashrc or .zshrc file for a permanent fix.
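
Note that a shell export only affects an `ollama serve` started from that shell. When Ollama runs as a systemd service, as in the `systemctl status` output earlier in this thread, the service does not read ~/.bashrc, so the variable has to be set on the unit instead. A sketch, assuming the unit is called `ollama.service`; `render_override` is a hypothetical helper, and the value `cuda_v13` is simply the one from this comment and may not suit your setup:

```shell
# Print a systemd drop-in that sets one environment variable for a service.
render_override() {
    name="$1"
    value="$2"
    printf '[Service]\nEnvironment="%s=%s"\n' "$name" "$value"
}

# Example (needs root; the drop-in path assumes the unit is ollama.service):
#   sudo mkdir -p /etc/systemd/system/ollama.service.d
#   render_override OLLAMA_LLM_LIBRARY cuda_v13 | sudo tee /etc/systemd/system/ollama.service.d/override.conf
#   sudo systemctl daemon-reload
#   sudo systemctl restart ollama
```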


@rick-github commented on GitHub (Feb 1, 2026):

Jan 24 09:21:57 AI-Desktop ollama[99258]: ROCm error: out of memory

The Strix has two memory pools it draws from, GTT and VRAM. I'm wondering if the GPU driver is trying to allocate in just one of these pools and that pool is too small for the requested allocation. What's the output of:

radeontop -d - -l 1
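
If radeontop is not installed, the amdgpu kernel driver also reports both pools through sysfs. A minimal sketch, assuming the GPU is `card0` (the card index varies between systems); `amdgpu_pools` is a made-up helper name:

```shell
# Print used/total size in MiB for the VRAM and GTT pools of one amdgpu device.
# The default path assumes the GPU is card0; pass another device dir if needed.
amdgpu_pools() {
    dev="${1:-/sys/class/drm/card0/device}"
    for pool in vram gtt; do
        total_file="$dev/mem_info_${pool}_total"
        used_file="$dev/mem_info_${pool}_used"
        if [ -r "$total_file" ] && [ -r "$used_file" ]; then
            total=$(cat "$total_file")
            used=$(cat "$used_file")
            # The sysfs values are in bytes; 1048576 bytes = 1 MiB.
            echo "$pool: $((used / 1048576)) MiB used of $((total / 1048576)) MiB"
        else
            echo "$pool: not available under $dev"
        fi
    done
}

# Example:
#   amdgpu_pools
#   amdgpu_pools /sys/class/drm/card1/device
```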

@yogeshbabu commented on GitHub (Feb 1, 2026):

I have reset my PC and moved on to a different LLM service for now, as I needed to continue development.
Regarding memory allocation: I didn't have radeontop installed. I installed ROCm using amdgpu-install, and the setup was 96GB VRAM with the remainder assigned to GTT for all services. I used amd-ttm to set this up as per the official documentation.

All of this worked until some update changed something; I don't know what caused it to break. No model runs; execution goes straight to that error. I even tried 1.7b models and embedding models like nomic and gemma, which are <500 MB. All give the same error.


@w8sl commented on GitHub (Feb 5, 2026):

I'm experiencing the exact same issue with my RX6800. I'm not sure if it was ROCm 6.4 or an earlier version of Ollama, but everything was functioning perfectly previously, even with large LLMs exceeding available VRAM. PyTorch still works fine with ROCm 6.4 in Python venv. Unfortunately, since I'm using a rolling-release Arch-based distro, I can't downgrade Ollama + ROCm (currently 7.1.1)


@rick-github commented on GitHub (Feb 5, 2026):

Server log (https://docs.ollama.com/troubleshooting) may aid in debugging.


@w8sl commented on GitHub (Feb 6, 2026):

Tried with gemma3:4B and HSA_OVERRIDE_GFX_VERSION=10.3.0. The same error. As far as I can see, VRAM stays empty.
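
One detail worth checking in the log that follows: the `server config` line reports `OLLAMA_FLASH_ATTENTION:false`, yet a later line says `enabling flash attention`, and the assertion fires in `fattn-common.cuh`. Whether that is the cause is not established here. A small sketch for pulling single settings out of the `server config` line so they can be compared with later log lines; `config_value` is a hypothetical helper that only handles simple values without spaces, and `server.log` is a placeholder file name:

```shell
# Print the value of one key from the first 'msg="server config"' line on stdin.
# Keys are matched after "[" or a space; values end at a space or "]".
config_value() {
    key="$1"
    grep 'msg="server config"' | head -n1 |
        sed -n "s/.*[[ ]${key}:\([^] ]*\).*/\1/p"
}

# Example:
#   config_value OLLAMA_FLASH_ATTENTION < server.log
#   grep -c 'enabling flash attention' server.log
```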

time=2026-02-06T19:58:18.885+01:00 level=INFO source=routes.go:1631 msg="server config" env="map[CUDA_VISIBLE_DEVICES: GGML_VK_VISIBLE_DEVICES: GPU_DEVICE_ORDINAL: HIP_VISIBLE_DEVICES:0 HSA_OVERRIDE_GFX_VERSION:10.3.0 HTTPS_PROXY: HTTP_PROXY: NO_PROXY: OLLAMA_CONTEXT_LENGTH:4096 OLLAMA_DEBUG:INFO OLLAMA_FLASH_ATTENTION:false OLLAMA_GPU_OVERHEAD:0 OLLAMA_HOST:http://127.0.0.1:11434 OLLAMA_KEEP_ALIVE:5m0s OLLAMA_KV_CACHE_TYPE: OLLAMA_LLM_LIBRARY: OLLAMA_LOAD_TIMEOUT:5m0s OLLAMA_MAX_LOADED_MODELS:0 OLLAMA_MAX_QUEUE:512 OLLAMA_MODELS:/home/my_dir/.ollama/models OLLAMA_MULTIUSER_CACHE:false OLLAMA_NEW_ENGINE:false OLLAMA_NOHISTORY:false OLLAMA_NOPRUNE:false OLLAMA_NUM_PARALLEL:1 OLLAMA_ORIGINS:[http://localhost https://localhost http://localhost:* https://localhost:* http://127.0.0.1 https://127.0.0.1 http://127.0.0.1:* https://127.0.0.1:* http://0.0.0.0 https://0.0.0.0 http://0.0.0.0:* https://0.0.0.0:* app://* file://* tauri://* vscode-webview://* vscode-file://*] OLLAMA_REMOTES:[ollama.com] OLLAMA_SCHED_SPREAD:false OLLAMA_VULKAN:false ROCR_VISIBLE_DEVICES:0 http_proxy: https_proxy: no_proxy:]"
time=2026-02-06T19:58:18.886+01:00 level=INFO source=images.go:473 msg="total blobs: 10"
time=2026-02-06T19:58:18.886+01:00 level=INFO source=images.go:480 msg="total unused blobs removed: 0"
time=2026-02-06T19:58:18.886+01:00 level=INFO source=routes.go:1684 msg="Listening on 127.0.0.1:11434 (version 0.15.2)"
time=2026-02-06T19:58:18.886+01:00 level=INFO source=runner.go:67 msg="discovering available GPUs..."
time=2026-02-06T19:58:18.886+01:00 level=WARN source=runner.go:485 msg="user overrode visible devices" HIP_VISIBLE_DEVICES=0
time=2026-02-06T19:58:18.886+01:00 level=WARN source=runner.go:485 msg="user overrode visible devices" ROCR_VISIBLE_DEVICES=0
time=2026-02-06T19:58:18.886+01:00 level=WARN source=runner.go:485 msg="user overrode visible devices" HSA_OVERRIDE_GFX_VERSION=10.3.0
time=2026-02-06T19:58:18.886+01:00 level=WARN source=runner.go:489 msg="if GPUs are not correctly discovered, unset and try again"
time=2026-02-06T19:58:18.886+01:00 level=INFO source=server.go:429 msg="starting runner" cmd="/usr/bin/ollama runner --ollama-engine --port 46727"
time=2026-02-06T19:58:18.930+01:00 level=INFO source=server.go:429 msg="starting runner" cmd="/usr/bin/ollama runner --ollama-engine --port 34051"
time=2026-02-06T19:58:19.512+01:00 level=INFO source=types.go:42 msg="inference compute" id=GPU-974eabe421d4bd99 filter_id="" library=ROCm compute=gfx1030 name=ROCm0 description="AMD Radeon RX 6800" libdirs=ollama driver=70152.80 pci_id=0000:03:00.0 type=discrete total="16.0 GiB" available="15.2 GiB"
time=2026-02-06T19:58:19.512+01:00 level=INFO source=routes.go:1725 msg="entering low vram mode" "total vram"="16.0 GiB" threshold="20.0 GiB"
[GIN] 2026/02/06 - 19:59:29 | 200 |      27.337µs |       127.0.0.1 | HEAD     "/"
[GIN] 2026/02/06 - 19:59:30 | 200 |  182.490288ms |       127.0.0.1 | POST     "/api/show"
[GIN] 2026/02/06 - 19:59:30 | 200 |  182.444164ms |       127.0.0.1 | POST     "/api/show"
time=2026-02-06T19:59:30.415+01:00 level=INFO source=server.go:429 msg="starting runner" cmd="/usr/bin/ollama runner --ollama-engine --port 39413"
time=2026-02-06T19:59:30.577+01:00 level=INFO source=server.go:245 msg="enabling flash attention"
time=2026-02-06T19:59:30.577+01:00 level=INFO source=server.go:429 msg="starting runner" cmd="/usr/bin/ollama runner --ollama-engine --model /home/my_dir/.ollama/models/blobs/sha256-aeda25e63ebd698fab8638ffb778e68bed908b960d39d0becc650fa981609d25 --port 35569"
time=2026-02-06T19:59:30.577+01:00 level=INFO source=sched.go:452 msg="system memory" total="31.2 GiB" free="27.9 GiB" free_swap="8.0 GiB"
time=2026-02-06T19:59:30.577+01:00 level=INFO source=sched.go:459 msg="gpu memory" id=GPU-974eabe421d4bd99 library=ROCm available="14.7 GiB" free="15.2 GiB" minimum="457.0 MiB" overhead="0 B"
time=2026-02-06T19:59:30.577+01:00 level=INFO source=server.go:755 msg="loading model" "model layers"=35 requested=-1
time=2026-02-06T19:59:30.584+01:00 level=INFO source=runner.go:1405 msg="starting ollama engine"
time=2026-02-06T19:59:30.584+01:00 level=INFO source=runner.go:1440 msg="Server listening on 127.0.0.1:35569"
time=2026-02-06T19:59:30.588+01:00 level=INFO source=runner.go:1278 msg=load request="{Operation:fit LoraPath:[] Parallel:1 BatchSize:512 FlashAttention:Enabled KvSize:4096 KvCacheType: NumThreads:6 GPULayers:35[ID:GPU-974eabe421d4bd99 Layers:35(0..34)] MultiUserCache:false ProjectorPath: MainGPU:0 UseMmap:false}"
time=2026-02-06T19:59:30.631+01:00 level=INFO source=ggml.go:136 msg="" architecture=gemma3 file_type=Q4_K_M name="" description="" num_tensors=883 num_key_values=36
ggml_cuda_init: GGML_CUDA_FORCE_MMQ:    no
ggml_cuda_init: GGML_CUDA_FORCE_CUBLAS: no
ggml_cuda_init: found 1 ROCm devices:
  Device 0: AMD Radeon RX 6800, gfx1030 (0x1030), VMM: no, Wave Size: 32, ID: GPU-974eabe421d4bd99
load_backend: loaded ROCm backend from /usr/lib/ollama/libggml-hip.so
load_backend: loaded CPU backend from /usr/lib/ollama/libggml-cpu-icelake.so
time=2026-02-06T19:59:30.663+01:00 level=INFO source=ggml.go:104 msg=system CPU.0.SSE3=1 CPU.0.SSSE3=1 CPU.0.AVX=1 CPU.0.AVX2=1 CPU.0.F16C=1 CPU.0.FMA=1 CPU.0.BMI2=1 CPU.0.AVX512=1 CPU.0.AVX512_VBMI=1 CPU.0.AVX512_VNNI=1 CPU.0.LLAMAFILE=1 CPU.1.LLAMAFILE=1 ROCm.0.NO_VMM=1 ROCm.0.PEER_MAX_BATCH_SIZE=128 compiler=cgo(gcc)
/build/ollama/src/ollama/ml/backend/ggml/ggml/src/ggml-cuda/template-instances/../fattn-common.cuh:903: GGML_ASSERT(max_blocks_per_sm > 0) failed
[New LWP 9497]
[New LWP 9496]
[New LWP 9495]
[New LWP 9493]
[New LWP 9492]
[New LWP 9491]
[New LWP 9490]
[New LWP 9489]
[New LWP 9488]
[New LWP 9487]
[New LWP 9486]
[New LWP 9485]
[New LWP 9484]
[New LWP 9483]
[New LWP 9482]

This GDB supports auto-downloading debuginfo from the following URLs:
  <https://debuginfod.archlinux.org>
Enable debuginfod for this session? (y or [n]) [answered N; input not from terminal]
Debuginfod has been disabled.
To make this setting permanent, add 'set debuginfod enabled off' to .gdbinit.
[Thread debugging using libthread_db enabled]
Using host libthread_db library "/usr/lib/libthread_db.so.1".
0x0000561821d49dce in ?? ()
#0  0x0000561821d49dce in ?? ()
#1  0x0000561821d49be5 in ?? ()
#2  0x0000000000000000 in ?? ()
[Inferior 1 (process 9481) detached]
SIGABRT: abort
PC=0x7f2f5b4a790c m=3 sigcode=18446744073709551610
signal arrived during cgo execution

goroutine 22 gp=0xc0000bd500 m=3 mp=0xc000089008 [syscall]:
runtime.cgocall(0x561822b7fdf0, 0xc0000530d8)
        /usr/lib/go/src/runtime/cgocall.go:167 +0x4b fp=0xc0000530b0 sp=0xc000053078 pc=0x561821dbda0b
github.com/ollama/ollama/ml/backend/ggml._Cfunc_ggml_backend_sched_reserve(0x7f2ef08270e0, 0x7f2ef673a480)
        _cgo_gotypes.go:1014 +0x47 fp=0xc0000530d8 sp=0xc0000530b0 pc=0x56182224d767
github.com/ollama/ollama/ml/backend/ggml.(*Context).Reserve.func2(...)
        /build/ollama/src/ollama/ml/backend/ggml/ggml.go:850
github.com/ollama/ollama/ml/backend/ggml.(*Context).Reserve(0xc001f13a00)
        /build/ollama/src/ollama/ml/backend/ggml/ggml.go:850 +0x125 fp=0xc000053348 sp=0xc0000530d8 pc=0x56182225b5a5
github.com/ollama/ollama/runner/ollamarunner.(*Server).reserveWorstCaseGraph(0xc00022b0e0, 0x1)
        /build/ollama/src/ollama/runner/ollamarunner/runner.go:1163 +0xaee fp=0xc000053678 sp=0xc000053348 pc=0x561822344ece
github.com/ollama/ollama/runner/ollamarunner.(*Server).allocModel(0xc00022b0e0, {0x7ffed4ee0eea?, 0x5618220b981a?}, {0x0, 0x6, {0xc0001357c0, 0x1, 0x1}, 0x1}, {0x0, ...}, ...)
        /build/ollama/src/ollama/runner/ollamarunner/runner.go:1226 +0x395 fp=0xc000053728 sp=0xc000053678 pc=0x5618223456d5
github.com/ollama/ollama/runner/ollamarunner.(*Server).load(0xc00022b0e0, {0x5618233c4800, 0xc0004c22d0}, 0xc0005ba000)
        /build/ollama/src/ollama/runner/ollamarunner/runner.go:1305 +0x545 fp=0xc000053ab8 sp=0xc000053728 pc=0x561822346125
github.com/ollama/ollama/runner/ollamarunner.(*Server).load-fm({0x5618233c4800?, 0xc0004c22d0?}, 0xc0003a5b38?)
        <autogenerated>:1 +0x36 fp=0xc000053ae8 sp=0xc000053ab8 pc=0x561822348436
net/http.HandlerFunc.ServeHTTP(0xc0005ad680?, {0x5618233c4800?, 0xc0004c22d0?}, 0xc0003a5b58?)
        /usr/lib/go/src/net/http/server.go:2322 +0x29 fp=0xc000053b10 sp=0xc000053ae8 pc=0x5618220c4469
net/http.(*ServeMux).ServeHTTP(0x561821d61c25?, {0x5618233c4800, 0xc0004c22d0}, 0xc0005ba000)
        /usr/lib/go/src/net/http/server.go:2861 +0x1c7 fp=0xc000053b60 sp=0xc000053b10 pc=0x5618220c6347
net/http.serverHandler.ServeHTTP({0x5618233c1110?}, {0x5618233c4800?, 0xc0004c22d0?}, 0x1?)
        /usr/lib/go/src/net/http/server.go:3340 +0x8e fp=0xc000053b90 sp=0xc000053b60 pc=0x5618220e3c2e
net/http.(*conn).serve(0xc000238480, {0x5618233c6cd8, 0xc000237230})
        /usr/lib/go/src/net/http/server.go:2109 +0x665 fp=0xc000053fb8 sp=0xc000053b90 pc=0x5618220c2565
net/http.(*Server).Serve.gowrap3()
        /usr/lib/go/src/net/http/server.go:3493 +0x28 fp=0xc000053fe0 sp=0xc000053fb8 pc=0x5618220c8228
runtime.goexit({})
        /usr/lib/go/src/runtime/asm_amd64.s:1693 +0x1 fp=0xc000053fe8 sp=0xc000053fe0 pc=0x561821dc8fc1
created by net/http.(*Server).Serve in goroutine 1
        /usr/lib/go/src/net/http/server.go:3493 +0x485

goroutine 1 gp=0xc000002380 m=nil [IO wait]:
runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?)
        /usr/lib/go/src/runtime/proc.go:460 +0xce fp=0xc001485790 sp=0xc001485770 pc=0x561821dc0e8e
runtime.netpollblock(0xc0004977e0?, 0x21d55e66?, 0x18?)
        /usr/lib/go/src/runtime/netpoll.go:575 +0xf7 fp=0xc0014857c8 sp=0xc001485790 pc=0x561821d84a77
internal/poll.runtime_pollWait(0x7f2f12ff4e00, 0x72)
        /usr/lib/go/src/runtime/netpoll.go:351 +0x85 fp=0xc0014857e8 sp=0xc0014857c8 pc=0x561821dc0065
internal/poll.(*pollDesc).wait(0xc000176880?, 0x900000036?, 0x0)
        /usr/lib/go/src/internal/poll/fd_poll_runtime.go:84 +0x27 fp=0xc001485810 sp=0xc0014857e8 pc=0x561821e48d07
internal/poll.(*pollDesc).waitRead(...)
        /usr/lib/go/src/internal/poll/fd_poll_runtime.go:89
internal/poll.(*FD).Accept(0xc000176880)
        /usr/lib/go/src/internal/poll/fd_unix.go:613 +0x28c fp=0xc0014858b8 sp=0xc001485810 pc=0x561821e4e12c
net.(*netFD).accept(0xc000176880)
        /usr/lib/go/src/net/fd_unix.go:161 +0x29 fp=0xc001485970 sp=0xc0014858b8 pc=0x561821eb8729
net.(*TCPListener).accept(0xc0004bb640)
        /usr/lib/go/src/net/tcpsock_posix.go:159 +0x1b fp=0xc0014859c0 sp=0xc001485970 pc=0x561821ece3bb
net.(*TCPListener).Accept(0xc0004bb640)
        /usr/lib/go/src/net/tcpsock.go:380 +0x30 fp=0xc0014859f0 sp=0xc0014859c0 pc=0x561821ecd250
net/http.(*onceCloseListener).Accept(0xc000238480?)
        <autogenerated>:1 +0x24 fp=0xc001485a08 sp=0xc0014859f0 pc=0x5618220f0444
net/http.(*Server).Serve(0xc0000bf100, {0x5618233c4620, 0xc0004bb640})
        /usr/lib/go/src/net/http/server.go:3463 +0x30c fp=0xc001485b38 sp=0xc001485a08 pc=0x5618220c7e2c
github.com/ollama/ollama/runner/ollamarunner.Execute({0xc000136030, 0x4, 0x4})
        /build/ollama/src/ollama/runner/ollamarunner/runner.go:1441 +0x954 fp=0xc001485d08 sp=0xc001485b38 pc=0x561822347dd4
github.com/ollama/ollama/runner.Execute({0xc000136010?, 0x0?, 0x0?})
        /build/ollama/src/ollama/runner/runner.go:28 +0x130 fp=0xc001485d30 sp=0xc001485d08 pc=0x561822348730
github.com/ollama/ollama/cmd.NewCLI.func3(0xc0000bef00?, {0x561822e772fe?, 0x4?, 0x561822e77302?})
        /build/ollama/src/ollama/cmd/cmd.go:1966 +0x45 fp=0xc001485d58 sp=0xc001485d30 pc=0x561822b173c5
github.com/spf13/cobra.(*Command).execute(0xc00023d508, {0xc0004a5770, 0x5, 0x5})
        /build/ollama/src/pkg/mod/github.com/spf13/cobra@v1.7.0/command.go:940 +0x88a fp=0xc001485e78 sp=0xc001485d58 pc=0x561821f3292a
github.com/spf13/cobra.(*Command).ExecuteC(0xc0005a3508)
        /build/ollama/src/pkg/mod/github.com/spf13/cobra@v1.7.0/command.go:1068 +0x398 fp=0xc001485f30 sp=0xc001485e78 pc=0x561821f33158
github.com/spf13/cobra.(*Command).Execute(...)
        /build/ollama/src/pkg/mod/github.com/spf13/cobra@v1.7.0/command.go:992
github.com/spf13/cobra.(*Command).ExecuteContext(...)
        /build/ollama/src/pkg/mod/github.com/spf13/cobra@v1.7.0/command.go:985
main.main()
        /build/ollama/src/ollama/main.go:12 +0x4d fp=0xc001485f50 sp=0xc001485f30 pc=0x561822b17ead
runtime.main()
        /usr/lib/go/src/runtime/proc.go:285 +0x29d fp=0xc001485fe0 sp=0xc001485f50 pc=0x561821d8c31d
runtime.goexit({})
        /usr/lib/go/src/runtime/asm_amd64.s:1693 +0x1 fp=0xc001485fe8 sp=0xc001485fe0 pc=0x561821dc8fc1

goroutine 2 gp=0xc000002e00 m=nil [force gc (idle)]:
runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?)
        /usr/lib/go/src/runtime/proc.go:460 +0xce fp=0xc000084fa8 sp=0xc000084f88 pc=0x561821dc0e8e
runtime.goparkunlock(...)
        /usr/lib/go/src/runtime/proc.go:466
runtime.forcegchelper()
        /usr/lib/go/src/runtime/proc.go:373 +0xb8 fp=0xc000084fe0 sp=0xc000084fa8 pc=0x561821d8c658
runtime.goexit({})
        /usr/lib/go/src/runtime/asm_amd64.s:1693 +0x1 fp=0xc000084fe8 sp=0xc000084fe0 pc=0x561821dc8fc1
created by runtime.init.7 in goroutine 1
        /usr/lib/go/src/runtime/proc.go:361 +0x1a

goroutine 3 gp=0xc000003340 m=nil [GC sweep wait]:
runtime.gopark(0x1?, 0x0?, 0x0?, 0x0?, 0x0?)
        /usr/lib/go/src/runtime/proc.go:460 +0xce fp=0xc000085780 sp=0xc000085760 pc=0x561821dc0e8e
runtime.goparkunlock(...)
        /usr/lib/go/src/runtime/proc.go:466
runtime.bgsweep(0xc0000ae000)
        /usr/lib/go/src/runtime/mgcsweep.go:323 +0xdf fp=0xc0000857c8 sp=0xc000085780 pc=0x561821d7637f
runtime.gcenable.gowrap1()
        /usr/lib/go/src/runtime/mgc.go:212 +0x25 fp=0xc0000857e0 sp=0xc0000857c8 pc=0x561821d6a305
runtime.goexit({})
        /usr/lib/go/src/runtime/asm_amd64.s:1693 +0x1 fp=0xc0000857e8 sp=0xc0000857e0 pc=0x561821dc8fc1
created by runtime.gcenable in goroutine 1
        /usr/lib/go/src/runtime/mgc.go:212 +0x66

goroutine 4 gp=0xc000003500 m=nil [GC scavenge wait]:
runtime.gopark(0x10000?, 0x5618230599f0?, 0x0?, 0x0?, 0x0?)
        /usr/lib/go/src/runtime/proc.go:460 +0xce fp=0xc000085f78 sp=0xc000085f58 pc=0x561821dc0e8e
runtime.goparkunlock(...)
        /usr/lib/go/src/runtime/proc.go:466
runtime.(*scavengerState).park(0x561823cf8780)
        /usr/lib/go/src/runtime/mgcscavenge.go:425 +0x49 fp=0xc000085fa8 sp=0xc000085f78 pc=0x561821d73de9
runtime.bgscavenge(0xc0000ae000)
        /usr/lib/go/src/runtime/mgcscavenge.go:658 +0x59 fp=0xc000085fc8 sp=0xc000085fa8 pc=0x561821d74399
runtime.gcenable.gowrap2()
        /usr/lib/go/src/runtime/mgc.go:213 +0x25 fp=0xc000085fe0 sp=0xc000085fc8 pc=0x561821d6a2a5
runtime.goexit({})
        /usr/lib/go/src/runtime/asm_amd64.s:1693 +0x1 fp=0xc000085fe8 sp=0xc000085fe0 pc=0x561821dc8fc1
created by runtime.gcenable in goroutine 1
        /usr/lib/go/src/runtime/mgc.go:213 +0xa5

goroutine 18 gp=0xc000102700 m=nil [finalizer wait]:
runtime.gopark(0x561821d9b657?, 0x561821d61c25?, 0xb8?, 0x1?, 0xc000002380?)
        /usr/lib/go/src/runtime/proc.go:460 +0xce fp=0xc000084620 sp=0xc000084600 pc=0x561821dc0e8e
runtime.runFinalizers()
        /usr/lib/go/src/runtime/mfinal.go:210 +0x107 fp=0xc0000847e0 sp=0xc000084620 pc=0x561821d69207
runtime.goexit({})
        /usr/lib/go/src/runtime/asm_amd64.s:1693 +0x1 fp=0xc0000847e8 sp=0xc0000847e0 pc=0x561821dc8fc1
created by runtime.createfing in goroutine 1
        /usr/lib/go/src/runtime/mfinal.go:172 +0x3d

goroutine 19 gp=0xc000103180 m=nil [cleanup wait]:
runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?)
        /usr/lib/go/src/runtime/proc.go:460 +0xce fp=0xc000080768 sp=0xc000080748 pc=0x561821dc0e8e
runtime.goparkunlock(...)
        /usr/lib/go/src/runtime/proc.go:466
runtime.(*cleanupQueue).dequeue(0x561823cf90e0)
        /usr/lib/go/src/runtime/mcleanup.go:439 +0xc5 fp=0xc0000807a0 sp=0xc000080768 pc=0x561821d663e5
runtime.runCleanups()
        /usr/lib/go/src/runtime/mcleanup.go:635 +0x45 fp=0xc0000807e0 sp=0xc0000807a0 pc=0x561821d66aa5
runtime.goexit({})
        /usr/lib/go/src/runtime/asm_amd64.s:1693 +0x1 fp=0xc0000807e8 sp=0xc0000807e0 pc=0x561821dc8fc1
created by runtime.(*cleanupQueue).createGs in goroutine 1
        /usr/lib/go/src/runtime/mcleanup.go:589 +0xa5

goroutine 20 gp=0xc000103500 m=nil [GC worker (idle)]:
runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?)
        /usr/lib/go/src/runtime/proc.go:460 +0xce fp=0xc000080f38 sp=0xc000080f18 pc=0x561821dc0e8e
runtime.gcBgMarkWorker(0xc0001116c0)
        /usr/lib/go/src/runtime/mgc.go:1463 +0xeb fp=0xc000080fc8 sp=0xc000080f38 pc=0x561821d6ca2b
runtime.gcBgMarkStartWorkers.gowrap1()
        /usr/lib/go/src/runtime/mgc.go:1373 +0x25 fp=0xc000080fe0 sp=0xc000080fc8 pc=0x561821d6c905
runtime.goexit({})
        /usr/lib/go/src/runtime/asm_amd64.s:1693 +0x1 fp=0xc000080fe8 sp=0xc000080fe0 pc=0x561821dc8fc1
created by runtime.gcBgMarkStartWorkers in goroutine 1
        /usr/lib/go/src/runtime/mgc.go:1373 +0x105

goroutine 34 gp=0xc000484000 m=nil [GC worker (idle)]:
runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?)
        /usr/lib/go/src/runtime/proc.go:460 +0xce fp=0xc00048a738 sp=0xc00048a718 pc=0x561821dc0e8e
runtime.gcBgMarkWorker(0xc0001116c0)
        /usr/lib/go/src/runtime/mgc.go:1463 +0xeb fp=0xc00048a7c8 sp=0xc00048a738 pc=0x561821d6ca2b
runtime.gcBgMarkStartWorkers.gowrap1()
        /usr/lib/go/src/runtime/mgc.go:1373 +0x25 fp=0xc00048a7e0 sp=0xc00048a7c8 pc=0x561821d6c905
runtime.goexit({})
        /usr/lib/go/src/runtime/asm_amd64.s:1693 +0x1 fp=0xc00048a7e8 sp=0xc00048a7e0 pc=0x561821dc8fc1
created by runtime.gcBgMarkStartWorkers in goroutine 1
        /usr/lib/go/src/runtime/mgc.go:1373 +0x105

goroutine 35 gp=0xc0004841c0 m=nil [GC worker (idle)]:
runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?)
        /usr/lib/go/src/runtime/proc.go:460 +0xce fp=0xc00048af38 sp=0xc00048af18 pc=0x561821dc0e8e
runtime.gcBgMarkWorker(0xc0001116c0)
        /usr/lib/go/src/runtime/mgc.go:1463 +0xeb fp=0xc00048afc8 sp=0xc00048af38 pc=0x561821d6ca2b
runtime.gcBgMarkStartWorkers.gowrap1()
        /usr/lib/go/src/runtime/mgc.go:1373 +0x25 fp=0xc00048afe0 sp=0xc00048afc8 pc=0x561821d6c905
runtime.goexit({})
        /usr/lib/go/src/runtime/asm_amd64.s:1693 +0x1 fp=0xc00048afe8 sp=0xc00048afe0 pc=0x561821dc8fc1
created by runtime.gcBgMarkStartWorkers in goroutine 1
        /usr/lib/go/src/runtime/mgc.go:1373 +0x105

goroutine 36 gp=0xc000484380 m=nil [GC worker (idle)]:
runtime.gopark(0x194908844c2?, 0x0?, 0x0?, 0x0?, 0x0?)
        /usr/lib/go/src/runtime/proc.go:460 +0xce fp=0xc00048b738 sp=0xc00048b718 pc=0x561821dc0e8e
runtime.gcBgMarkWorker(0xc0001116c0)
        /usr/lib/go/src/runtime/mgc.go:1463 +0xeb fp=0xc00048b7c8 sp=0xc00048b738 pc=0x561821d6ca2b
runtime.gcBgMarkStartWorkers.gowrap1()
        /usr/lib/go/src/runtime/mgc.go:1373 +0x25 fp=0xc00048b7e0 sp=0xc00048b7c8 pc=0x561821d6c905
runtime.goexit({})
        /usr/lib/go/src/runtime/asm_amd64.s:1693 +0x1 fp=0xc00048b7e8 sp=0xc00048b7e0 pc=0x561821dc8fc1
created by runtime.gcBgMarkStartWorkers in goroutine 1
        /usr/lib/go/src/runtime/mgc.go:1373 +0x105

goroutine 37 gp=0xc000484540 m=nil [GC worker (idle)]:
runtime.gopark(0x19490879270?, 0x0?, 0x0?, 0x0?, 0x0?)
        /usr/lib/go/src/runtime/proc.go:460 +0xce fp=0xc00048bf38 sp=0xc00048bf18 pc=0x561821dc0e8e
runtime.gcBgMarkWorker(0xc0001116c0)
        /usr/lib/go/src/runtime/mgc.go:1463 +0xeb fp=0xc00048bfc8 sp=0xc00048bf38 pc=0x561821d6ca2b
runtime.gcBgMarkStartWorkers.gowrap1()
        /usr/lib/go/src/runtime/mgc.go:1373 +0x25 fp=0xc00048bfe0 sp=0xc00048bfc8 pc=0x561821d6c905
runtime.goexit({})
        /usr/lib/go/src/runtime/asm_amd64.s:1693 +0x1 fp=0xc00048bfe8 sp=0xc00048bfe0 pc=0x561821dc8fc1
created by runtime.gcBgMarkStartWorkers in goroutine 1
        /usr/lib/go/src/runtime/mgc.go:1373 +0x105

goroutine 5 gp=0xc000003a40 m=nil [GC worker (idle)]:
runtime.gopark(0x194908846ae?, 0x1?, 0x2b?, 0xbb?, 0x0?)
        /usr/lib/go/src/runtime/proc.go:460 +0xce fp=0xc00048cf38 sp=0xc00048cf18 pc=0x561821dc0e8e
runtime.gcBgMarkWorker(0xc0001116c0)
        /usr/lib/go/src/runtime/mgc.go:1463 +0xeb fp=0xc00048cfc8 sp=0xc00048cf38 pc=0x561821d6ca2b
runtime.gcBgMarkStartWorkers.gowrap1()
        /usr/lib/go/src/runtime/mgc.go:1373 +0x25 fp=0xc00048cfe0 sp=0xc00048cfc8 pc=0x561821d6c905
runtime.goexit({})
        /usr/lib/go/src/runtime/asm_amd64.s:1693 +0x1 fp=0xc00048cfe8 sp=0xc00048cfe0 pc=0x561821dc8fc1
created by runtime.gcBgMarkStartWorkers in goroutine 1
        /usr/lib/go/src/runtime/mgc.go:1373 +0x105

goroutine 6 gp=0xc000003c00 m=nil [GC worker (idle)]:
runtime.gopark(0x19490cd78e3?, 0x3?, 0x83?, 0x42?, 0x0?)
        /usr/lib/go/src/runtime/proc.go:460 +0xce fp=0xc000086f38 sp=0xc000086f18 pc=0x561821dc0e8e
runtime.gcBgMarkWorker(0xc0001116c0)
        /usr/lib/go/src/runtime/mgc.go:1463 +0xeb fp=0xc000086fc8 sp=0xc000086f38 pc=0x561821d6ca2b
runtime.gcBgMarkStartWorkers.gowrap1()
        /usr/lib/go/src/runtime/mgc.go:1373 +0x25 fp=0xc000086fe0 sp=0xc000086fc8 pc=0x561821d6c905
runtime.goexit({})
        /usr/lib/go/src/runtime/asm_amd64.s:1693 +0x1 fp=0xc000086fe8 sp=0xc000086fe0 pc=0x561821dc8fc1
created by runtime.gcBgMarkStartWorkers in goroutine 1
        /usr/lib/go/src/runtime/mgc.go:1373 +0x105

goroutine 7 gp=0xc000003dc0 m=nil [GC worker (idle)]:
runtime.gopark(0x19490884595?, 0x3?, 0xfc?, 0x3f?, 0x0?)
        /usr/lib/go/src/runtime/proc.go:460 +0xce fp=0xc000087738 sp=0xc000087718 pc=0x561821dc0e8e
runtime.gcBgMarkWorker(0xc0001116c0)
        /usr/lib/go/src/runtime/mgc.go:1463 +0xeb fp=0xc0000877c8 sp=0xc000087738 pc=0x561821d6ca2b
runtime.gcBgMarkStartWorkers.gowrap1()
        /usr/lib/go/src/runtime/mgc.go:1373 +0x25 fp=0xc0000877e0 sp=0xc0000877c8 pc=0x561821d6c905
runtime.goexit({})
        /usr/lib/go/src/runtime/asm_amd64.s:1693 +0x1 fp=0xc0000877e8 sp=0xc0000877e0 pc=0x561821dc8fc1
created by runtime.gcBgMarkStartWorkers in goroutine 1
        /usr/lib/go/src/runtime/mgc.go:1373 +0x105

goroutine 8 gp=0xc0000bc000 m=nil [GC worker (idle)]:
runtime.gopark(0x19490cb7e18?, 0x1?, 0xf?, 0xf8?, 0x0?)
        /usr/lib/go/src/runtime/proc.go:460 +0xce fp=0xc000087f38 sp=0xc000087f18 pc=0x561821dc0e8e
runtime.gcBgMarkWorker(0xc0001116c0)
        /usr/lib/go/src/runtime/mgc.go:1463 +0xeb fp=0xc000087fc8 sp=0xc000087f38 pc=0x561821d6ca2b
runtime.gcBgMarkStartWorkers.gowrap1()
        /usr/lib/go/src/runtime/mgc.go:1373 +0x25 fp=0xc000087fe0 sp=0xc000087fc8 pc=0x561821d6c905
runtime.goexit({})
        /usr/lib/go/src/runtime/asm_amd64.s:1693 +0x1 fp=0xc000087fe8 sp=0xc000087fe0 pc=0x561821dc8fc1
created by runtime.gcBgMarkStartWorkers in goroutine 1
        /usr/lib/go/src/runtime/mgc.go:1373 +0x105

goroutine 9 gp=0xc0000bc1c0 m=nil [GC worker (idle)]:
runtime.gopark(0x561823dc6fa0?, 0x1?, 0x2a?, 0xc2?, 0x0?)
        /usr/lib/go/src/runtime/proc.go:460 +0xce fp=0xc000098f38 sp=0xc000098f18 pc=0x561821dc0e8e
runtime.gcBgMarkWorker(0xc0001116c0)
        /usr/lib/go/src/runtime/mgc.go:1463 +0xeb fp=0xc000098fc8 sp=0xc000098f38 pc=0x561821d6ca2b
runtime.gcBgMarkStartWorkers.gowrap1()
        /usr/lib/go/src/runtime/mgc.go:1373 +0x25 fp=0xc000098fe0 sp=0xc000098fc8 pc=0x561821d6c905
runtime.goexit({})
        /usr/lib/go/src/runtime/asm_amd64.s:1693 +0x1 fp=0xc000098fe8 sp=0xc000098fe0 pc=0x561821dc8fc1
created by runtime.gcBgMarkStartWorkers in goroutine 1
        /usr/lib/go/src/runtime/mgc.go:1373 +0x105

goroutine 10 gp=0xc0000bc380 m=nil [GC worker (idle)]:
runtime.gopark(0x1949087932e?, 0x3?, 0x99?, 0x1?, 0x0?)
        /usr/lib/go/src/runtime/proc.go:460 +0xce fp=0xc000081738 sp=0xc000081718 pc=0x561821dc0e8e
runtime.gcBgMarkWorker(0xc0001116c0)
        /usr/lib/go/src/runtime/mgc.go:1463 +0xeb fp=0xc0000817c8 sp=0xc000081738 pc=0x561821d6ca2b
runtime.gcBgMarkStartWorkers.gowrap1()
        /usr/lib/go/src/runtime/mgc.go:1373 +0x25 fp=0xc0000817e0 sp=0xc0000817c8 pc=0x561821d6c905
runtime.goexit({})
        /usr/lib/go/src/runtime/asm_amd64.s:1693 +0x1 fp=0xc0000817e8 sp=0xc0000817e0 pc=0x561821dc8fc1
created by runtime.gcBgMarkStartWorkers in goroutine 1
        /usr/lib/go/src/runtime/mgc.go:1373 +0x105

goroutine 11 gp=0xc0000bc540 m=nil [GC worker (idle)]:
runtime.gopark(0x19490884639?, 0x3?, 0x32?, 0x17?, 0x0?)
        /usr/lib/go/src/runtime/proc.go:460 +0xce fp=0xc000487738 sp=0xc000487718 pc=0x561821dc0e8e
runtime.gcBgMarkWorker(0xc0001116c0)
        /usr/lib/go/src/runtime/mgc.go:1463 +0xeb fp=0xc0004877c8 sp=0xc000487738 pc=0x561821d6ca2b
runtime.gcBgMarkStartWorkers.gowrap1()
        /usr/lib/go/src/runtime/mgc.go:1373 +0x25 fp=0xc0004877e0 sp=0xc0004877c8 pc=0x561821d6c905
runtime.goexit({})
        /usr/lib/go/src/runtime/asm_amd64.s:1693 +0x1 fp=0xc0004877e8 sp=0xc0004877e0 pc=0x561821dc8fc1
created by runtime.gcBgMarkStartWorkers in goroutine 1
        /usr/lib/go/src/runtime/mgc.go:1373 +0x105

goroutine 21 gp=0xc0000bd340 m=nil [sync.WaitGroup.Wait]:
runtime.gopark(0xc000092aa0?, 0x561821dc6ff4?, 0x60?, 0x80?, 0x0?)
        /usr/lib/go/src/runtime/proc.go:460 +0xce fp=0xc000092a90 sp=0xc000092a70 pc=0x561821dc0e8e
runtime.goparkunlock(...)
        /usr/lib/go/src/runtime/proc.go:466
runtime.semacquire1(0xc00022b198, 0x0, 0x1, 0x0, 0x19)
        /usr/lib/go/src/runtime/sema.go:192 +0x229 fp=0xc000092af8 sp=0xc000092a90 pc=0x561821da0129
sync.runtime_SemacquireWaitGroup(0xc00011aab0?, 0xc5?)
        /usr/lib/go/src/runtime/sema.go:114 +0x2e fp=0xc000092b30 sp=0xc000092af8 pc=0x561821dc28ae
sync.(*WaitGroup).Wait(0xc00022b190)
        /usr/lib/go/src/sync/waitgroup.go:206 +0x85 fp=0xc000092b58 sp=0xc000092b30 pc=0x561821dd4c25
github.com/ollama/ollama/runner/ollamarunner.(*Server).run(0xc00022b0e0, {0x5618233c6d10, 0xc0004a5810})
        /build/ollama/src/ollama/runner/ollamarunner/runner.go:441 +0x45 fp=0xc000092fb8 sp=0xc000092b58 pc=0x56182233e8a5
github.com/ollama/ollama/runner/ollamarunner.Execute.gowrap1()
        /build/ollama/src/ollama/runner/ollamarunner/runner.go:1418 +0x28 fp=0xc000092fe0 sp=0xc000092fb8 pc=0x561822348048
runtime.goexit({})
        /usr/lib/go/src/runtime/asm_amd64.s:1693 +0x1 fp=0xc000092fe8 sp=0xc000092fe0 pc=0x561821dc8fc1
created by github.com/ollama/ollama/runner/ollamarunner.Execute in goroutine 1
        /build/ollama/src/ollama/runner/ollamarunner/runner.go:1418 +0x4c9

goroutine 12 gp=0xc000484700 m=nil [IO wait]:
runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0xb?)
        /usr/lib/go/src/runtime/proc.go:460 +0xce fp=0xc00048d5d8 sp=0xc00048d5b8 pc=0x561821dc0e8e
runtime.netpollblock(0x561821de5038?, 0x21d55e66?, 0x18?)
        /usr/lib/go/src/runtime/netpoll.go:575 +0xf7 fp=0xc00048d610 sp=0xc00048d5d8 pc=0x561821d84a77
internal/poll.runtime_pollWait(0x7f2f12ff4c00, 0x72)
        /usr/lib/go/src/runtime/netpoll.go:351 +0x85 fp=0xc00048d630 sp=0xc00048d610 pc=0x561821dc0065
internal/poll.(*pollDesc).wait(0xc000176900?, 0xc0004bb6a1?, 0x0)
        /usr/lib/go/src/internal/poll/fd_poll_runtime.go:84 +0x27 fp=0xc00048d658 sp=0xc00048d630 pc=0x561821e48d07
internal/poll.(*pollDesc).waitRead(...)
        /usr/lib/go/src/internal/poll/fd_poll_runtime.go:89
internal/poll.(*FD).Read(0xc000176900, {0xc0004bb6a1, 0x1, 0x1})
        /usr/lib/go/src/internal/poll/fd_unix.go:165 +0x279 fp=0xc00048d6f0 sp=0xc00048d658 pc=0x561821e49ff9
net.(*netFD).Read(0xc000176900, {0xc0004bb6a1?, 0x0?, 0x0?})
        /usr/lib/go/src/net/fd_posix.go:68 +0x25 fp=0xc00048d738 sp=0xc00048d6f0 pc=0x561821eb6885
net.(*conn).Read(0xc00007e628, {0xc0004bb6a1?, 0x0?, 0x0?})
        /usr/lib/go/src/net/net.go:196 +0x45 fp=0xc00048d780 sp=0xc00048d738 pc=0x561821ec48a5
net/http.(*connReader).backgroundRead(0xc0004bb680)
        /usr/lib/go/src/net/http/server.go:702 +0x33 fp=0xc00048d7c8 sp=0xc00048d780 pc=0x5618220bca13
net/http.(*connReader).startBackgroundRead.gowrap2()
        /usr/lib/go/src/net/http/server.go:698 +0x25 fp=0xc00048d7e0 sp=0xc00048d7c8 pc=0x5618220bc945
runtime.goexit({})
        /usr/lib/go/src/runtime/asm_amd64.s:1693 +0x1 fp=0xc00048d7e8 sp=0xc00048d7e0 pc=0x561821dc8fc1
created by net/http.(*connReader).startBackgroundRead in goroutine 22
        /usr/lib/go/src/net/http/server.go:698 +0xb6

rax    0x0
rbx    0x250b
rcx    0x7f2f5b4a790c
rdx    0x6
rdi    0x2509
rsi    0x250b
rbp    0x7f2f1402ba30
rsp    0x7f2f1402b9f0
r8     0x0
r9     0x0
r10    0x0
r11    0x246
r12    0x7f2eb164ee34
r13    0x387
r14    0x6
r15    0x0
rip    0x7f2f5b4a790c
rflags 0x246
cs     0x33
fs     0x0
gs     0x0
time=2026-02-06T19:59:31.215+01:00 level=INFO source=sched.go:479 msg="Load failed" model=/home/my_dir/.ollama/models/blobs/sha256-aeda25e63ebd698fab8638ffb778e68bed908b960d39d0becc650fa981609d25 error="do load request: Post \"http://127.0.0.1:35569/load\": EOF"
time=2026-02-06T19:59:31.218+01:00 level=ERROR source=server.go:302 msg="llama runner terminated" error="exit status 2"
[GIN] 2026/02/06 - 19:59:31 | 500 |  1.004984617s |       127.0.0.1 | POST     "/api/generate"

@rick-github commented on GitHub (Feb 6, 2026):

```
/build/ollama/src/ollama/ml/backend/ggml/ggml/src/ggml-cuda/template-instances/../fattn-common.cuh:903: GGML_ASSERT(max_blocks_per_sm > 0) failed
```

This is a different problem, open a new issue.
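
For anyone triaging a similar crash, the assertion line is what separates this failure from the original EOF report. A minimal sketch of pulling it out of a saved server log, assuming the log has been written to a file; the `find_ggml_assert` helper name and the `server.log` path are hypothetical, not part of Ollama:

```shell
# Hypothetical helper: print each distinct GGML_ASSERT failure found in a saved log.
# Only matches assertions whose condition contains no nested parentheses.
find_ggml_assert() {
    grep -o 'GGML_ASSERT([^)]*) failed' "$1" | sort -u
}

# Example call (the log path is an assumption; use wherever your server output was saved):
#   find_ggml_assert server.log
```

If the helper prints nothing, the log contains no simple `GGML_ASSERT` failure and the runner most likely died for another reason.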

@w8sl commented on GitHub (Feb 9, 2026):

> ```
> /build/ollama/src/ollama/ml/backend/ggml/ggml/src/ggml-cuda/template-instances/../fattn-common.cuh:903: GGML_ASSERT(max_blocks_per_sm > 0) failed
> ```
>
> This is a different problem, open a new issue.

I will pass on this, as neither Arch Linux nor RDNA2 GPUs are actively supported by ROCm.
llama.cpp with Vulkan is the way to go on Arch!

@rick-github commented on GitHub (Feb 10, 2026):

Ollama has experimental support for [Vulkan](https://docs.ollama.com/gpu#vulkan-gpu-support).
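
A rough sketch of what trying this looks like on a local install. It assumes the `OLLAMA_VULKAN=1` environment variable described on the linked documentation page is how the experimental backend is switched on for your Ollama version, so check that page first. The `serve_with_vulkan` wrapper name is hypothetical.

```shell
# Assumption: OLLAMA_VULKAN=1 opts the server in to the experimental Vulkan backend,
# per the linked GPU documentation. Verify this against the docs for your version.
serve_with_vulkan() {
    OLLAMA_VULKAN=1 ollama serve
}

# Start the server with the wrapper, then load a model from a second terminal:
#   serve_with_vulkan
#   ollama run gpt-oss
```

The variable has to be set for the server process rather than for the `ollama run` client, since the server is what loads the model onto the GPU.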

@w8sl commented on GitHub (Feb 11, 2026):

> Ollama has experimental support for [Vulkan](https://docs.ollama.com/gpu#vulkan-gpu-support).

Thank you for the info! Indeed, it is available in the Arch repositories as ollama-vulkan, and it works very well!

@juanluisbaptiste commented on GitHub (Feb 17, 2026):

I had this same error on EndeavourOS, both with the native ollama downloaded from ollama.com and with the docker:x.x-rocm image, but I can confirm that using the ollama-vulkan package works perfectly!

@liorgross commented on GitHub (Mar 2, 2026):

I am now facing the same issue after updating my Ubuntu OS to the latest kernel (6.17.0-1012-oem). Everything was working fine prior to the upgrade; now I am getting the 500 error. It seems to detect the GPU and its memory just fine, but fails to load any model onto the GPU. I am using the latest ROCm (7.2.0); I also tried the latest pre-release version, with no luck…

I have also switched to Vulkan instead, and it works fine with that. But I would rather use ROCm like I used to…

(Same hardware: AMD Ryzen 395+; same OS: Ubuntu 24.04)

@w8sl commented on GitHub (Mar 2, 2026):

There doesn't appear to be any penalty; in fact, performance is better with Vulkan, and it is rock-solid stable.
Reference: github-starred/ollama#71101