[GH-ISSUE #1865] Add GPU support for CUDA Compute Capability 5.0 and 5.2 cards #47578

Closed
opened 2026-04-28 04:21:03 -05:00 by GiteaMirror · 14 comments
Owner

Originally created by @Subie1 on GitHub (Jan 9, 2024).
Original GitHub issue: https://github.com/ollama/ollama/issues/1865

Originally assigned to: @dhiltgen on GitHub.

The ollama serve command runs as normally with the detection of my GPU:

2024/01/09 14:37:45 gpu.go:34: Detecting GPU type
ama 2024/01/09 14:37:45 gpu.go:53: Nvidia GPU detected

ggml_init_cublas: found 1 CUDA devices:
  Device 0: Quadro M1000M, compute capability 5.0

Lines which lead me to believe it's loading CUDA:

llm_load_tensors: using CUDA for GPU acceleration
llm_load_tensors: mem required  =   35.23 MiB
llm_load_tensors: offloading 22 repeating layers to GPU
llm_load_tensors: offloading non-repeating layers to GPU
llm_load_tensors: offloaded 23/23 layers to GPU
llm_load_tensors: VRAM used: 571.37 MiB
⠴ .
llama_new_context_with_model: n_ctx      = 2048
llama_new_context_with_model: freq_base  = 10000.0
llama_new_context_with_model: freq_scale = 1
llama_kv_cache_init: VRAM kv self = 44.00 MB
llama_new_context_with_model: KV self size  =   44.00 MiB, K (f16):   22.00 MiB, V (f16):   22.00 MiB
⠦ llama_build_graph: non-view tensors processed: 466/466
llama_new_context_with_model: compute buffer total size = 147.19 MiB
⠧ llama_new_context_with_model: VRAM scratch buffer: 144.00 MiB
llama_new_context_with_model: total VRAM used: 759.38 MiB (model: 571.37 MiB, context: 188.00 MiB)

The once I run a model it starts normally, then before the finish it crashes with this error:

CUDA error 209 at /go/src/github.com/jmorganca/ollama/llm/llama.cpp/ggml-cuda.cu:7801: no kernel image is available for execution on the device
current device: 0
GGML_ASSERT: /go/src/github.com/jmorganca/ollama/llm/llama.cpp/ggml-cuda.cu:7801: !"CUDA error"
⠼ SIGABRT: abort
PC=0x7f7fa7b7b9fc m=11 sigcode=18446744073709551610
signal arrived during cgo execution

Then continues with this huge error:

CUDA error 209 at /go/src/github.com/jmorganca/ollama/llm/llama.cpp/ggml-cuda.cu:7801: no kernel image is available for execution on the device
current device: 0
GGML_ASSERT: /go/src/github.com/jmorganca/ollama/llm/llama.cpp/ggml-cuda.cu:7801: !"CUDA error"
⠋ SIGABRT: abort
PC=0x7fa1d727b9fc m=4 sigcode=18446744073709551610
signal arrived during cgo execution

goroutine 10 [syscall]:
runtime.cgocall(0x9c1470, 0xc0004ca608)
        /usr/local/go/src/runtime/cgocall.go:157 +0x4b fp=0xc0004ca5e0 sp=0xc0004ca5a8 pc=0x4291ab
github.com/jmorganca/ollama/llm._Cfunc_dynamic_shim_llama_server_init({0x7fa1840014a0, 0x7fa13ce7b2e0, 0x7fa13ce6da80, 0x7fa13ce71270, 0x7fa13ce83770, 0x7fa13ce78900, 0x7fa13ce71430, 0x7fa13ce6db00, 0x7fa13ce7ea00, 0x7fa13ce7e5b0, ...}, ...)
        _cgo_gotypes.go:287 +0x45 fp=0xc0004ca608 sp=0xc0004ca5e0 pc=0x7cdd85
github.com/jmorganca/ollama/llm.(*shimExtServer).llama_server_init.func1(0x45971b?, 0x80?, 0x80?)
        /go/src/github.com/jmorganca/ollama/llm/shim_ext_server.go:40 +0xec fp=0xc0004ca6f8 sp=0xc0004ca608 pc=0x7d326c
github.com/jmorganca/ollama/llm.(*shimExtServer).llama_server_init(0xc0000982d0?, 0x0?, 0x200?)
        /go/src/github.com/jmorganca/ollama/llm/shim_ext_server.go:40 +0x13 fp=0xc0004ca720 sp=0xc0004ca6f8 pc=0x7d3153
github.com/jmorganca/ollama/llm.newExtServer({0x17842518, 0xc0000f8360}, {0xc0004b6230, _}, {_, _, _}, {0x0, 0x0, 0x0}, ...)
        /go/src/github.com/jmorganca/ollama/llm/ext_server_common.go:146 +0x7f8 fp=0xc0004ca9a8 sp=0xc0004ca720 pc=0x7cf3b8
github.com/jmorganca/ollama/llm.newDynamicShimExtServer({0xc00047cf00, 0x2b}, {0xc0004b6230, _}, {_, _, _}, {0x0, 0x0, 0x0}, ...)
        /go/src/github.com/jmorganca/ollama/llm/shim_ext_server.go:93 +0x54f fp=0xc0004cabd0 sp=0xc0004ca9a8 pc=0x7d45af
github.com/jmorganca/ollama/llm.newLlmServer({0xc3d801, 0x4}, {0xc0004b6230, _}, {_, _, _}, {0x0, 0x0, 0x0}, ...)
        /go/src/github.com/jmorganca/ollama/llm/llm.go:86 +0x16b fp=0xc0004cad60 sp=0xc0004cabd0 pc=0x7ccecb
github.com/jmorganca/ollama/llm.New({0xc00048e180?, 0x0?}, {0xc0004b6230, _}, {_, _, _}, {0x0, 0x0, 0x0}, ...)
        /go/src/github.com/jmorganca/ollama/llm/llm.go:76 +0x233 fp=0xc0004caef0 sp=0xc0004cad60 pc=0x7ccb33
github.com/jmorganca/ollama/server.load(0xc0004d2000?, 0xc0004d2000, {{0x0, 0x800, 0x200, 0x1, 0xffffffffffffffff, 0x0, 0x0, 0x1, ...}, ...}, ...)
        /go/src/github.com/jmorganca/ollama/server/routes.go:84 +0x425 fp=0xc0004cb0a0 sp=0xc0004caef0 pc=0x99d825
github.com/jmorganca/ollama/server.GenerateHandler(0xc000522200)
        /go/src/github.com/jmorganca/ollama/server/routes.go:191 +0x8c8 fp=0xc0004cb748 sp=0xc0004cb0a0 pc=0x99e5c8
github.com/gin-gonic/gin.(*Context).Next(...)
        /root/go/pkg/mod/github.com/gin-gonic/gin@v1.9.1/context.go:174
github.com/jmorganca/ollama/server.(*Server).GenerateRoutes.func1(0xc000522200)
        /go/src/github.com/jmorganca/ollama/server/routes.go:876 +0x68 fp=0xc0004cb780 sp=0xc0004cb748 pc=0x9a79c8
github.com/gin-gonic/gin.(*Context).Next(...)
        /root/go/pkg/mod/github.com/gin-gonic/gin@v1.9.1/context.go:174
github.com/gin-gonic/gin.CustomRecoveryWithWriter.func1(0xc000522200)
        /root/go/pkg/mod/github.com/gin-gonic/gin@v1.9.1/recovery.go:102 +0x7a fp=0xc0004cb7d0 sp=0xc0004cb780 pc=0x9813ba
github.com/gin-gonic/gin.(*Context).Next(...)
        /root/go/pkg/mod/github.com/gin-gonic/gin@v1.9.1/context.go:174
github.com/gin-gonic/gin.LoggerWithConfig.func1(0xc000522200)
        /root/go/pkg/mod/github.com/gin-gonic/gin@v1.9.1/logger.go:240 +0xde fp=0xc0004cb980 sp=0xc0004cb7d0 pc=0x98055e
github.com/gin-gonic/gin.(*Context).Next(...)
        /root/go/pkg/mod/github.com/gin-gonic/gin@v1.9.1/context.go:174
github.com/gin-gonic/gin.(*Engine).handleHTTPRequest(0xc0000d1ba0, 0xc000522200)
        /root/go/pkg/mod/github.com/gin-gonic/gin@v1.9.1/gin.go:620 +0x65b fp=0xc0004cbb08 sp=0xc0004cb980 pc=0x97f61b
github.com/gin-gonic/gin.(*Engine).ServeHTTP(0xc0000d1ba0, {0x1783c860?, 0xc00041e0e0}, 0xc000522300)
        /root/go/pkg/mod/github.com/gin-gonic/gin@v1.9.1/gin.go:576 +0x1dd fp=0xc0004cbb48 sp=0xc0004cbb08 pc=0x97eddd
net/http.serverHandler.ServeHTTP({0x1783ab80?}, {0x1783c860?, 0xc00041e0e0?}, 0x6?)
        /usr/local/go/src/net/http/server.go:2938 +0x8e fp=0xc0004cbb78 sp=0xc0004cbb48 pc=0x6ee3ee
net/http.(*conn).serve(0xc0004b4120, {0x1783ded8, 0xc0005800f0})
        /usr/local/go/src/net/http/server.go:2009 +0x5f4 fp=0xc0004cbfb8 sp=0xc0004cbb78 pc=0x6ea2d4
net/http.(*Server).Serve.func3()
        /usr/local/go/src/net/http/server.go:3086 +0x28 fp=0xc0004cbfe0 sp=0xc0004cbfb8 pc=0x6eec08
runtime.goexit()
        /usr/local/go/src/runtime/asm_amd64.s:1650 +0x1 fp=0xc0004cbfe8 sp=0xc0004cbfe0 pc=0x48d961
created by net/http.(*Server).Serve in goroutine 1
        /usr/local/go/src/net/http/server.go:3086 +0x5cb

goroutine 1 [IO wait]:
runtime.gopark(0x4a05b0?, 0xc000533828?, 0x78?, 0x38?, 0x5166dd?)
        /usr/local/go/src/runtime/proc.go:398 +0xce fp=0xc000615808 sp=0xc0006157e8 pc=0x45de8e
runtime.netpollblock(0x48b9d2?, 0x428946?, 0x0?)
        /usr/local/go/src/runtime/netpoll.go:564 +0xf7 fp=0xc000615840 sp=0xc000615808 pc=0x456917
internal/poll.runtime_pollWait(0x7fa18ddcbe80, 0x72)
        /usr/local/go/src/runtime/netpoll.go:343 +0x85 fp=0xc000615860 sp=0xc000615840 pc=0x4880a5
internal/poll.(*pollDesc).wait(0xc000462000?, 0x4?, 0x0)
        /usr/local/go/src/internal/poll/fd_poll_runtime.go:84 +0x27 fp=0xc000615888 sp=0xc000615860 pc=0x50f327
internal/poll.(*pollDesc).waitRead(...)
        /usr/local/go/src/internal/poll/fd_poll_runtime.go:89
internal/poll.(*FD).Accept(0xc000462000)
        /usr/local/go/src/internal/poll/fd_unix.go:611 +0x2ac fp=0xc000615930 sp=0xc000615888 pc=0x51480c
net.(*netFD).accept(0xc000462000)
        /usr/local/go/src/net/fd_unix.go:172 +0x29 fp=0xc0006159e8 sp=0xc000615930 pc=0x58b3e9
net.(*TCPListener).accept(0xc00043b5a0)
        /usr/local/go/src/net/tcpsock_posix.go:152 +0x1e fp=0xc000615a10 sp=0xc0006159e8 pc=0x5a01fe
)
        ⠙ /usr/local/go/src/net/tcpsock.go:315 +0x30 fp=0xc000615a40 sp=0xc000615a10 pc=0x59f3b0
net/http.(*onceCloseListener).Accept(0xc0004b4120?)
        <autogenerated>:1 +0x24 fp=0xc000615a58 sp=0xc000615a40 pc=0x711184
net/http.(*Server).Serve(0xc000366ff0, {0x1783c650, 0xc00043b5a0})
        /usr/local/go/src/net/http/server.go:3056 +0x364 fp=0xc000615b88 sp=0xc000615a58 pc=0x6ee844
github.com/jmorganca/ollama/server.Serve({0x1783c650, 0xc00043b5a0})
        /go/src/github.com/jmorganca/ollama/server/routes.go:956 +0x389 fp=0xc000615c98 sp=0xc000615b88 pc=0x9a7da9
github.com/jmorganca/ollama/cmd.RunServer(0xc000460300?, {0x17d9db40?, 0x4?, 0xc3d4f5?})
        /go/src/github.com/jmorganca/ollama/cmd/cmd.go:634 +0x199 fp=0xc000615d30 sp=0xc000615c98 pc=0x9b9f99
github.com/spf13/cobra.(*Command).execute(0xc000419800, {0x17d9db40, 0x0, 0x0})
        /root/go/pkg/mod/github.com/spf13/cobra@v1.7.0/command.go:940 +0x87c fp=0xc000615e68 sp=0xc000615d30 pc=0x783fbc
github.com/spf13/cobra.(*Command).ExecuteC(0xc000418c00)
        /root/go/pkg/mod/github.com/spf13/cobra@v1.7.0/command.go:1068 +0x3a5 fp=0xc000615f20 sp=0xc000615e68 pc=0x7847e5
github.com/spf13/cobra.(*Command).Execute(...)
        /root/go/pkg/mod/github.com/spf13/cobra@v1.7.0/command.go:992
github.com/spf13/cobra.(*Command).ExecuteContext(...)
        /root/go/pkg/mod/github.com/spf13/cobra@v1.7.0/command.go:985
main.main()
        /go/src/github.com/jmorganca/ollama/main.go:11 +0x4d fp=0xc000615f40 sp=0xc000615f20 pc=0x9c04cd
runtime.main()
        /usr/local/go/src/runtime/proc.go:267 +0x2bb fp=0xc000615fe0 sp=0xc000615f40 pc=0x45da3b
runtime.goexit()
        /usr/local/go/src/runtime/asm_amd64.s:1650 +0x1 fp=0xc000615fe8 sp=0xc000615fe0 pc=0x48d961

goroutine 2 [force gc (idle)]:
runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?)
        /usr/local/go/src/runtime/proc.go:398 +0xce fp=0xc000058fa8 sp=0xc000058f88 pc=0x45de8e
runtime.goparkunlock(...)
        /usr/local/go/src/runtime/proc.go:404
runtime.forcegchelper()
        /usr/local/go/src/runtime/proc.go:322 +0xb3 fp=0xc000058fe0 sp=0xc000058fa8 pc=0x45dd13
runtime.goexit()
        /usr/local/go/src/runtime/asm_amd64.s:1650 +0x1 fp=0xc000058fe8 sp=0xc000058fe0 pc=0x48d961
created by runtime.init.6 in goroutine 1
        /usr/local/go/src/runtime/proc.go:310 +0x1a

goroutine 3 [GC sweep wait]:
runtime.gopark(0x1?, 0x0?, 0x0?, 0x0?, 0x0?)
        /usr/local/go/src/runtime/proc.go:398 +0xce fp=0xc000059778 sp=0xc000059758 pc=0x45de8e
runtime.goparkunlock(...)
        /usr/local/go/src/runtime/proc.go:404
runtime.bgsweep(0x0?)
        /usr/local/go/src/runtime/mgcsweep.go:321 +0xdf fp=0xc0000597c8 sp=0xc000059778 pc=0x449ddf
runtime.gcenable.func1()
        /usr/local/go/src/runtime/mgc.go:200 +0x25 fp=0xc0000597e0 sp=0xc0000597c8 pc=0x43ef05
runtime.goexit()
        /usr/local/go/src/runtime/asm_amd64.s:1650 +0x1 fp=0xc0000597e8 sp=0xc0000597e0 pc=0x48d961
created by runtime.gcenable in goroutine 1
        /usr/local/go/src/runtime/mgc.go:200 +0x66

goroutine 4 [GC scavenge wait]:
runtime.gopark(0x19a55b4?, 0x188b346?, 0x0?, 0x0?, 0x0?)
        /usr/local/go/src/runtime/proc.go:398 +0xce fp=0xc000059f70 sp=0xc000059f50 pc=0x45de8e
runtime.goparkunlock(...)
        /usr/local/go/src/runtime/proc.go:404
runtime.(*scavengerState).park(0x17ca7640)
        /usr/local/go/src/runtime/mgcscavenge.go:425 +0x49 fp=0xc000059fa0 sp=0xc000059f70 pc=0x447609
runtime.bgscavenge(0x0?)
        /usr/local/go/src/runtime/mgcscavenge.go:658 +0x59 fp=0xc000059fc8 sp=0xc000059fa0 pc=0x447bb9
runtime.gcenable.func2()
        /usr/local/go/src/runtime/mgc.go:201 +0x25 fp=0xc000059fe0 sp=0xc000059fc8 pc=0x43eea5
runtime.goexit()
        /usr/local/go/src/runtime/asm_amd64.s:1650 +0x1 fp=0xc000059fe8 sp=0xc000059fe0 pc=0x48d961
created by runtime.gcenable in goroutine 1
        /usr/local/go/src/runtime/mgc.go:201 +0xa5

goroutine 5 [finalizer wait]:
runtime.gopark(0xc364c0?, 0x10045f001?, 0x0?, 0x0?, 0x466045?)
        /usr/local/go/src/runtime/proc.go:398 +0xce fp=0xc000058628 sp=0xc000058608 pc=0x45de8e
runtime.runfinq()
        /usr/local/go/src/runtime/mfinal.go:193 +0x107 fp=0xc0000587e0 sp=0xc000058628 pc=0x43df87
runtime.goexit()
        /usr/local/go/src/runtime/asm_amd64.s:1650 +0x1 fp=0xc0000587e8 sp=0xc0000587e0 pc=0x48d961
created by runtime.createfing in goroutine 1
        /usr/local/go/src/runtime/mfinal.go:163 +0x3d

goroutine 6 [select, locked to thread]:
runtime.gopark(0xc00005a7a8?, 0x2?, 0x29?, 0xe1?, 0xc00005a7a4?)
        /usr/local/go/src/runtime/proc.go:398 +0xce fp=0xc00005a638 sp=0xc00005a618 pc=0x45de8e
runtime.selectgo(0xc00005a7a8, 0xc00005a7a0, 0x0?, 0x0, 0x0?, 0x1)
        /usr/local/go/src/runtime/select.go:327 +0x725 fp=0xc00005a758 sp=0xc00005a638 pc=0x46d9c5
runtime.ensureSigM.func1()
        /usr/local/go/src/runtime/signal_unix.go:1014 +0x19f fp=0xc00005a7e0 sp=0xc00005a758 pc=0x4849ff
runtime.goexit()
        /usr/local/go/src/runtime/asm_amd64.s:1650 +0x1 fp=0xc00005a7e8 sp=0xc00005a7e0 pc=0x48d961
created by runtime.ensureSigM in goroutine 1
        /usr/local/go/src/runtime/signal_unix.go:997 +0xc8

goroutine 18 [syscall]:
runtime.notetsleepg(0x0?, 0x0?)
        /usr/local/go/src/runtime/lock_futex.go:236 +0x29 fp=0xc0000547a0 sp=0xc000054768 pc=0x4309e9
os/signal.signal_recv()
        /usr/local/go/src/runtime/sigqueue.go:152 +0x29 fp=0xc0000547c0 sp=0xc0000547a0 pc=0x48a329
os/signal.loop()
        /usr/local/go/src/os/signal/signal_unix.go:23 +0x13 fp=0xc0000547e0 sp=0xc0000547c0 pc=0x713bb3
runtime.goexit()
        /usr/local/go/src/runtime/asm_amd64.s:1650 +0x1 fp=0xc0000547e8 sp=0xc0000547e0 pc=0x48d961
created by os/signal.Notify.func1.1 in goroutine 1
        /usr/local/go/src/os/signal/signal.go:151 +0x1f

goroutine 7 [chan receive]:
runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?)
        /usr/local/go/src/runtime/proc.go:398 +0xce fp=0xc00005af18 sp=0xc00005aef8 pc=0x45de8e
runtime.chanrecv(0xc00018da40, 0x0, 0x1)
        /usr/local/go/src/runtime/chan.go:583 +0x3cd fp=0xc00005af90 sp=0xc00005af18 pc=0x42b58d
runtime.chanrecv1(0x0?, 0x0?)
        /usr/local/go/src/runtime/chan.go:442 +0x12 fp=0xc00005afb8 sp=0xc00005af90 pc=0x42b192
github.com/jmorganca/ollama/server.Serve.func1()
        /go/src/github.com/jmorganca/ollama/server/routes.go:938 +0x25 fp=0xc00005afe0 sp=0xc00005afb8 pc=0x9a7ea5
runtime.goexit()
        /usr/local/go/src/runtime/asm_amd64.s:1650 +0x1 fp=0xc00005afe8 sp=0xc00005afe0 pc=0x48d961
created by github.com/jmorganca/ollama/server.Serve in goroutine 1
        /go/src/github.com/jmorganca/ollama/server/routes.go:937 +0x285

goroutine 34 [GC worker (idle)]:
runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?)
        /usr/local/go/src/runtime/proc.go:398 +0xce fp=0xc00058c750 sp=0xc00058c730 pc=0x45de8e
runtime.gcBgMarkWorker()
        /usr/local/go/src/runtime/mgc.go:1293 +0xe5 fp=0xc00058c7e0 sp=0xc00058c750 pc=0x440a85
runtime.goexit()
        /usr/local/go/src/runtime/asm_amd64.s:1650 +0x1 fp=0xc00058c7e8 sp=0xc00058c7e0 pc=0x48d961
created by runtime.gcBgMarkStartWorkers in goroutine 1
        /usr/local/go/src/runtime/mgc.go:1217 +0x1c

goroutine 35 [GC worker (idle)]:
runtime.gopark(0x2d8659670228?, 0x3?, 0xb4?, 0x48?, 0x0?)
        /usr/local/go/src/runtime/proc.go:398 +0xce fp=0xc00058cf50 sp=0xc00058cf30 pc=0x45de8e
runtime.gcBgMarkWorker()
        /usr/local/go/src/runtime/mgc.go:1293 +0xe5 fp=0xc00058cfe0 sp=0xc00058cf50 pc=0x440a85
runtime.goexit()
        /usr/local/go/src/runtime/asm_amd64.s:1650 +0x1 fp=0xc00058cfe8 sp=0xc00058cfe0 pc=0x48d961
created by runtime.gcBgMarkStartWorkers in goroutine 1
        /usr/local/go/src/runtime/mgc.go:1217 +0x1c

goroutine 19 [GC worker (idle)]:
runtime.gopark(0x2d8657fbbc81?, 0x3?, 0x24?, 0xe7?, 0x0?)
        /usr/local/go/src/runtime/proc.go:398 +0xce fp=0xc000054f50 sp=0xc000054f30 pc=0x45de8e
runtime.gcBgMarkWorker()
        /usr/local/go/src/runtime/mgc.go:1293 +0xe5 fp=0xc000054fe0 sp=0xc000054f50 pc=0x440a85
runtime.goexit()
        /usr/local/go/src/runtime/asm_amd64.s:1650 +0x1 fp=0xc000054fe8 sp=0xc000054fe0 pc=0x48d961
created by runtime.gcBgMarkStartWorkers in goroutine 1
        /usr/local/go/src/runtime/mgc.go:1217 +0x1c

goroutine 20 [GC worker (idle)]:
runtime.gopark(0x2d865966fa58?, 0x1?, 0xa0?, 0xd3?, 0x0?)
        /usr/local/go/src/runtime/proc.go:398 +0xce fp=0xc000055750 sp=0xc000055730 pc=0x45de8e
runtime.gcBgMarkWorker()
        /usr/local/go/src/runtime/mgc.go:1293 +0xe5 fp=0xc0000557e0 sp=0xc000055750 pc=0x440a85
runtime.goexit()
        /usr/local/go/src/runtime/asm_amd64.s:1650 +0x1 fp=0xc0000557e8 sp=0xc0000557e0 pc=0x48d961
created by runtime.gcBgMarkStartWorkers in goroutine 1
        /usr/local/go/src/runtime/mgc.go:1217 +0x1c

goroutine 50 [GC worker (idle)]:
runtime.gopark(0x2d8657fbb961?, 0x3?, 0x58?, 0x66?, 0x0?)
        /usr/local/go/src/runtime/proc.go:398 +0xce fp=0xc000588750 sp=0xc000588730 pc=0x45de8e
runtime.gcBgMarkWorker()
        /usr/local/go/src/runtime/mgc.go:1293 +0xe5 fp=0xc0005887e0 sp=0xc000588750 pc=0x440a85
runtime.goexit()
        /usr/local/go/src/runtime/asm_amd64.s:1650 +0x1 fp=0xc0005887e8 sp=0xc0005887e0 pc=0x48d961
created by runtime.gcBgMarkStartWorkers in goroutine 1
        /usr/local/go/src/runtime/mgc.go:1217 +0x1c

goroutine 8 [GC worker (idle)]:
runtime.gopark(0x17d9f7a0?, 0x1?, 0xe4?, 0xfa?, 0x0?)
        /usr/local/go/src/runtime/proc.go:398 +0xce fp=0xc00005b750 sp=0xc00005b730 pc=0x45de8e
runtime.gcBgMarkWorker()
        /usr/local/go/src/runtime/mgc.go:1293 +0xe5 fp=0xc00005b7e0 sp=0xc00005b750 pc=0x440a85
runtime.goexit()
        /usr/local/go/src/runtime/asm_amd64.s:1650 +0x1 fp=0xc00005b7e8 sp=0xc00005b7e0 pc=0x48d961
created by runtime.gcBgMarkStartWorkers in goroutine 1
        /usr/local/go/src/runtime/mgc.go:1217 +0x1c

goroutine 21 [GC worker (idle)]:
runtime.gopark(0x2d865963289c?, 0x3?, 0xc0?, 0x89?, 0x0?)
        /usr/local/go/src/runtime/proc.go:398 +0xce fp=0xc000055f50 sp=0xc000055f30 pc=0x45de8e
runtime.gcBgMarkWorker()
        /usr/local/go/src/runtime/mgc.go:1293 +0xe5 fp=0xc000055fe0 sp=0xc000055f50 pc=0x440a85
runtime.goexit()
        /usr/local/go/src/runtime/asm_amd64.s:1650 +0x1 fp=0xc000055fe8 sp=0xc000055fe0 pc=0x48d961
created by runtime.gcBgMarkStartWorkers in goroutine 1
        /usr/local/go/src/runtime/mgc.go:1217 +0x1c

goroutine 51 [GC worker (idle)]:
runtime.gopark(0x2d865966f800?, 0x3?, 0xd8?, 0x8c?, 0x0?)
        /usr/local/go/src/runtime/proc.go:398 +0xce fp=0xc000588f50 sp=0xc000588f30 pc=0x45de8e
runtime.gcBgMarkWorker()
        /usr/local/go/src/runtime/mgc.go:1293 +0xe5 fp=0xc000588fe0 sp=0xc000588f50 pc=0x440a85
runtime.goexit()
        /usr/local/go/src/runtime/asm_amd64.s:1650 +0x1 fp=0xc000588fe8 sp=0xc000588fe0 pc=0x48d961
created by runtime.gcBgMarkStartWorkers in goroutine 1
        /usr/local/go/src/runtime/mgc.go:1217 +0x1c

goroutine 52 [IO wait]:
runtime.gopark(0x75?, 0xb?, 0x0?, 0x0?, 0x8?)
        /usr/local/go/src/runtime/proc.go:398 +0xce fp=0xc0004bd8f8 sp=0xc0004bd8d8 pc=0x45de8e
runtime.netpollblock(0x49e718?, 0x428946?, 0x0?)
        /usr/local/go/src/runtime/netpoll.go:564 +0xf7 fp=0xc0004bd930 sp=0xc0004bd8f8 pc=0x456917
internal/poll.runtime_pollWait(0x7fa18ddcbd88, 0x72)
        /usr/local/go/src/runtime/netpoll.go:343 +0x85 fp=0xc0004bd950 sp=0xc0004bd930 pc=0x4880a5
internal/poll.(*pollDesc).wait(0xc000462080?, 0xc0004b2000?, 0x0)
        /usr/local/go/src/internal/poll/fd_poll_runtime.go:84 +0x27 fp=0xc0004bd978 sp=0xc0004bd950 pc=0x50f327
internal/poll.(*pollDesc).waitRead(...)
        /usr/local/go/src/internal/poll/fd_poll_runtime.go:89
internal/poll.(*FD).Read(0xc000462080, {0xc0004b2000, 0x1000, 0x1000})
        /usr/local/go/src/internal/poll/fd_unix.go:164 +0x27a fp=0xc0004bda10 sp=0xc0004bd978 pc=0x51061a
net.(*netFD).Read(0xc000462080, {0xc0004b2000?, 0x50f7e5?, 0x0?})
        /usr/local/go/src/net/fd_posix.go:55 +0x25 fp=0xc0004bda58 sp=0xc0004bda10 pc=0x5893c5
net.(*conn).Read(0xc000592010, {0xc0004b2000?, 0x0?, 0xc00048a0f8?})
        /usr/local/go/src/net/net.go:179 +0x45 fp=0xc0004bdaa0 sp=0xc0004bda58 pc=0x597665
net.(*TCPConn).Read(0xc00048a0f0?, {0xc0004b2000?, 0x0?, 0xc0004bdac0?})
        <autogenerated>:1 +0x25 fp=0xc0004bdad0 sp=0xc0004bdaa0 pc=0x5a9565
net/http.(*connReader).Read(0xc00048a0f0, {0xc0004b2000, 0x1000, 0x1000})
        /usr/local/go/src/net/http/server.go:791 +0x14b fp=0xc0004bdb20 sp=0xc0004bdad0 pc=0x6e458b
bufio.(*Reader).fill(0xc000516060)
        /usr/local/go/src/bufio/bufio.go:113 +0x103 fp=0xc0004bdb58 sp=0xc0004bdb20 pc=0x6741c3
bufio.(*Reader).Peek(0xc000516060, 0x4)
        /usr/local/go/src/bufio/bufio.go:151 +0x53 fp=0xc0004bdb78 sp=0xc0004bdb58 pc=0x6742f3
net/http.(*conn).serve(0xc0000fc1b0, {0x1783ded8, 0xc0005800f0})
        /usr/local/go/src/net/http/server.go:2044 +0x75c fp=0xc0004bdfb8 sp=0xc0004bdb78 pc=0x6ea43c
net/http.(*Server).Serve.func3()
        /usr/local/go/src/net/http/server.go:3086 +0x28 fp=0xc0004bdfe0 sp=0xc0004bdfb8 pc=0x6eec08
runtime.goexit()
        /usr/local/go/src/runtime/asm_amd64.s:1650 +0x1 fp=0xc0004bdfe8 sp=0xc0004bdfe0 pc=0x48d961
created by net/http.(*Server).Serve in goroutine 1
        /usr/local/go/src/net/http/server.go:3086 +0x5cb

goroutine 53 [IO wait]:
runtime.gopark(0x4f8?, 0xb?, 0x0?, 0x0?, 0x9?)
        /usr/local/go/src/runtime/proc.go:398 +0xce fp=0xc0004bf8f8 sp=0xc0004bf8d8 pc=0x45de8e
runtime.netpollblock(0x49e718?, 0x428946?, 0x0?)
        /usr/local/go/src/runtime/netpoll.go:564 +0xf7 fp=0xc0004bf930 sp=0xc0004bf8f8 pc=0x456917
internal/poll.runtime_pollWait(0x7fa18ddcbc90, 0x72)
        /usr/local/go/src/runtime/netpoll.go:343 +0x85 fp=0xc0004bf950 sp=0xc0004bf930 pc=0x4880a5
internal/poll.(*pollDesc).wait(0xc000462180?, 0xc00053c000?, 0x0)
        /usr/local/go/src/internal/poll/fd_poll_runtime.go:84 +0x27 fp=0xc0004bf978 sp=0xc0004bf950 pc=0x50f327
internal/poll.(*pollDesc).waitRead(...)
        /usr/local/go/src/internal/poll/fd_poll_runtime.go:89
internal/poll.(*FD).Read(0xc000462180, {0xc00053c000, 0x1000, 0x1000})
        /usr/local/go/src/internal/poll/fd_unix.go:164 +0x27a fp=0xc0004bfa10 sp=0xc0004bf978 pc=0x51061a
net.(*netFD).Read(0xc000462180, {0xc00053c000?, 0x50f7e5?, 0x0?})
        /usr/local/go/src/net/fd_posix.go:55 +0x25 fp=0xc0004bfa58 sp=0xc0004bfa10 pc=0x5893c5
net.(*conn).Read(0xc000592018, {0xc00053c000?, 0x0?, 0xc000580218?})
        /usr/local/go/src/net/net.go:179 +0x45 fp=0xc0004bfaa0 sp=0xc0004bfa58 pc=0x597665
net.(*TCPConn).Read(0xc000580210?, {0xc00053c000?, 0x0?, 0xc000551ac0?})
        <autogenerated>:1 +0x25 fp=0xc0004bfad0 sp=0xc0004bfaa0 pc=0x5a9565
net/http.(*connReader).Read(0xc000580210, {0xc00053c000, 0x1000, 0x1000})
        /usr/local/go/src/net/http/server.go:791 +0x14b fp=0xc0004bfb20 sp=0xc0004bfad0 pc=0x6e458b
bufio.(*Reader).fill(0xc00010e060)
        /usr/local/go/src/bufio/bufio.go:113 +0x103 fp=0xc0004bfb58 sp=0xc0004bfb20 pc=0x6741c3
bufio.(*Reader).Peek(0xc00010e060, 0x4)
        /usr/local/go/src/bufio/bufio.go:151 +0x53 fp=0xc0004bfb78 sp=0xc0004bfb58 pc=0x6742f3
net/http.(*conn).serve(0xc0000fc240, {0x1783ded8, 0xc0005800f0})
        /usr/local/go/src/net/http/server.go:2044 +0x75c fp=0xc0004bffb8 sp=0xc0004bfb78 pc=0x6ea43c
net/http.(*Server).Serve.func3()
        /usr/local/go/src/net/http/server.go:3086 +0x28 fp=0xc0004bffe0 sp=0xc0004bffb8 pc=0x6eec08
runtime.goexit()
        /usr/local/go/src/runtime/asm_amd64.s:1650 +0x1 fp=0xc0004bffe8 sp=0xc0004bffe0 pc=0x48d961
created by net/http.(*Server).Serve in goroutine 1
        /usr/local/go/src/net/http/server.go:3086 +0x5cb

goroutine 22 [IO wait]:
runtime.gopark(0x4f8?, 0xb?, 0x0?, 0x0?, 0xa?)
        /usr/local/go/src/runtime/proc.go:398 +0xce fp=0xc0004b98f8 sp=0xc0004b98d8 pc=0x45de8e
runtime.netpollblock(0x49e718?, 0x428946?, 0x0?)
        /usr/local/go/src/runtime/netpoll.go:564 +0xf7 fp=0xc0004b9930 sp=0xc0004b98f8 pc=0x456917
internal/poll.runtime_pollWait(0x7fa18ddcbb98, 0x72)
        /usr/local/go/src/runtime/netpoll.go:343 +0x85 fp=0xc0004b9950 sp=0xc0004b9930 pc=0x4880a5
internal/poll.(*pollDesc).wait(0xc000186000?, 0xc0000c8000?, 0x0)
        /usr/local/go/src/internal/poll/fd_poll_runtime.go:84 +0x27 fp=0xc0004b9978 sp=0xc0004b9950 pc=0x50f327
internal/poll.(*pollDesc).waitRead(...)
        /usr/local/go/src/internal/poll/fd_poll_runtime.go:89
internal/poll.(*FD).Read(0xc000186000, {0xc0000c8000, 0x1000, 0x1000})
        /usr/local/go/src/internal/poll/fd_unix.go:164 +0x27a fp=0xc0004b9a10 sp=0xc0004b9978 pc=0x51061a
net.(*netFD).Read(0xc000186000, {0xc0000c8000?, 0x50f7e5?, 0x0?})
        /usr/local/go/src/net/fd_posix.go:55 +0x25 fp=0xc0004b9a58 sp=0xc0004b9a10 pc=0x5893c5
net.(*conn).Read(0xc000080000, {0xc0000c8000?, 0x0?, 0xc000098518?})
        /usr/local/go/src/net/net.go:179 +0x45 fp=0xc0004b9aa0 sp=0xc0004b9a58 pc=0x597665
net.(*TCPConn).Read(0xc000098510?, {0xc0000c8000?, 0x0?, 0xc00054dac0?})
        <autogenerated>:1 +0x25 fp=0xc0004b9ad0 sp=0xc0004b9aa0 pc=0x5a9565
net/http.(*connReader).Read(0xc000098510, {0xc0000c8000, 0x1000, 0x1000})
        /usr/local/go/src/net/http/server.go:791 +0x14b fp=0xc0004b9b20 sp=0xc0004b9ad0 pc=0x6e458b
bufio.(*Reader).fill(0xc00018c7e0)
        /usr/local/go/src/bufio/bufio.go:113 +0x103 fp=0xc0004b9b58 sp=0xc0004b9b20 pc=0x6741c3
bufio.(*Reader).Peek(0xc00018c7e0, 0x4)
        /usr/local/go/src/bufio/bufio.go:151 +0x53 fp=0xc0004b9b78 sp=0xc0004b9b58 pc=0x6742f3
net/http.(*conn).serve(0xc0000c6000, {0x1783ded8, 0xc0005800f0})
        /usr/local/go/src/net/http/server.go:2044 +0x75c fp=0xc0004b9fb8 sp=0xc0004b9b78 pc=0x6ea43c
net/http.(*Server).Serve.func3()
        /usr/local/go/src/net/http/server.go:3086 +0x28 fp=0xc0004b9fe0 sp=0xc0004b9fb8 pc=0x6eec08
runtime.goexit()
        /usr/local/go/src/runtime/asm_amd64.s:1650 +0x1 fp=0xc0004b9fe8 sp=0xc0004b9fe0 pc=0x48d961
created by net/http.(*Server).Serve in goroutine 1
        /usr/local/go/src/net/http/server.go:3086 +0x5cb

goroutine 11 [IO wait]:
runtime.gopark(0x0?, 0xb?, 0x0?, 0x0?, 0xb?)
        /usr/local/go/src/runtime/proc.go:398 +0xce fp=0xc00058eda0 sp=0xc00058ed80 pc=0x45de8e
runtime.netpollblock(0x49e718?, 0x428946?, 0x0?)
        /usr/local/go/src/runtime/netpoll.go:564 +0xf7 fp=0xc00058edd8 sp=0xc00058eda0 pc=0x456917
internal/poll.runtime_pollWait(0x7fa18ddcbaa0, 0x72)
        /usr/local/go/src/runtime/netpoll.go:343 +0x85 fp=0xc00058edf8 sp=0xc00058edd8 pc=0x4880a5
internal/poll.(*pollDesc).wait(0xc00041a000?, 0xc00048a6a1?, 0x0)
        /usr/local/go/src/internal/poll/fd_poll_runtime.go:84 +0x27 fp=0xc00058ee20 sp=0xc00058edf8 pc=0x50f327
internal/poll.(*pollDesc).waitRead(...)
        /usr/local/go/src/internal/poll/fd_poll_runtime.go:89
internal/poll.(*FD).Read(0xc00041a000, {0xc00048a6a1, 0x1, 0x1})
        /usr/local/go/src/internal/poll/fd_unix.go:164 +0x27a fp=0xc00058eeb8 sp=0xc00058ee20 pc=0x51061a
net.(*netFD).Read(0xc00041a000, {0xc00048a6a1?, 0xc00058ef40?, 0x48a030?})
        /usr/local/go/src/net/fd_posix.go:55 +0x25 fp=0xc00058ef00 sp=0xc00058eeb8 pc=0x5893c5
net.(*conn).Read(0xc00005c040, {0xc00048a6a1?, 0x1?, 0xc0004240a0?})
        /usr/local/go/src/net/net.go:179 +0x45 fp=0xc00058ef48 sp=0xc00058ef00 pc=0x597665
net.(*TCPConn).Read(0xc00048a0f0?, {0xc00048a6a1?, 0xc0004240a0?, 0x0?})
        <autogenerated>:1 +0x25 fp=0xc00058ef78 sp=0xc00058ef48 pc=0x5a9565
net/http.(*connReader).backgroundRead(0xc00048a690)
        /usr/local/go/src/net/http/server.go:683 +0x37 fp=0xc00058efc8 sp=0xc00058ef78 pc=0x6e4157
net/http.(*connReader).startBackgroundRead.func2()
        /usr/local/go/src/net/http/server.go:679 +0x25 fp=0xc00058efe0 sp=0xc00058efc8 pc=0x6e4085
runtime.goexit()
        /usr/local/go/src/runtime/asm_amd64.s:1650 +0x1 fp=0xc00058efe8 sp=0xc00058efe0 pc=0x48d961
created by net/http.(*connReader).startBackgroundRead in goroutine 10
        /usr/local/go/src/net/http/server.go:679 +0xba

rax    0x0
rbx    0x7fa18f649640
rcx    0x7fa1d727b9fc
rdx    0x6
rdi    0x211
rsi    0x211
rbp    0x211
rsp    0x7fa18f6481f0
r8     0x7fa18f6482c0
r9     0x1
r10    0x8
r11    0x246
r12    0x6
r13    0x16
r14    0x60c3f8000
r15    0x0
rip    0x7fa1d727b9fc
rflags 0x246
cs     0x33
fs     0x0
gs     0x0
SIGABRT: abort
PC=0x7fa1d727b9fc m=4 sigcode=18446744073709551610
signal arrived during cgo execution

goroutine 10 [syscall]:
runtime.cgocall(0x9c1470, 0xc0004ca608)
        /usr/local/go/src/runtime/cgocall.go:157 +0x4b fp=0xc0004ca5e0 sp=0xc0004ca5a8 pc=0x4291ab
github.com/jmorganca/ollama/llm._Cfunc_dynamic_shim_llama_server_init({0x7fa1840014a0, 0x7fa13ce7b2e0, 0x7fa13ce6da80, 0x7fa13ce71270, 0x7fa13ce83770, 0x7fa13ce78900, 0x7fa13ce71430, 0x7fa13ce6db00, 0x7fa13ce7ea00, 0x7fa13ce7e5b0, ...}, ...)
        _cgo_gotypes.go:287 +0x45 fp=0xc0004ca608 sp=0xc0004ca5e0 pc=0x7cdd85
github.com/jmorganca/ollama/llm.(*shimExtServer).llama_server_init.func1(0x45971b?, 0x80?, 0x80?)
        /go/src/github.com/jmorganca/ollama/llm/shim_ext_server.go:40 +0xec fp=0xc0004ca6f8 sp=0xc0004ca608 pc=0x7d326c
github.com/jmorganca/ollama/llm.(*shimExtServer).llama_server_init(0xc0000982d0?, 0x0?, 0x200?)
        /go/src/github.com/jmorganca/ollama/llm/shim_ext_server.go:40 +0x13 fp=0xc0004ca720 sp=0xc0004ca6f8 pc=0x7d3153
github.com/jmorganca/ollama/llm.newExtServer({0x17842518, 0xc0000f8360}, {0xc0004b6230, _}, {_, _, _}, {0x0, 0x0, 0x0}, ...)
        /go/src/github.com/jmorganca/ollama/llm/ext_server_common.go:146 +0x7f8 fp=0xc0004ca9a8 sp=0xc0004ca720 pc=0x7cf3b8
github.com/jmorganca/ollama/llm.newDynamicShimExtServer({0xc00047cf00, 0x2b}, {0xc0004b6230, _}, {_, _, _}, {0x0, 0x0, 0x0}, ...)
        /go/src/github.com/jmorganca/ollama/llm/shim_ext_server.go:93 +0x54f fp=0xc0004cabd0 sp=0xc0004ca9a8 pc=0x7d45af
github.com/jmorganca/ollama/llm.newLlmServer({0xc3d801, 0x4}, {0xc0004b6230, _}, {_, _, _}, {0x0, 0x0, 0x0}, ...)
        /go/src/github.com/jmorganca/ollama/llm/llm.go:86 +0x16b fp=0xc0004cad60 sp=0xc0004cabd0 pc=0x7ccecb
github.com/jmorganca/ollama/llm.New({0xc00048e180?, 0x0?}, {0xc0004b6230, _}, {_, _, _}, {0x0, 0x0, 0x0}, ...)
        /go/src/github.com/jmorganca/ollama/llm/llm.go:76 +0x233 fp=0xc0004caef0 sp=0xc0004cad60 pc=0x7ccb33
github.com/jmorganca/ollama/server.load(0xc0004d2000?, 0xc0004d2000, {{0x0, 0x800, 0x200, 0x1, 0xffffffffffffffff, 0x0, 0x0, 0x1, ...}, ...}, ...)
        /go/src/github.com/jmorganca/ollama/server/routes.go:84 +0x425 fp=0xc0004cb0a0 sp=0xc0004caef0 pc=0x99d825
github.com/jmorganca/ollama/server.GenerateHandler(0xc000522200)
        /go/src/github.com/jmorganca/ollama/server/routes.go:191 +0x8c8 fp=0xc0004cb748 sp=0xc0004cb0a0 pc=0x99e5c8
github.com/gin-gonic/gin.(*Context).Next(...)
        /root/go/pkg/mod/github.com/gin-gonic/gin@v1.9.1/context.go:174
github.com/jmorganca/ollama/server.(*Server).GenerateRoutes.func1(0xc000522200)
        /go/src/github.com/jmorganca/ollama/server/routes.go:876 +0x68 fp=0xc0004cb780 sp=0xc0004cb748 pc=0x9a79c8
github.com/gin-gonic/gin.(*Context).Next(...)
        /root/go/pkg/mod/github.com/gin-gonic/gin@v1.9.1/context.go:174
github.com/gin-gonic/gin.CustomRecoveryWithWriter.func1(0xc000522200)
        /root/go/pkg/mod/github.com/gin-gonic/gin@v1.9.1/recovery.go:102 +0x7a fp=0xc0004cb7d0 sp=0xc0004cb780 pc=0x9813ba
github.com/gin-gonic/gin.(*Context).Next(...)
        /root/go/pkg/mod/github.com/gin-gonic/gin@v1.9.1/context.go:174
github.com/gin-gonic/gin.LoggerWithConfig.func1(0xc000522200)
        /root/go/pkg/mod/github.com/gin-gonic/gin@v1.9.1/logger.go:240 +0xde fp=0xc0004cb980 sp=0xc0004cb7d0 pc=0x98055e
github.com/gin-gonic/gin.(*Context).Next(...)
        /root/go/pkg/mod/github.com/gin-gonic/gin@v1.9.1/context.go:174
github.com/gin-gonic/gin.(*Engine).handleHTTPRequest(0xc0000d1ba0, 0xc000522200)
        /root/go/pkg/mod/github.com/gin-gonic/gin@v1.9.1/gin.go:620 +0x65b fp=0xc0004cbb08 sp=0xc0004cb980 pc=0x97f61b
github.com/gin-gonic/gin.(*Engine).ServeHTTP(0xc0000d1ba0, {0x1783c860?, 0xc00041e0e0}, 0xc000522300)
        /root/go/pkg/mod/github.com/gin-gonic/gin@v1.9.1/gin.go:576 +0x1dd fp=0xc0004cbb48 sp=0xc0004cbb08 pc=0x97eddd
net/http.serverHandler.ServeHTTP({0x1783ab80?}, {0x1783c860?, 0xc00041e0e0?}, 0x6?)
        /usr/local/go/src/net/http/server.go:2938 +0x8e fp=0xc0004cbb78 sp=0xc0004cbb48 pc=0x6ee3ee
net/http.(*conn).serve(0xc0004b4120, {0x1783ded8, 0xc0005800f0})
        /usr/local/go/src/net/http/server.go:2009 +0x5f4 fp=0xc0004cbfb8 sp=0xc0004cbb78 pc=0x6ea2d4
net/http.(*Server).Serve.func3()
        /usr/local/go/src/net/http/server.go:3086 +0x28 fp=0xc0004cbfe0 sp=0xc0004cbfb8 pc=0x6eec08
runtime.goexit()
        /usr/local/go/src/runtime/asm_amd64.s:1650 +0x1 fp=0xc0004cbfe8 sp=0xc0004cbfe0 pc=0x48d961
created by net/http.(*Server).Serve in goroutine 1
        /usr/local/go/src/net/http/server.go:3086 +0x5cb

goroutine 1 [IO wait]:
runtime.gopark(0x4a05b0?, 0xc000533828?, 0x78?, 0x38?, 0x5166dd?)
        /usr/local/go/src/runtime/proc.go:398 +0xce fp=0xc000615808 sp=0xc0006157e8 pc=0x45de8e
runtime.netpollblock(0x48b9d2?, 0x428946?, 0x0?)
        /usr/local/go/src/runtime/netpoll.go:564 +0xf7 fp=0xc000615840 sp=0xc000615808 pc=0x456917
internal/poll.runtime_pollWait(0x7fa18ddcbe80, 0x72)
        /usr/local/go/src/runtime/netpoll.go:343 +0x85 fp=0xc000615860 sp=0xc000615840 pc=0x4880a5
internal/poll.(*pollDesc).wait(0xc000462000?, 0x4?, 0x0)
        /usr/local/go/src/internal/poll/fd_poll_runtime.go:84 +0x27 fp=0xc000615888 sp=0xc000615860 pc=0x50f327
internal/poll.(*pollDesc).waitRead(...)
        /usr/local/go/src/internal/poll/fd_poll_runtime.go:89
internal/poll.(*FD).Accept(0xc000462000)
        /usr/local/go/src/internal/poll/fd_unix.go:611 +0x2ac fp=0xc000615930 sp=0xc000615888 pc=0x51480c
net.(*netFD).accept(0xc000462000)
        /usr/local/go/src/net/fd_unix.go:172 +0x29 fp=0xc0006159e8 sp=0xc000615930 pc=0x58b3e9
net.(*TCPListener).accept(0xc00043b5a0)
        /usr/local/go/src/net/tcpsock_posix.go:152 +0x1e fp=0xc000615a10 sp=0xc0006159e8 pc=0x5a01fe
net.(*TCPListener).Accept(0xc00043b5a0)
        /usr/local/go/src/net/tcpsock.go:315 +0x30 fp=0xc000615a40 sp=0xc000615a10 pc=0x59f3b0
net/http.(*onceCloseListener).Accept(0xc0004b4120?)
        <autogenerated>:1 +0x24 fp=0xc000615a58 sp=0xc000615a40 pc=0x711184
net/http.(*Server).Serve(0xc000366ff0, {0x1783c650, 0xc00043b5a0})
        /usr/local/go/src/net/http/server.go:3056 +0x364 fp=0xc000615b88 sp=0xc000615a58 pc=0x6ee844
github.com/jmorganca/ollama/server.Serve({0x1783c650, 0xc00043b5a0})
        /go/src/github.com/jmorganca/ollama/server/routes.go:956 +0x389 fp=0xc000615c98 sp=0xc000615b88 pc=0x9a7da9
github.com/jmorganca/ollama/cmd.RunServer(0xc000460300?, {0x17d9db40?, 0x4?, 0xc3d4f5?})
        /go/src/github.com/jmorganca/ollama/cmd/cmd.go:634 +0x199 fp=0xc000615d30 sp=0xc000615c98 pc=0x9b9f99
github.com/spf13/cobra.(*Command).execute(0xc000419800, {0x17d9db40, 0x0, 0x0})
        /root/go/pkg/mod/github.com/spf13/cobra@v1.7.0/command.go:940 +0x87c fp=0xc000615e68 sp=0xc000615d30 pc=0x783fbc
github.com/spf13/cobra.(*Command).ExecuteC(0xc000418c00)
        /root/go/pkg/mod/github.com/spf13/cobra@v1.7.0/command.go:1068 +0x3a5 fp=0xc000615f20 sp=0xc000615e68 pc=0x7847e5
github.com/spf13/cobra.(*Command).Execute(...)
        /root/go/pkg/mod/github.com/spf13/cobra@v1.7.0/command.go:992
github.com/spf13/cobra.(*Command).ExecuteContext(...)
        /root/go/pkg/mod/github.com/spf13/cobra@v1.7.0/command.go:985
main.main()
        /go/src/github.com/jmorganca/ollama/main.go:11 +0x4d fp=0xc000615f40 sp=0xc000615f20 pc=0x9c04cd
runtime.main()
        /usr/local/go/src/runtime/proc.go:267 +0x2bb fp=0xc000615fe0 sp=0xc000615f40 pc=0x45da3b
runtime.goexit()
        /usr/local/go/src/runtime/asm_amd64.s:1650 +0x1 fp=0xc000615fe8 sp=0xc000615fe0 pc=0x48d961

goroutine 2 [force gc (idle)]:
runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?)
        /usr/local/go/src/runtime/proc.go:398 +0xce fp=0xc000058fa8 sp=0xc000058f88 pc=0x45de8e
runtime.goparkunlock(...)
        /usr/local/go/src/runtime/proc.go:404
runtime.forcegchelper()
        /usr/local/go/src/runtime/proc.go:322 +0xb3 fp=0xc000058fe0 sp=0xc000058fa8 pc=0x45dd13
runtime.goexit()
        /usr/local/go/src/runtime/asm_amd64.s:1650 +0x1 fp=0xc000058fe8 sp=0xc000058fe0 pc=0x48d961
created by runtime.init.6 in goroutine 1
        /usr/local/go/src/runtime/proc.go:310 +0x1a

goroutine 3 [GC sweep wait]:
runtime.gopark(0x1?, 0x0?, 0x0?, 0x0?, 0x0?)
        /usr/local/go/src/runtime/proc.go:398 +0xce fp=0xc000059778 sp=0xc000059758 pc=0x45de8e
runtime.goparkunlock(...)
        /usr/local/go/src/runtime/proc.go:404
runtime.bgsweep(0x0?)
        /usr/local/go/src/runtime/mgcsweep.go:321 +0xdf fp=0xc0000597c8 sp=0xc000059778 pc=0x449ddf
runtime.gcenable.func1()
        /usr/local/go/src/runtime/mgc.go:200 +0x25 fp=0xc0000597e0 sp=0xc0000597c8 pc=0x43ef05
runtime.goexit()
        /usr/local/go/src/runtime/asm_amd64.s:1650 +0x1 fp=0xc0000597e8 sp=0xc0000597e0 pc=0x48d961
created by runtime.gcenable in goroutine 1
        /usr/local/go/src/runtime/mgc.go:200 +0x66

goroutine 4 [GC scavenge wait]:
runtime.gopark(0x19a55b4?, 0x188b346?, 0x0?, 0x0?, 0x0?)
        /usr/local/go/src/runtime/proc.go:398 +0xce fp=0xc000059f70 sp=0xc000059f50 pc=0x45de8e
runtime.goparkunlock(...)
        /usr/local/go/src/runtime/proc.go:404
runtime.(*scavengerState).park(0x17ca7640)
        /usr/local/go/src/runtime/mgcscavenge.go:425 +0x49 fp=0xc000059fa0 sp=0xc000059f70 pc=0x447609
runtime.bgscavenge(0x0?)
        /usr/local/go/src/runtime/mgcscavenge.go:658 +0x59 fp=0xc000059fc8 sp=0xc000059fa0 pc=0x447bb9
runtime.gcenable.func2()
        /usr/local/go/src/runtime/mgc.go:201 +0x25 fp=0xc000059fe0 sp=0xc000059fc8 pc=0x43eea5
runtime.goexit()
        /usr/local/go/src/runtime/asm_amd64.s:1650 +0x1 fp=0xc000059fe8 sp=0xc000059fe0 pc=0x48d961
created by runtime.gcenable in goroutine 1
        /usr/local/go/src/runtime/mgc.go:201 +0xa5

goroutine 5 [finalizer wait]:
runtime.gopark(0xc364c0?, 0x10045f001?, 0x0?, 0x0?, 0x466045?)
        /usr/local/go/src/runtime/proc.go:398 +0xce fp=0xc000058628 sp=0xc000058608 pc=0x45de8e
runtime.runfinq()
        /usr/local/go/src/runtime/mfinal.go:193 +0x107 fp=0xc0000587e0 sp=0xc000058628 pc=0x43df87
runtime.goexit()
        /usr/local/go/src/runtime/asm_amd64.s:1650 +0x1 fp=0xc0000587e8 sp=0xc0000587e0 pc=0x48d961
created by runtime.createfing in goroutine 1
        /usr/local/go/src/runtime/mfinal.go:163 +0x3d

goroutine 6 [select, locked to thread]:
runtime.gopark(0xc00005a7a8?, 0x2?, 0x29?, 0xe1?, 0xc00005a7a4?)
        /usr/local/go/src/runtime/proc.go:398 +0xce fp=0xc00005a638 sp=0xc00005a618 pc=0x45de8e
runtime.selectgo(0xc00005a7a8, 0xc00005a7a0, 0x0?, 0x0, 0x0?, 0x1)
        /usr/local/go/src/runtime/select.go:327 +0x725 fp=0xc00005a758 sp=0xc00005a638 pc=0x46d9c5
runtime.ensureSigM.func1()
        /usr/local/go/src/runtime/signal_unix.go:1014 +0x19f fp=0xc00005a7e0 sp=0xc00005a758 pc=0x4849ff
runtime.goexit()
        /usr/local/go/src/runtime/asm_amd64.s:1650 +0x1 fp=0xc00005a7e8 sp=0xc00005a7e0 pc=0x48d961
created by runtime.ensureSigM in goroutine 1
        /usr/local/go/src/runtime/signal_unix.go:997 +0xc8

goroutine 18 [syscall]:
runtime.notetsleepg(0x0?, 0x0?)
        /usr/local/go/src/runtime/lock_futex.go:236 +0x29 fp=0xc0000547a0 sp=0xc000054768 pc=0x4309e9
os/signal.signal_recv()
        /usr/local/go/src/runtime/sigqueue.go:152 +0x29 fp=0xc0000547c0 sp=0xc0000547a0 pc=0x48a329
os/signal.loop()
        /usr/local/go/src/os/signal/signal_unix.go:23 +0x13 fp=0xc0000547e0 sp=0xc0000547c0 pc=0x713bb3
runtime.goexit()
        /usr/local/go/src/runtime/asm_amd64.s:1650 +0x1 fp=0xc0000547e8 sp=0xc0000547e0 pc=0x48d961
created by os/signal.Notify.func1.1 in goroutine 1
        /usr/local/go/src/os/signal/signal.go:151 +0x1f

goroutine 7 [chan receive]:
runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?)
        /usr/local/go/src/runtime/proc.go:398 +0xce fp=0xc00005af18 sp=0xc00005aef8 pc=0x45de8e
runtime.chanrecv(0xc00018da40, 0x0, 0x1)
        /usr/local/go/src/runtime/chan.go:583 +0x3cd fp=0xc00005af90 sp=0xc00005af18 pc=0x42b58d
runtime.chanrecv1(0x0?, 0x0?)
        /usr/local/go/src/runtime/chan.go:442 +0x12 fp=0xc00005afb8 sp=0xc00005af90 pc=0x42b192
github.com/jmorganca/ollama/server.Serve.func1()
        /go/src/github.com/jmorganca/ollama/server/routes.go:938 +0x25 fp=0xc00005afe0 sp=0xc00005afb8 pc=0x9a7ea5
runtime.goexit()
        /usr/local/go/src/runtime/asm_amd64.s:1650 +0x1 fp=0xc00005afe8 sp=0xc00005afe0 pc=0x48d961
created by github.com/jmorganca/ollama/server.Serve in goroutine 1
        /go/src/github.com/jmorganca/ollama/server/routes.go:937 +0x285

goroutine 34 [GC worker (idle)]:
runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?)
        /usr/local/go/src/runtime/proc.go:398 +0xce fp=0xc00058c750 sp=0xc00058c730 pc=0x45de8e
runtime.gcBgMarkWorker()
        /usr/local/go/src/runtime/mgc.go:1293 +0xe5 fp=0xc00058c7e0 sp=0xc00058c750 pc=0x440a85
runtime.goexit()
        /usr/local/go/src/runtime/asm_amd64.s:1650 +0x1 fp=0xc00058c7e8 sp=0xc00058c7e0 pc=0x48d961
created by runtime.gcBgMarkStartWorkers in goroutine 1
        /usr/local/go/src/runtime/mgc.go:1217 +0x1c

goroutine 35 [GC worker (idle)]:
runtime.gopark(0x2d8659670228?, 0x3?, 0xb4?, 0x48?, 0x0?)
        /usr/local/go/src/runtime/proc.go:398 +0xce fp=0xc00058cf50 sp=0xc00058cf30 pc=0x45de8e
runtime.gcBgMarkWorker()
        /usr/local/go/src/runtime/mgc.go:1293 +0xe5 fp=0xc00058cfe0 sp=0xc00058cf50 pc=0x440a85
runtime.goexit()
        /usr/local/go/src/runtime/asm_amd64.s:1650 +0x1 fp=0xc00058cfe8 sp=0xc00058cfe0 pc=0x48d961
created by runtime.gcBgMarkStartWorkers in goroutine 1
        /usr/local/go/src/runtime/mgc.go:1217 +0x1c

goroutine 19 [GC worker (idle)]:
runtime.gopark(0x2d8657fbbc81?, 0x3?, 0x24?, 0xe7?, 0x0?)
        /usr/local/go/src/runtime/proc.go:398 +0xce fp=0xc000054f50 sp=0xc000054f30 pc=0x45de8e
runtime.gcBgMarkWorker()
        /usr/local/go/src/runtime/mgc.go:1293 +0xe5 fp=0xc000054fe0 sp=0xc000054f50 pc=0x440a85
runtime.goexit()
        /usr/local/go/src/runtime/asm_amd64.s:1650 +0x1 fp=0xc000054fe8 sp=0xc000054fe0 pc=0x48d961
created by runtime.gcBgMarkStartWorkers in goroutine 1
        /usr/local/go/src/runtime/mgc.go:1217 +0x1c

goroutine 20 [GC worker (idle)]:
runtime.gopark(0x2d865966fa58?, 0x1?, 0xa0?, 0xd3?, 0x0?)
        /usr/local/go/src/runtime/proc.go:398 +0xce fp=0xc000055750 sp=0xc000055730 pc=0x45de8e
runtime.gcBgMarkWorker()
        /usr/local/go/src/runtime/mgc.go:1293 +0xe5 fp=0xc0000557e0 sp=0xc000055750 pc=0x440a85
runtime.goexit()
        /usr/local/go/src/runtime/asm_amd64.s:1650 +0x1 fp=0xc0000557e8 sp=0xc0000557e0 pc=0x48d961
created by runtime.gcBgMarkStartWorkers in goroutine 1
        /usr/local/go/src/runtime/mgc.go:1217 +0x1c

goroutine 50 [GC worker (idle)]:
runtime.gopark(0x2d8657fbb961?, 0x3?, 0x58?, 0x66?, 0x0?)
        /usr/local/go/src/runtime/proc.go:398 +0xce fp=0xc000588750 sp=0xc000588730 pc=0x45de8e
runtime.gcBgMarkWorker()
        /usr/local/go/src/runtime/mgc.go:1293 +0xe5 fp=0xc0005887e0 sp=0xc000588750 pc=0x440a85
runtime.goexit()
        /usr/local/go/src/runtime/asm_amd64.s:1650 +0x1 fp=0xc0005887e8 sp=0xc0005887e0 pc=0x48d961
created by runtime.gcBgMarkStartWorkers in goroutine 1
        /usr/local/go/src/runtime/mgc.go:1217 +0x1c

goroutine 8 [GC worker (idle)]:
runtime.gopark(0x17d9f7a0?, 0x1?, 0xe4?, 0xfa?, 0x0?)
        /usr/local/go/src/runtime/proc.go:398 +0xce fp=0xc00005b750 sp=0xc00005b730 pc=0x45de8e
runtime.gcBgMarkWorker()
        /usr/local/go/src/runtime/mgc.go:1293 +0xe5 fp=0xc00005b7e0 sp=0xc00005b750 pc=0x440a85
runtime.goexit()
        /usr/local/go/src/runtime/asm_amd64.s:1650 +0x1 fp=0xc00005b7e8 sp=0xc00005b7e0 pc=0x48d961
created by runtime.gcBgMarkStartWorkers in goroutine 1
        /usr/local/go/src/runtime/mgc.go:1217 +0x1c

goroutine 21 [GC worker (idle)]:
runtime.gopark(0x2d865963289c?, 0x3?, 0xc0?, 0x89?, 0x0?)
        /usr/local/go/src/runtime/proc.go:398 +0xce fp=0xc000055f50 sp=0xc000055f30 pc=0x45de8e
runtime.gcBgMarkWorker()
        /usr/local/go/src/runtime/mgc.go:1293 +0xe5 fp=0xc000055fe0 sp=0xc000055f50 pc=0x440a85
runtime.goexit()
        /usr/local/go/src/runtime/asm_amd64.s:1650 +0x1 fp=0xc000055fe8 sp=0xc000055fe0 pc=0x48d961
created by runtime.gcBgMarkStartWorkers in goroutine 1
        /usr/local/go/src/runtime/mgc.go:1217 +0x1c

goroutine 51 [GC worker (idle)]:
runtime.gopark(0x2d865966f800?, 0x3?, 0xd8?, 0x8c?, 0x0?)
        /usr/local/go/src/runtime/proc.go:398 +0xce fp=0xc000588f50 sp=0xc000588f30 pc=0x45de8e
runtime.gcBgMarkWorker()
        /usr/local/go/src/runtime/mgc.go:1293 +0xe5 fp=0xc000588fe0 sp=0xc000588f50 pc=0x440a85
runtime.goexit()
        /usr/local/go/src/runtime/asm_amd64.s:1650 +0x1 fp=0xc000588fe8 sp=0xc000588fe0 pc=0x48d961
created by runtime.gcBgMarkStartWorkers in goroutine 1
        /usr/local/go/src/runtime/mgc.go:1217 +0x1c

goroutine 52 [IO wait]:
runtime.gopark(0x75?, 0xb?, 0x0?, 0x0?, 0x8?)
        /usr/local/go/src/runtime/proc.go:398 +0xce fp=0xc0004bd8f8 sp=0xc0004bd8d8 pc=0x45de8e
runtime.netpollblock(0x49e718?, 0x428946?, 0x0?)
        /usr/local/go/src/runtime/netpoll.go:564 +0xf7 fp=0xc0004bd930 sp=0xc0004bd8f8 pc=0x456917
internal/poll.runtime_pollWait(0x7fa18ddcbd88, 0x72)
        /usr/local/go/src/runtime/netpoll.go:343 +0x85 fp=0xc0004bd950 sp=0xc0004bd930 pc=0x4880a5
internal/poll.(*pollDesc).wait(0xc000462080?, 0xc0004b2000?, 0x0)
        /usr/local/go/src/internal/poll/fd_poll_runtime.go:84 +0x27 fp=0xc0004bd978 sp=0xc0004bd950 pc=0x50f327
internal/poll.(*pollDesc).waitRead(...)
        /usr/local/go/src/internal/poll/fd_poll_runtime.go:89
internal/poll.(*FD).Read(0xc000462080, {0xc0004b2000, 0x1000, 0x1000})
        /usr/local/go/src/internal/poll/fd_unix.go:164 +0x27a fp=0xc0004bda10 sp=0xc0004bd978 pc=0x51061a
net.(*netFD).Read(0xc000462080, {0xc0004b2000?, 0x50f7e5?, 0x0?})
        /usr/local/go/src/net/fd_posix.go:55 +0x25 fp=0xc0004bda58 sp=0xc0004bda10 pc=0x5893c5
net.(*conn).Read(0xc000592010, {0xc0004b2000?, 0x0?, 0xc00048a0f8?})
        /usr/local/go/src/net/net.go:179 +0x45 fp=0xc0004bdaa0 sp=0xc0004bda58 pc=0x597665
net.(*TCPConn).Read(0xc00048a0f0?, {0xc0004b2000?, 0x0?, 0xc0004bdac0?})
        <autogenerated>:1 +0x25 fp=0xc0004bdad0 sp=0xc0004bdaa0 pc=0x5a9565
net/http.(*connReader).Read(0xc00048a0f0, {0xc0004b2000, 0x1000, 0x1000})
        /usr/local/go/src/net/http/server.go:791 +0x14b fp=0xc0004bdb20 sp=0xc0004bdad0 pc=0x6e458b
bufio.(*Reader).fill(0xc000516060)
        /usr/local/go/src/bufio/bufio.go:113 +0x103 fp=0xc0004bdb58 sp=0xc0004bdb20 pc=0x6741c3
bufio.(*Reader).Peek(0xc000516060, 0x4)
        /usr/local/go/src/bufio/bufio.go:151 +0x53 fp=0xc0004bdb78 sp=0xc0004bdb58 pc=0x6742f3
net/http.(*conn).serve(0xc0000fc1b0, {0x1783ded8, 0xc0005800f0})
        /usr/local/go/src/net/http/server.go:2044 +0x75c fp=0xc0004bdfb8 sp=0xc0004bdb78 pc=0x6ea43c
net/http.(*Server).Serve.func3()
        /usr/local/go/src/net/http/server.go:3086 +0x28 fp=0xc0004bdfe0 sp=0xc0004bdfb8 pc=0x6eec08
runtime.goexit()
        /usr/local/go/src/runtime/asm_amd64.s:1650 +0x1 fp=0xc0004bdfe8 sp=0xc0004bdfe0 pc=0x48d961
created by net/http.(*Server).Serve in goroutine 1
        /usr/local/go/src/net/http/server.go:3086 +0x5cb

goroutine 53 [IO wait]:
runtime.gopark(0x4f8?, 0xb?, 0x0?, 0x0?, 0x9?)
        /usr/local/go/src/runtime/proc.go:398 +0xce fp=0xc0004bf8f8 sp=0xc0004bf8d8 pc=0x45de8e
runtime.netpollblock(0x49e718?, 0x428946?, 0x0?)
        /usr/local/go/src/runtime/netpoll.go:564 +0xf7 fp=0xc0004bf930 sp=0xc0004bf8f8 pc=0x456917
internal/poll.runtime_pollWait(0x7fa18ddcbc90, 0x72)
        /usr/local/go/src/runtime/netpoll.go:343 +0x85 fp=0xc0004bf950 sp=0xc0004bf930 pc=0x4880a5
internal/poll.(*pollDesc).wait(0xc000462180?, 0xc00053c000?, 0x0)
        /usr/local/go/src/internal/poll/fd_poll_runtime.go:84 +0x27 fp=0xc0004bf978 sp=0xc0004bf950 pc=0x50f327
internal/poll.(*pollDesc).waitRead(...)
        /usr/local/go/src/internal/poll/fd_poll_runtime.go:89
internal/poll.(*FD).Read(0xc000462180, {0xc00053c000, 0x1000, 0x1000})
        /usr/local/go/src/internal/poll/fd_unix.go:164 +0x27a fp=0xc0004bfa10 sp=0xc0004bf978 pc=0x51061a
net.(*netFD).Read(0xc000462180, {0xc00053c000?, 0x50f7e5?, 0x0?})
        /usr/local/go/src/net/fd_posix.go:55 +0x25 fp=0xc0004bfa58 sp=0xc0004bfa10 pc=0x5893c5
net.(*conn).Read(0xc000592018, {0xc00053c000?, 0x0?, 0xc000580218?})
        /usr/local/go/src/net/net.go:179 +0x45 fp=0xc0004bfaa0 sp=0xc0004bfa58 pc=0x597665
net.(*TCPConn).Read(0xc000580210?, {0xc00053c000?, 0x0?, 0xc000551ac0?})
        <autogenerated>:1 +0x25 fp=0xc0004bfad0 sp=0xc0004bfaa0 pc=0x5a9565
net/http.(*connReader).Read(0xc000580210, {0xc00053c000, 0x1000, 0x1000})
        /usr/local/go/src/net/http/server.go:791 +0x14b fp=0xc0004bfb20 sp=0xc0004bfad0 pc=0x6e458b
bufio.(*Reader).fill(0xc00010e060)
        /usr/local/go/src/bufio/bufio.go:113 +0x103 fp=0xc0004bfb58 sp=0xc0004bfb20 pc=0x6741c3
bufio.(*Reader).Peek(0xc00010e060, 0x4)
        /usr/local/go/src/bufio/bufio.go:151 +0x53 fp=0xc0004bfb78 sp=0xc0004bfb58 pc=0x6742f3
net/http.(*conn).serve(0xc0000fc240, {0x1783ded8, 0xc0005800f0})
        /usr/local/go/src/net/http/server.go:2044 +0x75c fp=0xc0004bffb8 sp=0xc0004bfb78 pc=0x6ea43c
net/http.(*Server).Serve.func3()
        /usr/local/go/src/net/http/server.go:3086 +0x28 fp=0xc0004bffe0 sp=0xc0004bffb8 pc=0x6eec08
runtime.goexit()
        /usr/local/go/src/runtime/asm_amd64.s:1650 +0x1 fp=0xc0004bffe8 sp=0xc0004bffe0 pc=0x48d961
created by net/http.(*Server).Serve in goroutine 1
        /usr/local/go/src/net/http/server.go:3086 +0x5cb

goroutine 22 [IO wait]:
runtime.gopark(0x4f8?, 0xb?, 0x0?, 0x0?, 0xa?)
        /usr/local/go/src/runtime/proc.go:398 +0xce fp=0xc0004b98f8 sp=0xc0004b98d8 pc=0x45de8e
runtime.netpollblock(0x49e718?, 0x428946?, 0x0?)
        /usr/local/go/src/runtime/netpoll.go:564 +0xf7 fp=0xc0004b9930 sp=0xc0004b98f8 pc=0x456917
internal/poll.runtime_pollWait(0x7fa18ddcbb98, 0x72)
        /usr/local/go/src/runtime/netpoll.go:343 +0x85 fp=0xc0004b9950 sp=0xc0004b9930 pc=0x4880a5
internal/poll.(*pollDesc).wait(0xc000186000?, 0xc0000c8000?, 0x0)
        /usr/local/go/src/internal/poll/fd_poll_runtime.go:84 +0x27 fp=0xc0004b9978 sp=0xc0004b9950 pc=0x50f327
internal/poll.(*pollDesc).waitRead(...)
        /usr/local/go/src/internal/poll/fd_poll_runtime.go:89
internal/poll.(*FD).Read(0xc000186000, {0xc0000c8000, 0x1000, 0x1000})
        /usr/local/go/src/internal/poll/fd_unix.go:164 +0x27a fp=0xc0004b9a10 sp=0xc0004b9978 pc=0x51061a
net.(*netFD).Read(0xc000186000, {0xc0000c8000?, 0x50f7e5?, 0x0?})
        /usr/local/go/src/net/fd_posix.go:55 +0x25 fp=0xc0004b9a58 sp=0xc0004b9a10 pc=0x5893c5
net.(*conn).Read(0xc000080000, {0xc0000c8000?, 0x0?, 0xc000098518?})
        /usr/local/go/src/net/net.go:179 +0x45 fp=0xc0004b9aa0 sp=0xc0004b9a58 pc=0x597665
net.(*TCPConn).Read(0xc000098510?, {0xc0000c8000?, 0x0?, 0xc00054dac0?})
        <autogenerated>:1 +0x25 fp=0xc0004b9ad0 sp=0xc0004b9aa0 pc=0x5a9565
net/http.(*connReader).Read(0xc000098510, {0xc0000c8000, 0x1000, 0x1000})
        /usr/local/go/src/net/http/server.go:791 +0x14b fp=0xc0004b9b20 sp=0xc0004b9ad0 pc=0x6e458b
bufio.(*Reader).fill(0xc00018c7e0)
        /usr/local/go/src/bufio/bufio.go:113 +0x103 fp=0xc0004b9b58 sp=0xc0004b9b20 pc=0x6741c3
bufio.(*Reader).Peek(0xc00018c7e0, 0x4)
        /usr/local/go/src/bufio/bufio.go:151 +0x53 fp=0xc0004b9b78 sp=0xc0004b9b58 pc=0x6742f3
net/http.(*conn).serve(0xc0000c6000, {0x1783ded8, 0xc0005800f0})
        /usr/local/go/src/net/http/server.go:2044 +0x75c fp=0xc0004b9fb8 sp=0xc0004b9b78 pc=0x6ea43c
net/http.(*Server).Serve.func3()
        /usr/local/go/src/net/http/server.go:3086 +0x28 fp=0xc0004b9fe0 sp=0xc0004b9fb8 pc=0x6eec08
runtime.goexit()
        /usr/local/go/src/runtime/asm_amd64.s:1650 +0x1 fp=0xc0004b9fe8 sp=0xc0004b9fe0 pc=0x48d961
created by net/http.(*Server).Serve in goroutine 1
        /usr/local/go/src/net/http/server.go:3086 +0x5cb

goroutine 11 [IO wait]:
runtime.gopark(0x0?, 0xb?, 0x0?, 0x0?, 0xb?)
        /usr/local/go/src/runtime/proc.go:398 +0xce fp=0xc00058eda0 sp=0xc00058ed80 pc=0x45de8e
runtime.netpollblock(0x49e718?, 0x428946?, 0x0?)
        /usr/local/go/src/runtime/netpoll.go:564 +0xf7 fp=0xc00058edd8 sp=0xc00058eda0 pc=0x456917
internal/poll.runtime_pollWait(0x7fa18ddcbaa0, 0x72)
        /usr/local/go/src/runtime/netpoll.go:343 +0x85 fp=0xc00058edf8 sp=0xc00058edd8 pc=0x4880a5
internal/poll.(*pollDesc).wait(0xc00041a000?, 0xc00048a6a1?, 0x0)
        /usr/local/go/src/internal/poll/fd_poll_runtime.go:84 +0x27 fp=0xc00058ee20 sp=0xc00058edf8 pc=0x50f327
internal/poll.(*pollDesc).waitRead(...)
        /usr/local/go/src/internal/poll/fd_poll_runtime.go:89
internal/poll.(*FD).Read(0xc00041a000, {0xc00048a6a1, 0x1, 0x1})
        /usr/local/go/src/internal/poll/fd_unix.go:164 +0x27a fp=0xc00058eeb8 sp=0xc00058ee20 pc=0x51061a
net.(*netFD).Read(0xc00041a000, {0xc00048a6a1?, 0xc00058ef40?, 0x48a030?})
        /usr/local/go/src/net/fd_posix.go:55 +0x25 fp=0xc00058ef00 sp=0xc00058eeb8 pc=0x5893c5
net.(*conn).Read(0xc00005c040, {0xc00048a6a1?, 0x1?, 0xc0004240a0?})
        /usr/local/go/src/net/net.go:179 +0x45 fp=0xc00058ef48 sp=0xc00058ef00 pc=0x597665
net.(*TCPConn).Read(0xc00048a0f0?, {0xc00048a6a1?, 0xc0004240a0?, 0x0?})
        <autogenerated>:1 +0x25 fp=0xc00058ef78 sp=0xc00058ef48 pc=0x5a9565
net/http.(*connReader).backgroundRead(0xc00048a690)
        /usr/local/go/src/net/http/server.go:683 +0x37 fp=0xc00058efc8 sp=0xc00058ef78 pc=0x6e4157
net/http.(*connReader).startBackgroundRead.func2()
        /usr/local/go/src/net/http/server.go:679 +0x25 fp=0xc00058efe0 sp=0xc00058efc8 pc=0x6e4085
runtime.goexit()
        /usr/local/go/src/runtime/asm_amd64.s:1650 +0x1 fp=0xc00058efe8 sp=0xc00058efe0 pc=0x48d961
created by net/http.(*connReader).startBackgroundRead in goroutine 10
        /usr/local/go/src/net/http/server.go:679 +0xba

rax    0x0
rbx    0x7fa18f649640
rcx    0x7fa1d727b9fc
rdx    0x6
rdi    0x1f7
rsi    0x1fa
rbp    0x1fa
rsp    0x7fa18f6481f0
r8     0x7fa18f6482c0
r9     0x7fa18f648260
r10    0x8
r11    0x246
r12    0x6
r13    0x16
r14    0x60c3f8000
r15    0x0
rip    0x7fa1d727b9fc
rflags 0x246
cs     0x33
fs     0x0
gs     0x0
Error: Post "http://127.0.0.1:11434/api/generate": EOF

Help would be appreciated

Originally created by @Subie1 on GitHub (Jan 9, 2024). Original GitHub issue: https://github.com/ollama/ollama/issues/1865 Originally assigned to: @dhiltgen on GitHub. The `ollama serve` command runs as normally with the detection of my GPU: ``` 2024/01/09 14:37:45 gpu.go:34: Detecting GPU type ama 2024/01/09 14:37:45 gpu.go:53: Nvidia GPU detected ggml_init_cublas: found 1 CUDA devices: Device 0: Quadro M1000M, compute capability 5.0 ``` Lines which lead me to believe it's loading CUDA: ``` llm_load_tensors: using CUDA for GPU acceleration llm_load_tensors: mem required = 35.23 MiB llm_load_tensors: offloading 22 repeating layers to GPU llm_load_tensors: offloading non-repeating layers to GPU llm_load_tensors: offloaded 23/23 layers to GPU llm_load_tensors: VRAM used: 571.37 MiB ⠴ . llama_new_context_with_model: n_ctx = 2048 llama_new_context_with_model: freq_base = 10000.0 llama_new_context_with_model: freq_scale = 1 llama_kv_cache_init: VRAM kv self = 44.00 MB llama_new_context_with_model: KV self size = 44.00 MiB, K (f16): 22.00 MiB, V (f16): 22.00 MiB ⠦ llama_build_graph: non-view tensors processed: 466/466 llama_new_context_with_model: compute buffer total size = 147.19 MiB ⠧ llama_new_context_with_model: VRAM scratch buffer: 144.00 MiB llama_new_context_with_model: total VRAM used: 759.38 MiB (model: 571.37 MiB, context: 188.00 MiB) ``` The once I run a model it starts normally, then before the finish it crashes with this error: ``` CUDA error 209 at /go/src/github.com/jmorganca/ollama/llm/llama.cpp/ggml-cuda.cu:7801: no kernel image is available for execution on the device current device: 0 GGML_ASSERT: /go/src/github.com/jmorganca/ollama/llm/llama.cpp/ggml-cuda.cu:7801: !"CUDA error" ⠼ SIGABRT: abort PC=0x7f7fa7b7b9fc m=11 sigcode=18446744073709551610 signal arrived during cgo execution ``` Then continues with this huge error: ``` CUDA error 209 at /go/src/github.com/jmorganca/ollama/llm/llama.cpp/ggml-cuda.cu:7801: no kernel image is available for execution on the device current device: 0 GGML_ASSERT: /go/src/github.com/jmorganca/ollama/llm/llama.cpp/ggml-cuda.cu:7801: !"CUDA error" ⠋ SIGABRT: abort PC=0x7fa1d727b9fc m=4 sigcode=18446744073709551610 signal arrived during cgo execution goroutine 10 [syscall]: runtime.cgocall(0x9c1470, 0xc0004ca608) /usr/local/go/src/runtime/cgocall.go:157 +0x4b fp=0xc0004ca5e0 sp=0xc0004ca5a8 pc=0x4291ab github.com/jmorganca/ollama/llm._Cfunc_dynamic_shim_llama_server_init({0x7fa1840014a0, 0x7fa13ce7b2e0, 0x7fa13ce6da80, 0x7fa13ce71270, 0x7fa13ce83770, 0x7fa13ce78900, 0x7fa13ce71430, 0x7fa13ce6db00, 0x7fa13ce7ea00, 0x7fa13ce7e5b0, ...}, ...) _cgo_gotypes.go:287 +0x45 fp=0xc0004ca608 sp=0xc0004ca5e0 pc=0x7cdd85 github.com/jmorganca/ollama/llm.(*shimExtServer).llama_server_init.func1(0x45971b?, 0x80?, 0x80?) /go/src/github.com/jmorganca/ollama/llm/shim_ext_server.go:40 +0xec fp=0xc0004ca6f8 sp=0xc0004ca608 pc=0x7d326c github.com/jmorganca/ollama/llm.(*shimExtServer).llama_server_init(0xc0000982d0?, 0x0?, 0x200?) /go/src/github.com/jmorganca/ollama/llm/shim_ext_server.go:40 +0x13 fp=0xc0004ca720 sp=0xc0004ca6f8 pc=0x7d3153 github.com/jmorganca/ollama/llm.newExtServer({0x17842518, 0xc0000f8360}, {0xc0004b6230, _}, {_, _, _}, {0x0, 0x0, 0x0}, ...) /go/src/github.com/jmorganca/ollama/llm/ext_server_common.go:146 +0x7f8 fp=0xc0004ca9a8 sp=0xc0004ca720 pc=0x7cf3b8 github.com/jmorganca/ollama/llm.newDynamicShimExtServer({0xc00047cf00, 0x2b}, {0xc0004b6230, _}, {_, _, _}, {0x0, 0x0, 0x0}, ...) /go/src/github.com/jmorganca/ollama/llm/shim_ext_server.go:93 +0x54f fp=0xc0004cabd0 sp=0xc0004ca9a8 pc=0x7d45af github.com/jmorganca/ollama/llm.newLlmServer({0xc3d801, 0x4}, {0xc0004b6230, _}, {_, _, _}, {0x0, 0x0, 0x0}, ...) /go/src/github.com/jmorganca/ollama/llm/llm.go:86 +0x16b fp=0xc0004cad60 sp=0xc0004cabd0 pc=0x7ccecb github.com/jmorganca/ollama/llm.New({0xc00048e180?, 0x0?}, {0xc0004b6230, _}, {_, _, _}, {0x0, 0x0, 0x0}, ...) /go/src/github.com/jmorganca/ollama/llm/llm.go:76 +0x233 fp=0xc0004caef0 sp=0xc0004cad60 pc=0x7ccb33 github.com/jmorganca/ollama/server.load(0xc0004d2000?, 0xc0004d2000, {{0x0, 0x800, 0x200, 0x1, 0xffffffffffffffff, 0x0, 0x0, 0x1, ...}, ...}, ...) /go/src/github.com/jmorganca/ollama/server/routes.go:84 +0x425 fp=0xc0004cb0a0 sp=0xc0004caef0 pc=0x99d825 github.com/jmorganca/ollama/server.GenerateHandler(0xc000522200) /go/src/github.com/jmorganca/ollama/server/routes.go:191 +0x8c8 fp=0xc0004cb748 sp=0xc0004cb0a0 pc=0x99e5c8 github.com/gin-gonic/gin.(*Context).Next(...) /root/go/pkg/mod/github.com/gin-gonic/gin@v1.9.1/context.go:174 github.com/jmorganca/ollama/server.(*Server).GenerateRoutes.func1(0xc000522200) /go/src/github.com/jmorganca/ollama/server/routes.go:876 +0x68 fp=0xc0004cb780 sp=0xc0004cb748 pc=0x9a79c8 github.com/gin-gonic/gin.(*Context).Next(...) /root/go/pkg/mod/github.com/gin-gonic/gin@v1.9.1/context.go:174 github.com/gin-gonic/gin.CustomRecoveryWithWriter.func1(0xc000522200) /root/go/pkg/mod/github.com/gin-gonic/gin@v1.9.1/recovery.go:102 +0x7a fp=0xc0004cb7d0 sp=0xc0004cb780 pc=0x9813ba github.com/gin-gonic/gin.(*Context).Next(...) /root/go/pkg/mod/github.com/gin-gonic/gin@v1.9.1/context.go:174 github.com/gin-gonic/gin.LoggerWithConfig.func1(0xc000522200) /root/go/pkg/mod/github.com/gin-gonic/gin@v1.9.1/logger.go:240 +0xde fp=0xc0004cb980 sp=0xc0004cb7d0 pc=0x98055e github.com/gin-gonic/gin.(*Context).Next(...) /root/go/pkg/mod/github.com/gin-gonic/gin@v1.9.1/context.go:174 github.com/gin-gonic/gin.(*Engine).handleHTTPRequest(0xc0000d1ba0, 0xc000522200) /root/go/pkg/mod/github.com/gin-gonic/gin@v1.9.1/gin.go:620 +0x65b fp=0xc0004cbb08 sp=0xc0004cb980 pc=0x97f61b github.com/gin-gonic/gin.(*Engine).ServeHTTP(0xc0000d1ba0, {0x1783c860?, 0xc00041e0e0}, 0xc000522300) /root/go/pkg/mod/github.com/gin-gonic/gin@v1.9.1/gin.go:576 +0x1dd fp=0xc0004cbb48 sp=0xc0004cbb08 pc=0x97eddd net/http.serverHandler.ServeHTTP({0x1783ab80?}, {0x1783c860?, 0xc00041e0e0?}, 0x6?) /usr/local/go/src/net/http/server.go:2938 +0x8e fp=0xc0004cbb78 sp=0xc0004cbb48 pc=0x6ee3ee net/http.(*conn).serve(0xc0004b4120, {0x1783ded8, 0xc0005800f0}) /usr/local/go/src/net/http/server.go:2009 +0x5f4 fp=0xc0004cbfb8 sp=0xc0004cbb78 pc=0x6ea2d4 net/http.(*Server).Serve.func3() /usr/local/go/src/net/http/server.go:3086 +0x28 fp=0xc0004cbfe0 sp=0xc0004cbfb8 pc=0x6eec08 runtime.goexit() /usr/local/go/src/runtime/asm_amd64.s:1650 +0x1 fp=0xc0004cbfe8 sp=0xc0004cbfe0 pc=0x48d961 created by net/http.(*Server).Serve in goroutine 1 /usr/local/go/src/net/http/server.go:3086 +0x5cb goroutine 1 [IO wait]: runtime.gopark(0x4a05b0?, 0xc000533828?, 0x78?, 0x38?, 0x5166dd?) /usr/local/go/src/runtime/proc.go:398 +0xce fp=0xc000615808 sp=0xc0006157e8 pc=0x45de8e runtime.netpollblock(0x48b9d2?, 0x428946?, 0x0?) /usr/local/go/src/runtime/netpoll.go:564 +0xf7 fp=0xc000615840 sp=0xc000615808 pc=0x456917 internal/poll.runtime_pollWait(0x7fa18ddcbe80, 0x72) /usr/local/go/src/runtime/netpoll.go:343 +0x85 fp=0xc000615860 sp=0xc000615840 pc=0x4880a5 internal/poll.(*pollDesc).wait(0xc000462000?, 0x4?, 0x0) /usr/local/go/src/internal/poll/fd_poll_runtime.go:84 +0x27 fp=0xc000615888 sp=0xc000615860 pc=0x50f327 internal/poll.(*pollDesc).waitRead(...) /usr/local/go/src/internal/poll/fd_poll_runtime.go:89 internal/poll.(*FD).Accept(0xc000462000) /usr/local/go/src/internal/poll/fd_unix.go:611 +0x2ac fp=0xc000615930 sp=0xc000615888 pc=0x51480c net.(*netFD).accept(0xc000462000) /usr/local/go/src/net/fd_unix.go:172 +0x29 fp=0xc0006159e8 sp=0xc000615930 pc=0x58b3e9 net.(*TCPListener).accept(0xc00043b5a0) /usr/local/go/src/net/tcpsock_posix.go:152 +0x1e fp=0xc000615a10 sp=0xc0006159e8 pc=0x5a01fe ) ⠙ /usr/local/go/src/net/tcpsock.go:315 +0x30 fp=0xc000615a40 sp=0xc000615a10 pc=0x59f3b0 net/http.(*onceCloseListener).Accept(0xc0004b4120?) <autogenerated>:1 +0x24 fp=0xc000615a58 sp=0xc000615a40 pc=0x711184 net/http.(*Server).Serve(0xc000366ff0, {0x1783c650, 0xc00043b5a0}) /usr/local/go/src/net/http/server.go:3056 +0x364 fp=0xc000615b88 sp=0xc000615a58 pc=0x6ee844 github.com/jmorganca/ollama/server.Serve({0x1783c650, 0xc00043b5a0}) /go/src/github.com/jmorganca/ollama/server/routes.go:956 +0x389 fp=0xc000615c98 sp=0xc000615b88 pc=0x9a7da9 github.com/jmorganca/ollama/cmd.RunServer(0xc000460300?, {0x17d9db40?, 0x4?, 0xc3d4f5?}) /go/src/github.com/jmorganca/ollama/cmd/cmd.go:634 +0x199 fp=0xc000615d30 sp=0xc000615c98 pc=0x9b9f99 github.com/spf13/cobra.(*Command).execute(0xc000419800, {0x17d9db40, 0x0, 0x0}) /root/go/pkg/mod/github.com/spf13/cobra@v1.7.0/command.go:940 +0x87c fp=0xc000615e68 sp=0xc000615d30 pc=0x783fbc github.com/spf13/cobra.(*Command).ExecuteC(0xc000418c00) /root/go/pkg/mod/github.com/spf13/cobra@v1.7.0/command.go:1068 +0x3a5 fp=0xc000615f20 sp=0xc000615e68 pc=0x7847e5 github.com/spf13/cobra.(*Command).Execute(...) /root/go/pkg/mod/github.com/spf13/cobra@v1.7.0/command.go:992 github.com/spf13/cobra.(*Command).ExecuteContext(...) /root/go/pkg/mod/github.com/spf13/cobra@v1.7.0/command.go:985 main.main() /go/src/github.com/jmorganca/ollama/main.go:11 +0x4d fp=0xc000615f40 sp=0xc000615f20 pc=0x9c04cd runtime.main() /usr/local/go/src/runtime/proc.go:267 +0x2bb fp=0xc000615fe0 sp=0xc000615f40 pc=0x45da3b runtime.goexit() /usr/local/go/src/runtime/asm_amd64.s:1650 +0x1 fp=0xc000615fe8 sp=0xc000615fe0 pc=0x48d961 goroutine 2 [force gc (idle)]: runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?) /usr/local/go/src/runtime/proc.go:398 +0xce fp=0xc000058fa8 sp=0xc000058f88 pc=0x45de8e runtime.goparkunlock(...) /usr/local/go/src/runtime/proc.go:404 runtime.forcegchelper() /usr/local/go/src/runtime/proc.go:322 +0xb3 fp=0xc000058fe0 sp=0xc000058fa8 pc=0x45dd13 runtime.goexit() /usr/local/go/src/runtime/asm_amd64.s:1650 +0x1 fp=0xc000058fe8 sp=0xc000058fe0 pc=0x48d961 created by runtime.init.6 in goroutine 1 /usr/local/go/src/runtime/proc.go:310 +0x1a goroutine 3 [GC sweep wait]: runtime.gopark(0x1?, 0x0?, 0x0?, 0x0?, 0x0?) /usr/local/go/src/runtime/proc.go:398 +0xce fp=0xc000059778 sp=0xc000059758 pc=0x45de8e runtime.goparkunlock(...) /usr/local/go/src/runtime/proc.go:404 runtime.bgsweep(0x0?) /usr/local/go/src/runtime/mgcsweep.go:321 +0xdf fp=0xc0000597c8 sp=0xc000059778 pc=0x449ddf runtime.gcenable.func1() /usr/local/go/src/runtime/mgc.go:200 +0x25 fp=0xc0000597e0 sp=0xc0000597c8 pc=0x43ef05 runtime.goexit() /usr/local/go/src/runtime/asm_amd64.s:1650 +0x1 fp=0xc0000597e8 sp=0xc0000597e0 pc=0x48d961 created by runtime.gcenable in goroutine 1 /usr/local/go/src/runtime/mgc.go:200 +0x66 goroutine 4 [GC scavenge wait]: runtime.gopark(0x19a55b4?, 0x188b346?, 0x0?, 0x0?, 0x0?) /usr/local/go/src/runtime/proc.go:398 +0xce fp=0xc000059f70 sp=0xc000059f50 pc=0x45de8e runtime.goparkunlock(...) /usr/local/go/src/runtime/proc.go:404 runtime.(*scavengerState).park(0x17ca7640) /usr/local/go/src/runtime/mgcscavenge.go:425 +0x49 fp=0xc000059fa0 sp=0xc000059f70 pc=0x447609 runtime.bgscavenge(0x0?) /usr/local/go/src/runtime/mgcscavenge.go:658 +0x59 fp=0xc000059fc8 sp=0xc000059fa0 pc=0x447bb9 runtime.gcenable.func2() /usr/local/go/src/runtime/mgc.go:201 +0x25 fp=0xc000059fe0 sp=0xc000059fc8 pc=0x43eea5 runtime.goexit() /usr/local/go/src/runtime/asm_amd64.s:1650 +0x1 fp=0xc000059fe8 sp=0xc000059fe0 pc=0x48d961 created by runtime.gcenable in goroutine 1 /usr/local/go/src/runtime/mgc.go:201 +0xa5 goroutine 5 [finalizer wait]: runtime.gopark(0xc364c0?, 0x10045f001?, 0x0?, 0x0?, 0x466045?) /usr/local/go/src/runtime/proc.go:398 +0xce fp=0xc000058628 sp=0xc000058608 pc=0x45de8e runtime.runfinq() /usr/local/go/src/runtime/mfinal.go:193 +0x107 fp=0xc0000587e0 sp=0xc000058628 pc=0x43df87 runtime.goexit() /usr/local/go/src/runtime/asm_amd64.s:1650 +0x1 fp=0xc0000587e8 sp=0xc0000587e0 pc=0x48d961 created by runtime.createfing in goroutine 1 /usr/local/go/src/runtime/mfinal.go:163 +0x3d goroutine 6 [select, locked to thread]: runtime.gopark(0xc00005a7a8?, 0x2?, 0x29?, 0xe1?, 0xc00005a7a4?) /usr/local/go/src/runtime/proc.go:398 +0xce fp=0xc00005a638 sp=0xc00005a618 pc=0x45de8e runtime.selectgo(0xc00005a7a8, 0xc00005a7a0, 0x0?, 0x0, 0x0?, 0x1) /usr/local/go/src/runtime/select.go:327 +0x725 fp=0xc00005a758 sp=0xc00005a638 pc=0x46d9c5 runtime.ensureSigM.func1() /usr/local/go/src/runtime/signal_unix.go:1014 +0x19f fp=0xc00005a7e0 sp=0xc00005a758 pc=0x4849ff runtime.goexit() /usr/local/go/src/runtime/asm_amd64.s:1650 +0x1 fp=0xc00005a7e8 sp=0xc00005a7e0 pc=0x48d961 created by runtime.ensureSigM in goroutine 1 /usr/local/go/src/runtime/signal_unix.go:997 +0xc8 goroutine 18 [syscall]: runtime.notetsleepg(0x0?, 0x0?) /usr/local/go/src/runtime/lock_futex.go:236 +0x29 fp=0xc0000547a0 sp=0xc000054768 pc=0x4309e9 os/signal.signal_recv() /usr/local/go/src/runtime/sigqueue.go:152 +0x29 fp=0xc0000547c0 sp=0xc0000547a0 pc=0x48a329 os/signal.loop() /usr/local/go/src/os/signal/signal_unix.go:23 +0x13 fp=0xc0000547e0 sp=0xc0000547c0 pc=0x713bb3 runtime.goexit() /usr/local/go/src/runtime/asm_amd64.s:1650 +0x1 fp=0xc0000547e8 sp=0xc0000547e0 pc=0x48d961 created by os/signal.Notify.func1.1 in goroutine 1 /usr/local/go/src/os/signal/signal.go:151 +0x1f goroutine 7 [chan receive]: runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?) /usr/local/go/src/runtime/proc.go:398 +0xce fp=0xc00005af18 sp=0xc00005aef8 pc=0x45de8e runtime.chanrecv(0xc00018da40, 0x0, 0x1) /usr/local/go/src/runtime/chan.go:583 +0x3cd fp=0xc00005af90 sp=0xc00005af18 pc=0x42b58d runtime.chanrecv1(0x0?, 0x0?) /usr/local/go/src/runtime/chan.go:442 +0x12 fp=0xc00005afb8 sp=0xc00005af90 pc=0x42b192 github.com/jmorganca/ollama/server.Serve.func1() /go/src/github.com/jmorganca/ollama/server/routes.go:938 +0x25 fp=0xc00005afe0 sp=0xc00005afb8 pc=0x9a7ea5 runtime.goexit() /usr/local/go/src/runtime/asm_amd64.s:1650 +0x1 fp=0xc00005afe8 sp=0xc00005afe0 pc=0x48d961 created by github.com/jmorganca/ollama/server.Serve in goroutine 1 /go/src/github.com/jmorganca/ollama/server/routes.go:937 +0x285 goroutine 34 [GC worker (idle)]: runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?) /usr/local/go/src/runtime/proc.go:398 +0xce fp=0xc00058c750 sp=0xc00058c730 pc=0x45de8e runtime.gcBgMarkWorker() /usr/local/go/src/runtime/mgc.go:1293 +0xe5 fp=0xc00058c7e0 sp=0xc00058c750 pc=0x440a85 runtime.goexit() /usr/local/go/src/runtime/asm_amd64.s:1650 +0x1 fp=0xc00058c7e8 sp=0xc00058c7e0 pc=0x48d961 created by runtime.gcBgMarkStartWorkers in goroutine 1 /usr/local/go/src/runtime/mgc.go:1217 +0x1c goroutine 35 [GC worker (idle)]: runtime.gopark(0x2d8659670228?, 0x3?, 0xb4?, 0x48?, 0x0?) /usr/local/go/src/runtime/proc.go:398 +0xce fp=0xc00058cf50 sp=0xc00058cf30 pc=0x45de8e runtime.gcBgMarkWorker() /usr/local/go/src/runtime/mgc.go:1293 +0xe5 fp=0xc00058cfe0 sp=0xc00058cf50 pc=0x440a85 runtime.goexit() /usr/local/go/src/runtime/asm_amd64.s:1650 +0x1 fp=0xc00058cfe8 sp=0xc00058cfe0 pc=0x48d961 created by runtime.gcBgMarkStartWorkers in goroutine 1 /usr/local/go/src/runtime/mgc.go:1217 +0x1c goroutine 19 [GC worker (idle)]: runtime.gopark(0x2d8657fbbc81?, 0x3?, 0x24?, 0xe7?, 0x0?) /usr/local/go/src/runtime/proc.go:398 +0xce fp=0xc000054f50 sp=0xc000054f30 pc=0x45de8e runtime.gcBgMarkWorker() /usr/local/go/src/runtime/mgc.go:1293 +0xe5 fp=0xc000054fe0 sp=0xc000054f50 pc=0x440a85 runtime.goexit() /usr/local/go/src/runtime/asm_amd64.s:1650 +0x1 fp=0xc000054fe8 sp=0xc000054fe0 pc=0x48d961 created by runtime.gcBgMarkStartWorkers in goroutine 1 /usr/local/go/src/runtime/mgc.go:1217 +0x1c goroutine 20 [GC worker (idle)]: runtime.gopark(0x2d865966fa58?, 0x1?, 0xa0?, 0xd3?, 0x0?) /usr/local/go/src/runtime/proc.go:398 +0xce fp=0xc000055750 sp=0xc000055730 pc=0x45de8e runtime.gcBgMarkWorker() /usr/local/go/src/runtime/mgc.go:1293 +0xe5 fp=0xc0000557e0 sp=0xc000055750 pc=0x440a85 runtime.goexit() /usr/local/go/src/runtime/asm_amd64.s:1650 +0x1 fp=0xc0000557e8 sp=0xc0000557e0 pc=0x48d961 created by runtime.gcBgMarkStartWorkers in goroutine 1 /usr/local/go/src/runtime/mgc.go:1217 +0x1c goroutine 50 [GC worker (idle)]: runtime.gopark(0x2d8657fbb961?, 0x3?, 0x58?, 0x66?, 0x0?) /usr/local/go/src/runtime/proc.go:398 +0xce fp=0xc000588750 sp=0xc000588730 pc=0x45de8e runtime.gcBgMarkWorker() /usr/local/go/src/runtime/mgc.go:1293 +0xe5 fp=0xc0005887e0 sp=0xc000588750 pc=0x440a85 runtime.goexit() /usr/local/go/src/runtime/asm_amd64.s:1650 +0x1 fp=0xc0005887e8 sp=0xc0005887e0 pc=0x48d961 created by runtime.gcBgMarkStartWorkers in goroutine 1 /usr/local/go/src/runtime/mgc.go:1217 +0x1c goroutine 8 [GC worker (idle)]: runtime.gopark(0x17d9f7a0?, 0x1?, 0xe4?, 0xfa?, 0x0?) /usr/local/go/src/runtime/proc.go:398 +0xce fp=0xc00005b750 sp=0xc00005b730 pc=0x45de8e runtime.gcBgMarkWorker() /usr/local/go/src/runtime/mgc.go:1293 +0xe5 fp=0xc00005b7e0 sp=0xc00005b750 pc=0x440a85 runtime.goexit() /usr/local/go/src/runtime/asm_amd64.s:1650 +0x1 fp=0xc00005b7e8 sp=0xc00005b7e0 pc=0x48d961 created by runtime.gcBgMarkStartWorkers in goroutine 1 /usr/local/go/src/runtime/mgc.go:1217 +0x1c goroutine 21 [GC worker (idle)]: runtime.gopark(0x2d865963289c?, 0x3?, 0xc0?, 0x89?, 0x0?) /usr/local/go/src/runtime/proc.go:398 +0xce fp=0xc000055f50 sp=0xc000055f30 pc=0x45de8e runtime.gcBgMarkWorker() /usr/local/go/src/runtime/mgc.go:1293 +0xe5 fp=0xc000055fe0 sp=0xc000055f50 pc=0x440a85 runtime.goexit() /usr/local/go/src/runtime/asm_amd64.s:1650 +0x1 fp=0xc000055fe8 sp=0xc000055fe0 pc=0x48d961 created by runtime.gcBgMarkStartWorkers in goroutine 1 /usr/local/go/src/runtime/mgc.go:1217 +0x1c goroutine 51 [GC worker (idle)]: runtime.gopark(0x2d865966f800?, 0x3?, 0xd8?, 0x8c?, 0x0?) /usr/local/go/src/runtime/proc.go:398 +0xce fp=0xc000588f50 sp=0xc000588f30 pc=0x45de8e runtime.gcBgMarkWorker() /usr/local/go/src/runtime/mgc.go:1293 +0xe5 fp=0xc000588fe0 sp=0xc000588f50 pc=0x440a85 runtime.goexit() /usr/local/go/src/runtime/asm_amd64.s:1650 +0x1 fp=0xc000588fe8 sp=0xc000588fe0 pc=0x48d961 created by runtime.gcBgMarkStartWorkers in goroutine 1 /usr/local/go/src/runtime/mgc.go:1217 +0x1c goroutine 52 [IO wait]: runtime.gopark(0x75?, 0xb?, 0x0?, 0x0?, 0x8?) /usr/local/go/src/runtime/proc.go:398 +0xce fp=0xc0004bd8f8 sp=0xc0004bd8d8 pc=0x45de8e runtime.netpollblock(0x49e718?, 0x428946?, 0x0?) /usr/local/go/src/runtime/netpoll.go:564 +0xf7 fp=0xc0004bd930 sp=0xc0004bd8f8 pc=0x456917 internal/poll.runtime_pollWait(0x7fa18ddcbd88, 0x72) /usr/local/go/src/runtime/netpoll.go:343 +0x85 fp=0xc0004bd950 sp=0xc0004bd930 pc=0x4880a5 internal/poll.(*pollDesc).wait(0xc000462080?, 0xc0004b2000?, 0x0) /usr/local/go/src/internal/poll/fd_poll_runtime.go:84 +0x27 fp=0xc0004bd978 sp=0xc0004bd950 pc=0x50f327 internal/poll.(*pollDesc).waitRead(...) /usr/local/go/src/internal/poll/fd_poll_runtime.go:89 internal/poll.(*FD).Read(0xc000462080, {0xc0004b2000, 0x1000, 0x1000}) /usr/local/go/src/internal/poll/fd_unix.go:164 +0x27a fp=0xc0004bda10 sp=0xc0004bd978 pc=0x51061a net.(*netFD).Read(0xc000462080, {0xc0004b2000?, 0x50f7e5?, 0x0?}) /usr/local/go/src/net/fd_posix.go:55 +0x25 fp=0xc0004bda58 sp=0xc0004bda10 pc=0x5893c5 net.(*conn).Read(0xc000592010, {0xc0004b2000?, 0x0?, 0xc00048a0f8?}) /usr/local/go/src/net/net.go:179 +0x45 fp=0xc0004bdaa0 sp=0xc0004bda58 pc=0x597665 net.(*TCPConn).Read(0xc00048a0f0?, {0xc0004b2000?, 0x0?, 0xc0004bdac0?}) <autogenerated>:1 +0x25 fp=0xc0004bdad0 sp=0xc0004bdaa0 pc=0x5a9565 net/http.(*connReader).Read(0xc00048a0f0, {0xc0004b2000, 0x1000, 0x1000}) /usr/local/go/src/net/http/server.go:791 +0x14b fp=0xc0004bdb20 sp=0xc0004bdad0 pc=0x6e458b bufio.(*Reader).fill(0xc000516060) /usr/local/go/src/bufio/bufio.go:113 +0x103 fp=0xc0004bdb58 sp=0xc0004bdb20 pc=0x6741c3 bufio.(*Reader).Peek(0xc000516060, 0x4) /usr/local/go/src/bufio/bufio.go:151 +0x53 fp=0xc0004bdb78 sp=0xc0004bdb58 pc=0x6742f3 net/http.(*conn).serve(0xc0000fc1b0, {0x1783ded8, 0xc0005800f0}) /usr/local/go/src/net/http/server.go:2044 +0x75c fp=0xc0004bdfb8 sp=0xc0004bdb78 pc=0x6ea43c net/http.(*Server).Serve.func3() /usr/local/go/src/net/http/server.go:3086 +0x28 fp=0xc0004bdfe0 sp=0xc0004bdfb8 pc=0x6eec08 runtime.goexit() /usr/local/go/src/runtime/asm_amd64.s:1650 +0x1 fp=0xc0004bdfe8 sp=0xc0004bdfe0 pc=0x48d961 created by net/http.(*Server).Serve in goroutine 1 /usr/local/go/src/net/http/server.go:3086 +0x5cb goroutine 53 [IO wait]: runtime.gopark(0x4f8?, 0xb?, 0x0?, 0x0?, 0x9?) /usr/local/go/src/runtime/proc.go:398 +0xce fp=0xc0004bf8f8 sp=0xc0004bf8d8 pc=0x45de8e runtime.netpollblock(0x49e718?, 0x428946?, 0x0?) /usr/local/go/src/runtime/netpoll.go:564 +0xf7 fp=0xc0004bf930 sp=0xc0004bf8f8 pc=0x456917 internal/poll.runtime_pollWait(0x7fa18ddcbc90, 0x72) /usr/local/go/src/runtime/netpoll.go:343 +0x85 fp=0xc0004bf950 sp=0xc0004bf930 pc=0x4880a5 internal/poll.(*pollDesc).wait(0xc000462180?, 0xc00053c000?, 0x0) /usr/local/go/src/internal/poll/fd_poll_runtime.go:84 +0x27 fp=0xc0004bf978 sp=0xc0004bf950 pc=0x50f327 internal/poll.(*pollDesc).waitRead(...) /usr/local/go/src/internal/poll/fd_poll_runtime.go:89 internal/poll.(*FD).Read(0xc000462180, {0xc00053c000, 0x1000, 0x1000}) /usr/local/go/src/internal/poll/fd_unix.go:164 +0x27a fp=0xc0004bfa10 sp=0xc0004bf978 pc=0x51061a net.(*netFD).Read(0xc000462180, {0xc00053c000?, 0x50f7e5?, 0x0?}) /usr/local/go/src/net/fd_posix.go:55 +0x25 fp=0xc0004bfa58 sp=0xc0004bfa10 pc=0x5893c5 net.(*conn).Read(0xc000592018, {0xc00053c000?, 0x0?, 0xc000580218?}) /usr/local/go/src/net/net.go:179 +0x45 fp=0xc0004bfaa0 sp=0xc0004bfa58 pc=0x597665 net.(*TCPConn).Read(0xc000580210?, {0xc00053c000?, 0x0?, 0xc000551ac0?}) <autogenerated>:1 +0x25 fp=0xc0004bfad0 sp=0xc0004bfaa0 pc=0x5a9565 net/http.(*connReader).Read(0xc000580210, {0xc00053c000, 0x1000, 0x1000}) /usr/local/go/src/net/http/server.go:791 +0x14b fp=0xc0004bfb20 sp=0xc0004bfad0 pc=0x6e458b bufio.(*Reader).fill(0xc00010e060) /usr/local/go/src/bufio/bufio.go:113 +0x103 fp=0xc0004bfb58 sp=0xc0004bfb20 pc=0x6741c3 bufio.(*Reader).Peek(0xc00010e060, 0x4) /usr/local/go/src/bufio/bufio.go:151 +0x53 fp=0xc0004bfb78 sp=0xc0004bfb58 pc=0x6742f3 net/http.(*conn).serve(0xc0000fc240, {0x1783ded8, 0xc0005800f0}) /usr/local/go/src/net/http/server.go:2044 +0x75c fp=0xc0004bffb8 sp=0xc0004bfb78 pc=0x6ea43c net/http.(*Server).Serve.func3() /usr/local/go/src/net/http/server.go:3086 +0x28 fp=0xc0004bffe0 sp=0xc0004bffb8 pc=0x6eec08 runtime.goexit() /usr/local/go/src/runtime/asm_amd64.s:1650 +0x1 fp=0xc0004bffe8 sp=0xc0004bffe0 pc=0x48d961 created by net/http.(*Server).Serve in goroutine 1 /usr/local/go/src/net/http/server.go:3086 +0x5cb goroutine 22 [IO wait]: runtime.gopark(0x4f8?, 0xb?, 0x0?, 0x0?, 0xa?) /usr/local/go/src/runtime/proc.go:398 +0xce fp=0xc0004b98f8 sp=0xc0004b98d8 pc=0x45de8e runtime.netpollblock(0x49e718?, 0x428946?, 0x0?) /usr/local/go/src/runtime/netpoll.go:564 +0xf7 fp=0xc0004b9930 sp=0xc0004b98f8 pc=0x456917 internal/poll.runtime_pollWait(0x7fa18ddcbb98, 0x72) /usr/local/go/src/runtime/netpoll.go:343 +0x85 fp=0xc0004b9950 sp=0xc0004b9930 pc=0x4880a5 internal/poll.(*pollDesc).wait(0xc000186000?, 0xc0000c8000?, 0x0) /usr/local/go/src/internal/poll/fd_poll_runtime.go:84 +0x27 fp=0xc0004b9978 sp=0xc0004b9950 pc=0x50f327 internal/poll.(*pollDesc).waitRead(...) /usr/local/go/src/internal/poll/fd_poll_runtime.go:89 internal/poll.(*FD).Read(0xc000186000, {0xc0000c8000, 0x1000, 0x1000}) /usr/local/go/src/internal/poll/fd_unix.go:164 +0x27a fp=0xc0004b9a10 sp=0xc0004b9978 pc=0x51061a net.(*netFD).Read(0xc000186000, {0xc0000c8000?, 0x50f7e5?, 0x0?}) /usr/local/go/src/net/fd_posix.go:55 +0x25 fp=0xc0004b9a58 sp=0xc0004b9a10 pc=0x5893c5 net.(*conn).Read(0xc000080000, {0xc0000c8000?, 0x0?, 0xc000098518?}) /usr/local/go/src/net/net.go:179 +0x45 fp=0xc0004b9aa0 sp=0xc0004b9a58 pc=0x597665 net.(*TCPConn).Read(0xc000098510?, {0xc0000c8000?, 0x0?, 0xc00054dac0?}) <autogenerated>:1 +0x25 fp=0xc0004b9ad0 sp=0xc0004b9aa0 pc=0x5a9565 net/http.(*connReader).Read(0xc000098510, {0xc0000c8000, 0x1000, 0x1000}) /usr/local/go/src/net/http/server.go:791 +0x14b fp=0xc0004b9b20 sp=0xc0004b9ad0 pc=0x6e458b bufio.(*Reader).fill(0xc00018c7e0) /usr/local/go/src/bufio/bufio.go:113 +0x103 fp=0xc0004b9b58 sp=0xc0004b9b20 pc=0x6741c3 bufio.(*Reader).Peek(0xc00018c7e0, 0x4) /usr/local/go/src/bufio/bufio.go:151 +0x53 fp=0xc0004b9b78 sp=0xc0004b9b58 pc=0x6742f3 net/http.(*conn).serve(0xc0000c6000, {0x1783ded8, 0xc0005800f0}) /usr/local/go/src/net/http/server.go:2044 +0x75c fp=0xc0004b9fb8 sp=0xc0004b9b78 pc=0x6ea43c net/http.(*Server).Serve.func3() /usr/local/go/src/net/http/server.go:3086 +0x28 fp=0xc0004b9fe0 sp=0xc0004b9fb8 pc=0x6eec08 runtime.goexit() /usr/local/go/src/runtime/asm_amd64.s:1650 +0x1 fp=0xc0004b9fe8 sp=0xc0004b9fe0 pc=0x48d961 created by net/http.(*Server).Serve in goroutine 1 /usr/local/go/src/net/http/server.go:3086 +0x5cb goroutine 11 [IO wait]: runtime.gopark(0x0?, 0xb?, 0x0?, 0x0?, 0xb?) /usr/local/go/src/runtime/proc.go:398 +0xce fp=0xc00058eda0 sp=0xc00058ed80 pc=0x45de8e runtime.netpollblock(0x49e718?, 0x428946?, 0x0?) /usr/local/go/src/runtime/netpoll.go:564 +0xf7 fp=0xc00058edd8 sp=0xc00058eda0 pc=0x456917 internal/poll.runtime_pollWait(0x7fa18ddcbaa0, 0x72) /usr/local/go/src/runtime/netpoll.go:343 +0x85 fp=0xc00058edf8 sp=0xc00058edd8 pc=0x4880a5 internal/poll.(*pollDesc).wait(0xc00041a000?, 0xc00048a6a1?, 0x0) /usr/local/go/src/internal/poll/fd_poll_runtime.go:84 +0x27 fp=0xc00058ee20 sp=0xc00058edf8 pc=0x50f327 internal/poll.(*pollDesc).waitRead(...) /usr/local/go/src/internal/poll/fd_poll_runtime.go:89 internal/poll.(*FD).Read(0xc00041a000, {0xc00048a6a1, 0x1, 0x1}) /usr/local/go/src/internal/poll/fd_unix.go:164 +0x27a fp=0xc00058eeb8 sp=0xc00058ee20 pc=0x51061a net.(*netFD).Read(0xc00041a000, {0xc00048a6a1?, 0xc00058ef40?, 0x48a030?}) /usr/local/go/src/net/fd_posix.go:55 +0x25 fp=0xc00058ef00 sp=0xc00058eeb8 pc=0x5893c5 net.(*conn).Read(0xc00005c040, {0xc00048a6a1?, 0x1?, 0xc0004240a0?}) /usr/local/go/src/net/net.go:179 +0x45 fp=0xc00058ef48 sp=0xc00058ef00 pc=0x597665 net.(*TCPConn).Read(0xc00048a0f0?, {0xc00048a6a1?, 0xc0004240a0?, 0x0?}) <autogenerated>:1 +0x25 fp=0xc00058ef78 sp=0xc00058ef48 pc=0x5a9565 net/http.(*connReader).backgroundRead(0xc00048a690) /usr/local/go/src/net/http/server.go:683 +0x37 fp=0xc00058efc8 sp=0xc00058ef78 pc=0x6e4157 net/http.(*connReader).startBackgroundRead.func2() /usr/local/go/src/net/http/server.go:679 +0x25 fp=0xc00058efe0 sp=0xc00058efc8 pc=0x6e4085 runtime.goexit() /usr/local/go/src/runtime/asm_amd64.s:1650 +0x1 fp=0xc00058efe8 sp=0xc00058efe0 pc=0x48d961 created by net/http.(*connReader).startBackgroundRead in goroutine 10 /usr/local/go/src/net/http/server.go:679 +0xba rax 0x0 rbx 0x7fa18f649640 rcx 0x7fa1d727b9fc rdx 0x6 rdi 0x211 rsi 0x211 rbp 0x211 rsp 0x7fa18f6481f0 r8 0x7fa18f6482c0 r9 0x1 r10 0x8 r11 0x246 r12 0x6 r13 0x16 r14 0x60c3f8000 r15 0x0 rip 0x7fa1d727b9fc rflags 0x246 cs 0x33 fs 0x0 gs 0x0 SIGABRT: abort PC=0x7fa1d727b9fc m=4 sigcode=18446744073709551610 signal arrived during cgo execution goroutine 10 [syscall]: runtime.cgocall(0x9c1470, 0xc0004ca608) /usr/local/go/src/runtime/cgocall.go:157 +0x4b fp=0xc0004ca5e0 sp=0xc0004ca5a8 pc=0x4291ab github.com/jmorganca/ollama/llm._Cfunc_dynamic_shim_llama_server_init({0x7fa1840014a0, 0x7fa13ce7b2e0, 0x7fa13ce6da80, 0x7fa13ce71270, 0x7fa13ce83770, 0x7fa13ce78900, 0x7fa13ce71430, 0x7fa13ce6db00, 0x7fa13ce7ea00, 0x7fa13ce7e5b0, ...}, ...) _cgo_gotypes.go:287 +0x45 fp=0xc0004ca608 sp=0xc0004ca5e0 pc=0x7cdd85 github.com/jmorganca/ollama/llm.(*shimExtServer).llama_server_init.func1(0x45971b?, 0x80?, 0x80?) /go/src/github.com/jmorganca/ollama/llm/shim_ext_server.go:40 +0xec fp=0xc0004ca6f8 sp=0xc0004ca608 pc=0x7d326c github.com/jmorganca/ollama/llm.(*shimExtServer).llama_server_init(0xc0000982d0?, 0x0?, 0x200?) /go/src/github.com/jmorganca/ollama/llm/shim_ext_server.go:40 +0x13 fp=0xc0004ca720 sp=0xc0004ca6f8 pc=0x7d3153 github.com/jmorganca/ollama/llm.newExtServer({0x17842518, 0xc0000f8360}, {0xc0004b6230, _}, {_, _, _}, {0x0, 0x0, 0x0}, ...) /go/src/github.com/jmorganca/ollama/llm/ext_server_common.go:146 +0x7f8 fp=0xc0004ca9a8 sp=0xc0004ca720 pc=0x7cf3b8 github.com/jmorganca/ollama/llm.newDynamicShimExtServer({0xc00047cf00, 0x2b}, {0xc0004b6230, _}, {_, _, _}, {0x0, 0x0, 0x0}, ...) /go/src/github.com/jmorganca/ollama/llm/shim_ext_server.go:93 +0x54f fp=0xc0004cabd0 sp=0xc0004ca9a8 pc=0x7d45af github.com/jmorganca/ollama/llm.newLlmServer({0xc3d801, 0x4}, {0xc0004b6230, _}, {_, _, _}, {0x0, 0x0, 0x0}, ...) /go/src/github.com/jmorganca/ollama/llm/llm.go:86 +0x16b fp=0xc0004cad60 sp=0xc0004cabd0 pc=0x7ccecb github.com/jmorganca/ollama/llm.New({0xc00048e180?, 0x0?}, {0xc0004b6230, _}, {_, _, _}, {0x0, 0x0, 0x0}, ...) /go/src/github.com/jmorganca/ollama/llm/llm.go:76 +0x233 fp=0xc0004caef0 sp=0xc0004cad60 pc=0x7ccb33 github.com/jmorganca/ollama/server.load(0xc0004d2000?, 0xc0004d2000, {{0x0, 0x800, 0x200, 0x1, 0xffffffffffffffff, 0x0, 0x0, 0x1, ...}, ...}, ...) /go/src/github.com/jmorganca/ollama/server/routes.go:84 +0x425 fp=0xc0004cb0a0 sp=0xc0004caef0 pc=0x99d825 github.com/jmorganca/ollama/server.GenerateHandler(0xc000522200) /go/src/github.com/jmorganca/ollama/server/routes.go:191 +0x8c8 fp=0xc0004cb748 sp=0xc0004cb0a0 pc=0x99e5c8 github.com/gin-gonic/gin.(*Context).Next(...) /root/go/pkg/mod/github.com/gin-gonic/gin@v1.9.1/context.go:174 github.com/jmorganca/ollama/server.(*Server).GenerateRoutes.func1(0xc000522200) /go/src/github.com/jmorganca/ollama/server/routes.go:876 +0x68 fp=0xc0004cb780 sp=0xc0004cb748 pc=0x9a79c8 github.com/gin-gonic/gin.(*Context).Next(...) /root/go/pkg/mod/github.com/gin-gonic/gin@v1.9.1/context.go:174 github.com/gin-gonic/gin.CustomRecoveryWithWriter.func1(0xc000522200) /root/go/pkg/mod/github.com/gin-gonic/gin@v1.9.1/recovery.go:102 +0x7a fp=0xc0004cb7d0 sp=0xc0004cb780 pc=0x9813ba github.com/gin-gonic/gin.(*Context).Next(...) /root/go/pkg/mod/github.com/gin-gonic/gin@v1.9.1/context.go:174 github.com/gin-gonic/gin.LoggerWithConfig.func1(0xc000522200) /root/go/pkg/mod/github.com/gin-gonic/gin@v1.9.1/logger.go:240 +0xde fp=0xc0004cb980 sp=0xc0004cb7d0 pc=0x98055e github.com/gin-gonic/gin.(*Context).Next(...) /root/go/pkg/mod/github.com/gin-gonic/gin@v1.9.1/context.go:174 github.com/gin-gonic/gin.(*Engine).handleHTTPRequest(0xc0000d1ba0, 0xc000522200) /root/go/pkg/mod/github.com/gin-gonic/gin@v1.9.1/gin.go:620 +0x65b fp=0xc0004cbb08 sp=0xc0004cb980 pc=0x97f61b github.com/gin-gonic/gin.(*Engine).ServeHTTP(0xc0000d1ba0, {0x1783c860?, 0xc00041e0e0}, 0xc000522300) /root/go/pkg/mod/github.com/gin-gonic/gin@v1.9.1/gin.go:576 +0x1dd fp=0xc0004cbb48 sp=0xc0004cbb08 pc=0x97eddd net/http.serverHandler.ServeHTTP({0x1783ab80?}, {0x1783c860?, 0xc00041e0e0?}, 0x6?) /usr/local/go/src/net/http/server.go:2938 +0x8e fp=0xc0004cbb78 sp=0xc0004cbb48 pc=0x6ee3ee net/http.(*conn).serve(0xc0004b4120, {0x1783ded8, 0xc0005800f0}) /usr/local/go/src/net/http/server.go:2009 +0x5f4 fp=0xc0004cbfb8 sp=0xc0004cbb78 pc=0x6ea2d4 net/http.(*Server).Serve.func3() /usr/local/go/src/net/http/server.go:3086 +0x28 fp=0xc0004cbfe0 sp=0xc0004cbfb8 pc=0x6eec08 runtime.goexit() /usr/local/go/src/runtime/asm_amd64.s:1650 +0x1 fp=0xc0004cbfe8 sp=0xc0004cbfe0 pc=0x48d961 created by net/http.(*Server).Serve in goroutine 1 /usr/local/go/src/net/http/server.go:3086 +0x5cb goroutine 1 [IO wait]: runtime.gopark(0x4a05b0?, 0xc000533828?, 0x78?, 0x38?, 0x5166dd?) /usr/local/go/src/runtime/proc.go:398 +0xce fp=0xc000615808 sp=0xc0006157e8 pc=0x45de8e runtime.netpollblock(0x48b9d2?, 0x428946?, 0x0?) /usr/local/go/src/runtime/netpoll.go:564 +0xf7 fp=0xc000615840 sp=0xc000615808 pc=0x456917 internal/poll.runtime_pollWait(0x7fa18ddcbe80, 0x72) /usr/local/go/src/runtime/netpoll.go:343 +0x85 fp=0xc000615860 sp=0xc000615840 pc=0x4880a5 internal/poll.(*pollDesc).wait(0xc000462000?, 0x4?, 0x0) /usr/local/go/src/internal/poll/fd_poll_runtime.go:84 +0x27 fp=0xc000615888 sp=0xc000615860 pc=0x50f327 internal/poll.(*pollDesc).waitRead(...) /usr/local/go/src/internal/poll/fd_poll_runtime.go:89 internal/poll.(*FD).Accept(0xc000462000) /usr/local/go/src/internal/poll/fd_unix.go:611 +0x2ac fp=0xc000615930 sp=0xc000615888 pc=0x51480c net.(*netFD).accept(0xc000462000) /usr/local/go/src/net/fd_unix.go:172 +0x29 fp=0xc0006159e8 sp=0xc000615930 pc=0x58b3e9 net.(*TCPListener).accept(0xc00043b5a0) /usr/local/go/src/net/tcpsock_posix.go:152 +0x1e fp=0xc000615a10 sp=0xc0006159e8 pc=0x5a01fe net.(*TCPListener).Accept(0xc00043b5a0) /usr/local/go/src/net/tcpsock.go:315 +0x30 fp=0xc000615a40 sp=0xc000615a10 pc=0x59f3b0 net/http.(*onceCloseListener).Accept(0xc0004b4120?) <autogenerated>:1 +0x24 fp=0xc000615a58 sp=0xc000615a40 pc=0x711184 net/http.(*Server).Serve(0xc000366ff0, {0x1783c650, 0xc00043b5a0}) /usr/local/go/src/net/http/server.go:3056 +0x364 fp=0xc000615b88 sp=0xc000615a58 pc=0x6ee844 github.com/jmorganca/ollama/server.Serve({0x1783c650, 0xc00043b5a0}) /go/src/github.com/jmorganca/ollama/server/routes.go:956 +0x389 fp=0xc000615c98 sp=0xc000615b88 pc=0x9a7da9 github.com/jmorganca/ollama/cmd.RunServer(0xc000460300?, {0x17d9db40?, 0x4?, 0xc3d4f5?}) /go/src/github.com/jmorganca/ollama/cmd/cmd.go:634 +0x199 fp=0xc000615d30 sp=0xc000615c98 pc=0x9b9f99 github.com/spf13/cobra.(*Command).execute(0xc000419800, {0x17d9db40, 0x0, 0x0}) /root/go/pkg/mod/github.com/spf13/cobra@v1.7.0/command.go:940 +0x87c fp=0xc000615e68 sp=0xc000615d30 pc=0x783fbc github.com/spf13/cobra.(*Command).ExecuteC(0xc000418c00) /root/go/pkg/mod/github.com/spf13/cobra@v1.7.0/command.go:1068 +0x3a5 fp=0xc000615f20 sp=0xc000615e68 pc=0x7847e5 github.com/spf13/cobra.(*Command).Execute(...) /root/go/pkg/mod/github.com/spf13/cobra@v1.7.0/command.go:992 github.com/spf13/cobra.(*Command).ExecuteContext(...) /root/go/pkg/mod/github.com/spf13/cobra@v1.7.0/command.go:985 main.main() /go/src/github.com/jmorganca/ollama/main.go:11 +0x4d fp=0xc000615f40 sp=0xc000615f20 pc=0x9c04cd runtime.main() /usr/local/go/src/runtime/proc.go:267 +0x2bb fp=0xc000615fe0 sp=0xc000615f40 pc=0x45da3b runtime.goexit() /usr/local/go/src/runtime/asm_amd64.s:1650 +0x1 fp=0xc000615fe8 sp=0xc000615fe0 pc=0x48d961 goroutine 2 [force gc (idle)]: runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?) /usr/local/go/src/runtime/proc.go:398 +0xce fp=0xc000058fa8 sp=0xc000058f88 pc=0x45de8e runtime.goparkunlock(...) /usr/local/go/src/runtime/proc.go:404 runtime.forcegchelper() /usr/local/go/src/runtime/proc.go:322 +0xb3 fp=0xc000058fe0 sp=0xc000058fa8 pc=0x45dd13 runtime.goexit() /usr/local/go/src/runtime/asm_amd64.s:1650 +0x1 fp=0xc000058fe8 sp=0xc000058fe0 pc=0x48d961 created by runtime.init.6 in goroutine 1 /usr/local/go/src/runtime/proc.go:310 +0x1a goroutine 3 [GC sweep wait]: runtime.gopark(0x1?, 0x0?, 0x0?, 0x0?, 0x0?) /usr/local/go/src/runtime/proc.go:398 +0xce fp=0xc000059778 sp=0xc000059758 pc=0x45de8e runtime.goparkunlock(...) /usr/local/go/src/runtime/proc.go:404 runtime.bgsweep(0x0?) /usr/local/go/src/runtime/mgcsweep.go:321 +0xdf fp=0xc0000597c8 sp=0xc000059778 pc=0x449ddf runtime.gcenable.func1() /usr/local/go/src/runtime/mgc.go:200 +0x25 fp=0xc0000597e0 sp=0xc0000597c8 pc=0x43ef05 runtime.goexit() /usr/local/go/src/runtime/asm_amd64.s:1650 +0x1 fp=0xc0000597e8 sp=0xc0000597e0 pc=0x48d961 created by runtime.gcenable in goroutine 1 /usr/local/go/src/runtime/mgc.go:200 +0x66 goroutine 4 [GC scavenge wait]: runtime.gopark(0x19a55b4?, 0x188b346?, 0x0?, 0x0?, 0x0?) /usr/local/go/src/runtime/proc.go:398 +0xce fp=0xc000059f70 sp=0xc000059f50 pc=0x45de8e runtime.goparkunlock(...) /usr/local/go/src/runtime/proc.go:404 runtime.(*scavengerState).park(0x17ca7640) /usr/local/go/src/runtime/mgcscavenge.go:425 +0x49 fp=0xc000059fa0 sp=0xc000059f70 pc=0x447609 runtime.bgscavenge(0x0?) /usr/local/go/src/runtime/mgcscavenge.go:658 +0x59 fp=0xc000059fc8 sp=0xc000059fa0 pc=0x447bb9 runtime.gcenable.func2() /usr/local/go/src/runtime/mgc.go:201 +0x25 fp=0xc000059fe0 sp=0xc000059fc8 pc=0x43eea5 runtime.goexit() /usr/local/go/src/runtime/asm_amd64.s:1650 +0x1 fp=0xc000059fe8 sp=0xc000059fe0 pc=0x48d961 created by runtime.gcenable in goroutine 1 /usr/local/go/src/runtime/mgc.go:201 +0xa5 goroutine 5 [finalizer wait]: runtime.gopark(0xc364c0?, 0x10045f001?, 0x0?, 0x0?, 0x466045?) /usr/local/go/src/runtime/proc.go:398 +0xce fp=0xc000058628 sp=0xc000058608 pc=0x45de8e runtime.runfinq() /usr/local/go/src/runtime/mfinal.go:193 +0x107 fp=0xc0000587e0 sp=0xc000058628 pc=0x43df87 runtime.goexit() /usr/local/go/src/runtime/asm_amd64.s:1650 +0x1 fp=0xc0000587e8 sp=0xc0000587e0 pc=0x48d961 created by runtime.createfing in goroutine 1 /usr/local/go/src/runtime/mfinal.go:163 +0x3d goroutine 6 [select, locked to thread]: runtime.gopark(0xc00005a7a8?, 0x2?, 0x29?, 0xe1?, 0xc00005a7a4?) /usr/local/go/src/runtime/proc.go:398 +0xce fp=0xc00005a638 sp=0xc00005a618 pc=0x45de8e runtime.selectgo(0xc00005a7a8, 0xc00005a7a0, 0x0?, 0x0, 0x0?, 0x1) /usr/local/go/src/runtime/select.go:327 +0x725 fp=0xc00005a758 sp=0xc00005a638 pc=0x46d9c5 runtime.ensureSigM.func1() /usr/local/go/src/runtime/signal_unix.go:1014 +0x19f fp=0xc00005a7e0 sp=0xc00005a758 pc=0x4849ff runtime.goexit() /usr/local/go/src/runtime/asm_amd64.s:1650 +0x1 fp=0xc00005a7e8 sp=0xc00005a7e0 pc=0x48d961 created by runtime.ensureSigM in goroutine 1 /usr/local/go/src/runtime/signal_unix.go:997 +0xc8 goroutine 18 [syscall]: runtime.notetsleepg(0x0?, 0x0?) /usr/local/go/src/runtime/lock_futex.go:236 +0x29 fp=0xc0000547a0 sp=0xc000054768 pc=0x4309e9 os/signal.signal_recv() /usr/local/go/src/runtime/sigqueue.go:152 +0x29 fp=0xc0000547c0 sp=0xc0000547a0 pc=0x48a329 os/signal.loop() /usr/local/go/src/os/signal/signal_unix.go:23 +0x13 fp=0xc0000547e0 sp=0xc0000547c0 pc=0x713bb3 runtime.goexit() /usr/local/go/src/runtime/asm_amd64.s:1650 +0x1 fp=0xc0000547e8 sp=0xc0000547e0 pc=0x48d961 created by os/signal.Notify.func1.1 in goroutine 1 /usr/local/go/src/os/signal/signal.go:151 +0x1f goroutine 7 [chan receive]: runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?) /usr/local/go/src/runtime/proc.go:398 +0xce fp=0xc00005af18 sp=0xc00005aef8 pc=0x45de8e runtime.chanrecv(0xc00018da40, 0x0, 0x1) /usr/local/go/src/runtime/chan.go:583 +0x3cd fp=0xc00005af90 sp=0xc00005af18 pc=0x42b58d runtime.chanrecv1(0x0?, 0x0?) /usr/local/go/src/runtime/chan.go:442 +0x12 fp=0xc00005afb8 sp=0xc00005af90 pc=0x42b192 github.com/jmorganca/ollama/server.Serve.func1() /go/src/github.com/jmorganca/ollama/server/routes.go:938 +0x25 fp=0xc00005afe0 sp=0xc00005afb8 pc=0x9a7ea5 runtime.goexit() /usr/local/go/src/runtime/asm_amd64.s:1650 +0x1 fp=0xc00005afe8 sp=0xc00005afe0 pc=0x48d961 created by github.com/jmorganca/ollama/server.Serve in goroutine 1 /go/src/github.com/jmorganca/ollama/server/routes.go:937 +0x285 goroutine 34 [GC worker (idle)]: runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?) /usr/local/go/src/runtime/proc.go:398 +0xce fp=0xc00058c750 sp=0xc00058c730 pc=0x45de8e runtime.gcBgMarkWorker() /usr/local/go/src/runtime/mgc.go:1293 +0xe5 fp=0xc00058c7e0 sp=0xc00058c750 pc=0x440a85 runtime.goexit() /usr/local/go/src/runtime/asm_amd64.s:1650 +0x1 fp=0xc00058c7e8 sp=0xc00058c7e0 pc=0x48d961 created by runtime.gcBgMarkStartWorkers in goroutine 1 /usr/local/go/src/runtime/mgc.go:1217 +0x1c goroutine 35 [GC worker (idle)]: runtime.gopark(0x2d8659670228?, 0x3?, 0xb4?, 0x48?, 0x0?) /usr/local/go/src/runtime/proc.go:398 +0xce fp=0xc00058cf50 sp=0xc00058cf30 pc=0x45de8e runtime.gcBgMarkWorker() /usr/local/go/src/runtime/mgc.go:1293 +0xe5 fp=0xc00058cfe0 sp=0xc00058cf50 pc=0x440a85 runtime.goexit() /usr/local/go/src/runtime/asm_amd64.s:1650 +0x1 fp=0xc00058cfe8 sp=0xc00058cfe0 pc=0x48d961 created by runtime.gcBgMarkStartWorkers in goroutine 1 /usr/local/go/src/runtime/mgc.go:1217 +0x1c goroutine 19 [GC worker (idle)]: runtime.gopark(0x2d8657fbbc81?, 0x3?, 0x24?, 0xe7?, 0x0?) /usr/local/go/src/runtime/proc.go:398 +0xce fp=0xc000054f50 sp=0xc000054f30 pc=0x45de8e runtime.gcBgMarkWorker() /usr/local/go/src/runtime/mgc.go:1293 +0xe5 fp=0xc000054fe0 sp=0xc000054f50 pc=0x440a85 runtime.goexit() /usr/local/go/src/runtime/asm_amd64.s:1650 +0x1 fp=0xc000054fe8 sp=0xc000054fe0 pc=0x48d961 created by runtime.gcBgMarkStartWorkers in goroutine 1 /usr/local/go/src/runtime/mgc.go:1217 +0x1c goroutine 20 [GC worker (idle)]: runtime.gopark(0x2d865966fa58?, 0x1?, 0xa0?, 0xd3?, 0x0?) /usr/local/go/src/runtime/proc.go:398 +0xce fp=0xc000055750 sp=0xc000055730 pc=0x45de8e runtime.gcBgMarkWorker() /usr/local/go/src/runtime/mgc.go:1293 +0xe5 fp=0xc0000557e0 sp=0xc000055750 pc=0x440a85 runtime.goexit() /usr/local/go/src/runtime/asm_amd64.s:1650 +0x1 fp=0xc0000557e8 sp=0xc0000557e0 pc=0x48d961 created by runtime.gcBgMarkStartWorkers in goroutine 1 /usr/local/go/src/runtime/mgc.go:1217 +0x1c goroutine 50 [GC worker (idle)]: runtime.gopark(0x2d8657fbb961?, 0x3?, 0x58?, 0x66?, 0x0?) /usr/local/go/src/runtime/proc.go:398 +0xce fp=0xc000588750 sp=0xc000588730 pc=0x45de8e runtime.gcBgMarkWorker() /usr/local/go/src/runtime/mgc.go:1293 +0xe5 fp=0xc0005887e0 sp=0xc000588750 pc=0x440a85 runtime.goexit() /usr/local/go/src/runtime/asm_amd64.s:1650 +0x1 fp=0xc0005887e8 sp=0xc0005887e0 pc=0x48d961 created by runtime.gcBgMarkStartWorkers in goroutine 1 /usr/local/go/src/runtime/mgc.go:1217 +0x1c goroutine 8 [GC worker (idle)]: runtime.gopark(0x17d9f7a0?, 0x1?, 0xe4?, 0xfa?, 0x0?) /usr/local/go/src/runtime/proc.go:398 +0xce fp=0xc00005b750 sp=0xc00005b730 pc=0x45de8e runtime.gcBgMarkWorker() /usr/local/go/src/runtime/mgc.go:1293 +0xe5 fp=0xc00005b7e0 sp=0xc00005b750 pc=0x440a85 runtime.goexit() /usr/local/go/src/runtime/asm_amd64.s:1650 +0x1 fp=0xc00005b7e8 sp=0xc00005b7e0 pc=0x48d961 created by runtime.gcBgMarkStartWorkers in goroutine 1 /usr/local/go/src/runtime/mgc.go:1217 +0x1c goroutine 21 [GC worker (idle)]: runtime.gopark(0x2d865963289c?, 0x3?, 0xc0?, 0x89?, 0x0?) /usr/local/go/src/runtime/proc.go:398 +0xce fp=0xc000055f50 sp=0xc000055f30 pc=0x45de8e runtime.gcBgMarkWorker() /usr/local/go/src/runtime/mgc.go:1293 +0xe5 fp=0xc000055fe0 sp=0xc000055f50 pc=0x440a85 runtime.goexit() /usr/local/go/src/runtime/asm_amd64.s:1650 +0x1 fp=0xc000055fe8 sp=0xc000055fe0 pc=0x48d961 created by runtime.gcBgMarkStartWorkers in goroutine 1 /usr/local/go/src/runtime/mgc.go:1217 +0x1c goroutine 51 [GC worker (idle)]: runtime.gopark(0x2d865966f800?, 0x3?, 0xd8?, 0x8c?, 0x0?) /usr/local/go/src/runtime/proc.go:398 +0xce fp=0xc000588f50 sp=0xc000588f30 pc=0x45de8e runtime.gcBgMarkWorker() /usr/local/go/src/runtime/mgc.go:1293 +0xe5 fp=0xc000588fe0 sp=0xc000588f50 pc=0x440a85 runtime.goexit() /usr/local/go/src/runtime/asm_amd64.s:1650 +0x1 fp=0xc000588fe8 sp=0xc000588fe0 pc=0x48d961 created by runtime.gcBgMarkStartWorkers in goroutine 1 /usr/local/go/src/runtime/mgc.go:1217 +0x1c goroutine 52 [IO wait]: runtime.gopark(0x75?, 0xb?, 0x0?, 0x0?, 0x8?) /usr/local/go/src/runtime/proc.go:398 +0xce fp=0xc0004bd8f8 sp=0xc0004bd8d8 pc=0x45de8e runtime.netpollblock(0x49e718?, 0x428946?, 0x0?) /usr/local/go/src/runtime/netpoll.go:564 +0xf7 fp=0xc0004bd930 sp=0xc0004bd8f8 pc=0x456917 internal/poll.runtime_pollWait(0x7fa18ddcbd88, 0x72) /usr/local/go/src/runtime/netpoll.go:343 +0x85 fp=0xc0004bd950 sp=0xc0004bd930 pc=0x4880a5 internal/poll.(*pollDesc).wait(0xc000462080?, 0xc0004b2000?, 0x0) /usr/local/go/src/internal/poll/fd_poll_runtime.go:84 +0x27 fp=0xc0004bd978 sp=0xc0004bd950 pc=0x50f327 internal/poll.(*pollDesc).waitRead(...) /usr/local/go/src/internal/poll/fd_poll_runtime.go:89 internal/poll.(*FD).Read(0xc000462080, {0xc0004b2000, 0x1000, 0x1000}) /usr/local/go/src/internal/poll/fd_unix.go:164 +0x27a fp=0xc0004bda10 sp=0xc0004bd978 pc=0x51061a net.(*netFD).Read(0xc000462080, {0xc0004b2000?, 0x50f7e5?, 0x0?}) /usr/local/go/src/net/fd_posix.go:55 +0x25 fp=0xc0004bda58 sp=0xc0004bda10 pc=0x5893c5 net.(*conn).Read(0xc000592010, {0xc0004b2000?, 0x0?, 0xc00048a0f8?}) /usr/local/go/src/net/net.go:179 +0x45 fp=0xc0004bdaa0 sp=0xc0004bda58 pc=0x597665 net.(*TCPConn).Read(0xc00048a0f0?, {0xc0004b2000?, 0x0?, 0xc0004bdac0?}) <autogenerated>:1 +0x25 fp=0xc0004bdad0 sp=0xc0004bdaa0 pc=0x5a9565 net/http.(*connReader).Read(0xc00048a0f0, {0xc0004b2000, 0x1000, 0x1000}) /usr/local/go/src/net/http/server.go:791 +0x14b fp=0xc0004bdb20 sp=0xc0004bdad0 pc=0x6e458b bufio.(*Reader).fill(0xc000516060) /usr/local/go/src/bufio/bufio.go:113 +0x103 fp=0xc0004bdb58 sp=0xc0004bdb20 pc=0x6741c3 bufio.(*Reader).Peek(0xc000516060, 0x4) /usr/local/go/src/bufio/bufio.go:151 +0x53 fp=0xc0004bdb78 sp=0xc0004bdb58 pc=0x6742f3 net/http.(*conn).serve(0xc0000fc1b0, {0x1783ded8, 0xc0005800f0}) /usr/local/go/src/net/http/server.go:2044 +0x75c fp=0xc0004bdfb8 sp=0xc0004bdb78 pc=0x6ea43c net/http.(*Server).Serve.func3() /usr/local/go/src/net/http/server.go:3086 +0x28 fp=0xc0004bdfe0 sp=0xc0004bdfb8 pc=0x6eec08 runtime.goexit() /usr/local/go/src/runtime/asm_amd64.s:1650 +0x1 fp=0xc0004bdfe8 sp=0xc0004bdfe0 pc=0x48d961 created by net/http.(*Server).Serve in goroutine 1 /usr/local/go/src/net/http/server.go:3086 +0x5cb goroutine 53 [IO wait]: runtime.gopark(0x4f8?, 0xb?, 0x0?, 0x0?, 0x9?) /usr/local/go/src/runtime/proc.go:398 +0xce fp=0xc0004bf8f8 sp=0xc0004bf8d8 pc=0x45de8e runtime.netpollblock(0x49e718?, 0x428946?, 0x0?) /usr/local/go/src/runtime/netpoll.go:564 +0xf7 fp=0xc0004bf930 sp=0xc0004bf8f8 pc=0x456917 internal/poll.runtime_pollWait(0x7fa18ddcbc90, 0x72) /usr/local/go/src/runtime/netpoll.go:343 +0x85 fp=0xc0004bf950 sp=0xc0004bf930 pc=0x4880a5 internal/poll.(*pollDesc).wait(0xc000462180?, 0xc00053c000?, 0x0) /usr/local/go/src/internal/poll/fd_poll_runtime.go:84 +0x27 fp=0xc0004bf978 sp=0xc0004bf950 pc=0x50f327 internal/poll.(*pollDesc).waitRead(...) /usr/local/go/src/internal/poll/fd_poll_runtime.go:89 internal/poll.(*FD).Read(0xc000462180, {0xc00053c000, 0x1000, 0x1000}) /usr/local/go/src/internal/poll/fd_unix.go:164 +0x27a fp=0xc0004bfa10 sp=0xc0004bf978 pc=0x51061a net.(*netFD).Read(0xc000462180, {0xc00053c000?, 0x50f7e5?, 0x0?}) /usr/local/go/src/net/fd_posix.go:55 +0x25 fp=0xc0004bfa58 sp=0xc0004bfa10 pc=0x5893c5 net.(*conn).Read(0xc000592018, {0xc00053c000?, 0x0?, 0xc000580218?}) /usr/local/go/src/net/net.go:179 +0x45 fp=0xc0004bfaa0 sp=0xc0004bfa58 pc=0x597665 net.(*TCPConn).Read(0xc000580210?, {0xc00053c000?, 0x0?, 0xc000551ac0?}) <autogenerated>:1 +0x25 fp=0xc0004bfad0 sp=0xc0004bfaa0 pc=0x5a9565 net/http.(*connReader).Read(0xc000580210, {0xc00053c000, 0x1000, 0x1000}) /usr/local/go/src/net/http/server.go:791 +0x14b fp=0xc0004bfb20 sp=0xc0004bfad0 pc=0x6e458b bufio.(*Reader).fill(0xc00010e060) /usr/local/go/src/bufio/bufio.go:113 +0x103 fp=0xc0004bfb58 sp=0xc0004bfb20 pc=0x6741c3 bufio.(*Reader).Peek(0xc00010e060, 0x4) /usr/local/go/src/bufio/bufio.go:151 +0x53 fp=0xc0004bfb78 sp=0xc0004bfb58 pc=0x6742f3 net/http.(*conn).serve(0xc0000fc240, {0x1783ded8, 0xc0005800f0}) /usr/local/go/src/net/http/server.go:2044 +0x75c fp=0xc0004bffb8 sp=0xc0004bfb78 pc=0x6ea43c net/http.(*Server).Serve.func3() /usr/local/go/src/net/http/server.go:3086 +0x28 fp=0xc0004bffe0 sp=0xc0004bffb8 pc=0x6eec08 runtime.goexit() /usr/local/go/src/runtime/asm_amd64.s:1650 +0x1 fp=0xc0004bffe8 sp=0xc0004bffe0 pc=0x48d961 created by net/http.(*Server).Serve in goroutine 1 /usr/local/go/src/net/http/server.go:3086 +0x5cb goroutine 22 [IO wait]: runtime.gopark(0x4f8?, 0xb?, 0x0?, 0x0?, 0xa?) /usr/local/go/src/runtime/proc.go:398 +0xce fp=0xc0004b98f8 sp=0xc0004b98d8 pc=0x45de8e runtime.netpollblock(0x49e718?, 0x428946?, 0x0?) /usr/local/go/src/runtime/netpoll.go:564 +0xf7 fp=0xc0004b9930 sp=0xc0004b98f8 pc=0x456917 internal/poll.runtime_pollWait(0x7fa18ddcbb98, 0x72) /usr/local/go/src/runtime/netpoll.go:343 +0x85 fp=0xc0004b9950 sp=0xc0004b9930 pc=0x4880a5 internal/poll.(*pollDesc).wait(0xc000186000?, 0xc0000c8000?, 0x0) /usr/local/go/src/internal/poll/fd_poll_runtime.go:84 +0x27 fp=0xc0004b9978 sp=0xc0004b9950 pc=0x50f327 internal/poll.(*pollDesc).waitRead(...) /usr/local/go/src/internal/poll/fd_poll_runtime.go:89 internal/poll.(*FD).Read(0xc000186000, {0xc0000c8000, 0x1000, 0x1000}) /usr/local/go/src/internal/poll/fd_unix.go:164 +0x27a fp=0xc0004b9a10 sp=0xc0004b9978 pc=0x51061a net.(*netFD).Read(0xc000186000, {0xc0000c8000?, 0x50f7e5?, 0x0?}) /usr/local/go/src/net/fd_posix.go:55 +0x25 fp=0xc0004b9a58 sp=0xc0004b9a10 pc=0x5893c5 net.(*conn).Read(0xc000080000, {0xc0000c8000?, 0x0?, 0xc000098518?}) /usr/local/go/src/net/net.go:179 +0x45 fp=0xc0004b9aa0 sp=0xc0004b9a58 pc=0x597665 net.(*TCPConn).Read(0xc000098510?, {0xc0000c8000?, 0x0?, 0xc00054dac0?}) <autogenerated>:1 +0x25 fp=0xc0004b9ad0 sp=0xc0004b9aa0 pc=0x5a9565 net/http.(*connReader).Read(0xc000098510, {0xc0000c8000, 0x1000, 0x1000}) /usr/local/go/src/net/http/server.go:791 +0x14b fp=0xc0004b9b20 sp=0xc0004b9ad0 pc=0x6e458b bufio.(*Reader).fill(0xc00018c7e0) /usr/local/go/src/bufio/bufio.go:113 +0x103 fp=0xc0004b9b58 sp=0xc0004b9b20 pc=0x6741c3 bufio.(*Reader).Peek(0xc00018c7e0, 0x4) /usr/local/go/src/bufio/bufio.go:151 +0x53 fp=0xc0004b9b78 sp=0xc0004b9b58 pc=0x6742f3 net/http.(*conn).serve(0xc0000c6000, {0x1783ded8, 0xc0005800f0}) /usr/local/go/src/net/http/server.go:2044 +0x75c fp=0xc0004b9fb8 sp=0xc0004b9b78 pc=0x6ea43c net/http.(*Server).Serve.func3() /usr/local/go/src/net/http/server.go:3086 +0x28 fp=0xc0004b9fe0 sp=0xc0004b9fb8 pc=0x6eec08 runtime.goexit() /usr/local/go/src/runtime/asm_amd64.s:1650 +0x1 fp=0xc0004b9fe8 sp=0xc0004b9fe0 pc=0x48d961 created by net/http.(*Server).Serve in goroutine 1 /usr/local/go/src/net/http/server.go:3086 +0x5cb goroutine 11 [IO wait]: runtime.gopark(0x0?, 0xb?, 0x0?, 0x0?, 0xb?) /usr/local/go/src/runtime/proc.go:398 +0xce fp=0xc00058eda0 sp=0xc00058ed80 pc=0x45de8e runtime.netpollblock(0x49e718?, 0x428946?, 0x0?) /usr/local/go/src/runtime/netpoll.go:564 +0xf7 fp=0xc00058edd8 sp=0xc00058eda0 pc=0x456917 internal/poll.runtime_pollWait(0x7fa18ddcbaa0, 0x72) /usr/local/go/src/runtime/netpoll.go:343 +0x85 fp=0xc00058edf8 sp=0xc00058edd8 pc=0x4880a5 internal/poll.(*pollDesc).wait(0xc00041a000?, 0xc00048a6a1?, 0x0) /usr/local/go/src/internal/poll/fd_poll_runtime.go:84 +0x27 fp=0xc00058ee20 sp=0xc00058edf8 pc=0x50f327 internal/poll.(*pollDesc).waitRead(...) /usr/local/go/src/internal/poll/fd_poll_runtime.go:89 internal/poll.(*FD).Read(0xc00041a000, {0xc00048a6a1, 0x1, 0x1}) /usr/local/go/src/internal/poll/fd_unix.go:164 +0x27a fp=0xc00058eeb8 sp=0xc00058ee20 pc=0x51061a net.(*netFD).Read(0xc00041a000, {0xc00048a6a1?, 0xc00058ef40?, 0x48a030?}) /usr/local/go/src/net/fd_posix.go:55 +0x25 fp=0xc00058ef00 sp=0xc00058eeb8 pc=0x5893c5 net.(*conn).Read(0xc00005c040, {0xc00048a6a1?, 0x1?, 0xc0004240a0?}) /usr/local/go/src/net/net.go:179 +0x45 fp=0xc00058ef48 sp=0xc00058ef00 pc=0x597665 net.(*TCPConn).Read(0xc00048a0f0?, {0xc00048a6a1?, 0xc0004240a0?, 0x0?}) <autogenerated>:1 +0x25 fp=0xc00058ef78 sp=0xc00058ef48 pc=0x5a9565 net/http.(*connReader).backgroundRead(0xc00048a690) /usr/local/go/src/net/http/server.go:683 +0x37 fp=0xc00058efc8 sp=0xc00058ef78 pc=0x6e4157 net/http.(*connReader).startBackgroundRead.func2() /usr/local/go/src/net/http/server.go:679 +0x25 fp=0xc00058efe0 sp=0xc00058efc8 pc=0x6e4085 runtime.goexit() /usr/local/go/src/runtime/asm_amd64.s:1650 +0x1 fp=0xc00058efe8 sp=0xc00058efe0 pc=0x48d961 created by net/http.(*connReader).startBackgroundRead in goroutine 10 /usr/local/go/src/net/http/server.go:679 +0xba rax 0x0 rbx 0x7fa18f649640 rcx 0x7fa1d727b9fc rdx 0x6 rdi 0x1f7 rsi 0x1fa rbp 0x1fa rsp 0x7fa18f6481f0 r8 0x7fa18f6482c0 r9 0x7fa18f648260 r10 0x8 r11 0x246 r12 0x6 r13 0x16 r14 0x60c3f8000 r15 0x0 rip 0x7fa1d727b9fc rflags 0x246 cs 0x33 fs 0x0 gs 0x0 Error: Post "http://127.0.0.1:11434/api/generate": EOF ``` Help would be appreciated
GiteaMirror added the nvidia label 2026-04-28 04:21:03 -05:00
Author
Owner

@pdevine commented on GitHub (Jan 9, 2024):

That's a very small GPU w/ 2 GB of memory. I would imagine only the very smallest models would work on it. What model were you trying to load?

That said, we should have fallen back to the CPU and only offloaded a couple of layers. cc @jmorganca @dhiltgen

<!-- gh-comment-id:1883598590 --> @pdevine commented on GitHub (Jan 9, 2024): That's a very small GPU w/ 2 GB of memory. I would imagine only the very smallest models would work on it. What model were you trying to load? That said, we should have fallen back to the CPU and only offloaded a couple of layers. cc @jmorganca @dhiltgen
Author
Owner

@Subie1 commented on GitHub (Jan 9, 2024):

@pdevine was trying to load tinyllama not only that I always expected it to fall back to CPU (used to do that in a previous version) when updating it stopped automatically doing that and ends up just crashing like so

<!-- gh-comment-id:1883603510 --> @Subie1 commented on GitHub (Jan 9, 2024): @pdevine was trying to load `tinyllama` not only that I always expected it to fall back to CPU (used to do that in a previous version) when updating it stopped automatically doing that and ends up just crashing like so
Author
Owner

@jmorganca commented on GitHub (Jan 9, 2024):

To chime in: @Subie1 a new update is coming later today that will do that, sorry you hit the error!

https://github.com/jmorganca/ollama/releases/tag/v0.1.19

<!-- gh-comment-id:1883608728 --> @jmorganca commented on GitHub (Jan 9, 2024): To chime in: @Subie1 a new update is coming later today that will do that, sorry you hit the error! https://github.com/jmorganca/ollama/releases/tag/v0.1.19
Author
Owner

@Subie1 commented on GitHub (Jan 9, 2024):

@jmorganca Got it, going to be waiting for the release & thanks for the reassurance.

<!-- gh-comment-id:1883616508 --> @Subie1 commented on GitHub (Jan 9, 2024): @jmorganca Got it, going to be waiting for the release & thanks for the reassurance.
Author
Owner

@dhiltgen commented on GitHub (Jan 9, 2024):

The crash should be resolved in the latest build for 0.1.19 (by falling back to CPU). Lets transition this issue to track CUDA support for 5.0 and 5.2 compute capability cards.

<!-- gh-comment-id:1883838683 --> @dhiltgen commented on GitHub (Jan 9, 2024): The crash should be resolved in the latest build for 0.1.19 (by falling back to CPU). Lets transition this issue to track CUDA support for 5.0 and 5.2 compute capability cards.
Author
Owner

@valentimarco commented on GitHub (Jan 10, 2024):

how we can help?
i have a gtx 960 (4g version) with a compute capability of 5.2 in my little server

<!-- gh-comment-id:1884344807 --> @valentimarco commented on GitHub (Jan 10, 2024): how we can help? i have a gtx 960 (4g version) with a compute capability of 5.2 in my little server
Author
Owner

@valentimarco commented on GitHub (Jan 10, 2024):

i tested the main repo of ollama with only 2 modificationn: change the CudaComputevalue to 5 and apply the change from #1880.
i tested phi and mistral and works just fine

<!-- gh-comment-id:1885884390 --> @valentimarco commented on GitHub (Jan 10, 2024): i tested the main repo of ollama with only 2 modificationn: change the CudaComputevalue to 5 and apply the change from #1880. i tested phi and mistral and works just fine
Author
Owner

@dhiltgen commented on GitHub (Jan 11, 2024):

Thanks!

Once we merge #1819 that will give us the foundation to be able to have multiple cuda libraries compiled to target different generations of compute capability. I'll take a look at this soon.

<!-- gh-comment-id:1887688310 --> @dhiltgen commented on GitHub (Jan 11, 2024): Thanks! Once we merge #1819 that will give us the foundation to be able to have multiple cuda libraries compiled to target different generations of compute capability. I'll take a look at this soon.
Author
Owner

@bonswouar commented on GitHub (Jan 19, 2024):

I personally was able to run some layers on my gpu (GTX 970 with 4GB of vram) by compiling with -DLLAMA_CUDA_FORCE_MMQ=on, but with the latest version it falls back to the cpu with INFO CUDA GPU is too old. Falling back to CPU mode. Compute Capability detected: 5.2
Any way to force it?

<!-- gh-comment-id:1900295807 --> @bonswouar commented on GitHub (Jan 19, 2024): I personally was able to run some layers on my gpu (GTX 970 with 4GB of vram) by compiling with `-DLLAMA_CUDA_FORCE_MMQ=on`, but with the latest version it falls back to the cpu with `INFO CUDA GPU is too old. Falling back to CPU mode. Compute Capability detected: 5.2` Any way to force it?
Author
Owner

@dhiltgen commented on GitHub (Jan 19, 2024):

Any way to force it?

It's not well oiled yet, but probably the fastest approach since you're building from source is to drop https://github.com/jmorganca/ollama/blob/main/gpu/gpu.go#L33 down to "5"

<!-- gh-comment-id:1900812283 --> @dhiltgen commented on GitHub (Jan 19, 2024): > Any way to force it? It's not well oiled yet, but probably the fastest approach since you're building from source is to drop https://github.com/jmorganca/ollama/blob/main/gpu/gpu.go#L33 down to "5"
Author
Owner

@bones0 commented on GitHub (Dec 10, 2024):

ollama 0.5.1 is detecting my NVIDIA Tesla M40, but they are not used. Maybe the included llama.cpp does not support it anymore?

<!-- gh-comment-id:2530842209 --> @bones0 commented on GitHub (Dec 10, 2024): ollama 0.5.1 is detecting my NVIDIA Tesla M40, but they are not used. Maybe the included llama.cpp does not support it anymore?
Author
Owner

@dhiltgen commented on GitHub (Dec 10, 2024):

@bones0 that GPU should work. Please open a new issue and include your server logs so we can investigate.

<!-- gh-comment-id:2532686055 --> @dhiltgen commented on GitHub (Dec 10, 2024): @bones0 that GPU should work. Please open a new issue and include your server logs so we can investigate.
Author
Owner

@ok4kalessin commented on GitHub (Jan 31, 2025):

Hello, Im facing this with version 0.5.7 , error: msg="[0] CUDA GPU is too old. Compute Capability detected: 1.0"
but having CUDA 9 installed, for nvidia 1000M, driver 390.157 .

any hint is appreaciated

<!-- gh-comment-id:2627519106 --> @ok4kalessin commented on GitHub (Jan 31, 2025): Hello, Im facing this with version 0.5.7 , error: msg="[0] CUDA GPU is too old. Compute Capability detected: 1.0" but having CUDA 9 installed, for nvidia 1000M, driver 390.157 . any hint is appreaciated
Author
Owner

@dhiltgen commented on GitHub (Feb 4, 2025):

@ok4kalessin I believe your GPU is ~14 years old, and not supported. We support nvidia GPUs going back ~10 years. (CC 5.0 and up.)

https://developer.nvidia.com/cuda-gpus

<!-- gh-comment-id:2632499888 --> @dhiltgen commented on GitHub (Feb 4, 2025): @ok4kalessin I believe your GPU is ~14 years old, and not supported. We support nvidia GPUs going back ~10 years. (CC 5.0 and up.) https://developer.nvidia.com/cuda-gpus
Sign in to join this conversation.
1 Participants
Notifications
Due Date
No due date set.
Dependencies

No dependencies set.

Reference: github-starred/ollama#47578