[GH-ISSUE #118] Crashed on M2 Air 8GB #46549

Closed
opened 2026-04-27 22:58:40 -05:00 by GiteaMirror · 6 comments
Owner

Originally created by @chsasank on GitHub (Jul 19, 2023).
Original GitHub issue: https://github.com/ollama/ollama/issues/118

llama.cpp: loading model from /Users/sasank/.ollama/models/blobs/sha256:8daa9615cce30c259a9555b1cc250d461d1bc69980a274b44d7eda0be78076d8
llama_model_load_internal: format     = ggjt v3 (latest)
llama_model_load_internal: n_vocab    = 32000
llama_model_load_internal: n_ctx      = 2048
llama_model_load_internal: n_embd     = 4096
llama_model_load_internal: n_mult     = 256
llama_model_load_internal: n_head     = 32
llama_model_load_internal: n_layer    = 32
llama_model_load_internal: n_rot      = 128
llama_model_load_internal: ftype      = 2 (mostly Q4_0)
llama_model_load_internal: n_ff       = 11008
llama_model_load_internal: model size = 7B
llama_model_load_internal: ggml ctx size =    0.08 MB
llama_model_load_internal: mem required  = 5407.72 MB (+ 1026.00 MB per state)
llama_new_context_with_model: kv self size  = 1024.00 MB
ggml_metal_init: allocating
ggml_metal_init: using MPS
ggml_metal_init: loading '/Users/sasank/code/llama/ollama/ggml-metal.metal'
ggml_metal_init: loaded kernel_add                            0x12aa075a0
ggml_metal_init: loaded kernel_mul                            0x12ab05ee0
ggml_metal_init: loaded kernel_mul_row                        0x12ab06530
ggml_metal_init: loaded kernel_scale                          0x12aa07de0
ggml_metal_init: loaded kernel_silu                           0x12aa08300
ggml_metal_init: loaded kernel_relu                           0x12ab06930
ggml_metal_init: loaded kernel_gelu                           0x12ab06e50
ggml_metal_init: loaded kernel_soft_max                       0x12ab076b0
ggml_metal_init: loaded kernel_diag_mask_inf                  0x12ab07d30
ggml_metal_init: loaded kernel_get_rows_f16                   0x12aa089e0
ggml_metal_init: loaded kernel_get_rows_q4_0                  0x12aa091a0
ggml_metal_init: loaded kernel_get_rows_q4_1                  0x12aa09b30
ggml_metal_init: loaded kernel_get_rows_q2_K                  0x12ab082b0
ggml_metal_init: loaded kernel_get_rows_q3_K                  0x12ab08a70
ggml_metal_init: loaded kernel_get_rows_q4_K                  0x12aa0a0b0
ggml_metal_init: loaded kernel_get_rows_q5_K                  0x12aa0a8b0
ggml_metal_init: loaded kernel_get_rows_q6_K                  0x12aa0af50
ggml_metal_init: loaded kernel_rms_norm                       0x12ab09140
ggml_metal_init: loaded kernel_norm                           0x12ab09920
ggml_metal_init: loaded kernel_mul_mat_f16_f32                0x12aa0b9f0
ggml_metal_init: loaded kernel_mul_mat_q4_0_f32               0x12aa0be30
ggml_metal_init: loaded kernel_mul_mat_q4_1_f32               0x12aa0c530
ggml_metal_init: loaded kernel_mul_mat_q2_K_f32               0x12ab0a350
ggml_metal_init: loaded kernel_mul_mat_q3_K_f32               0x12ab0af40
ggml_metal_init: loaded kernel_mul_mat_q4_K_f32               0x12ab0b5c0
ggml_metal_init: loaded kernel_mul_mat_q5_K_f32               0x12aa0c930
ggml_metal_init: loaded kernel_mul_mat_q6_K_f32               0x12ab0bba0
ggml_metal_init: loaded kernel_rope                           0x12ab0ca80
ggml_metal_init: loaded kernel_alibi_f32                      0x12ab0d360
ggml_metal_init: loaded kernel_cpy_f32_f16                    0x12ab0dc10
ggml_metal_init: loaded kernel_cpy_f32_f32                    0x12ab0e4c0
ggml_metal_init: loaded kernel_cpy_f16_f16                    0x12aa0d550
ggml_metal_init: recommendedMaxWorkingSetSize =  5461.34 MB
ggml_metal_init: hasUnifiedMemory             = true
ggml_metal_init: maxTransferRate              = built-in GPU
llama_new_context_with_model: max tensor size =    70.31 MB
ggml_metal_add_buffer: allocated 'data            ' buffer, size =  3616.08 MB, ( 3616.47 /  5461.34)
ggml_metal_add_buffer: allocated 'eval            ' buffer, size =   768.00 MB, ( 4384.47 /  5461.34)
ggml_metal_add_buffer: allocated 'kv              ' buffer, size =  1026.00 MB, ( 5410.47 /  5461.34)
ggml_metal_add_buffer: allocated 'scr0            ' buffer, size =   512.00 MB, ( 5922.47 /  5461.34), warning: current allocated size is greater than the recommended max working set size
ggml_metal_add_buffer: allocated 'scr1            ' buffer, size =   512.00 MB, ( 6434.47 /  5461.34), warning: current allocated size is greater than the recommended max working set size
ggml_metal_graph_compute: command buffer 0 failed with status 5
GGML_ASSERT: ggml-metal.m:1013: false
SIGABRT: abort
PC=0x19296c724 m=5 sigcode=0
signal arrived during cgo execution

goroutine 6 [syscall]:
runtime.cgocall(0x100c920c0, 0x140000bd298)
	/opt/homebrew/Cellar/go/1.20.6/libexec/src/runtime/cgocall.go:157 +0x54 fp=0x140000bd260 sp=0x140000bd220 pc=0x100799994
github.com/jmorganca/ollama/llama._Cfunc_llama_eval(0x144008a00, 0x14000486c88, 0x1, 0x0, 0x8)
	_cgo_gotypes.go:208 +0x38 fp=0x140000bd290 sp=0x140000bd260 pc=0x100c81e18
github.com/jmorganca/ollama/llama.New.func4(0x99?, {0x14000486c88, 0x1, 0x14000178540?}, {0xffffffffffffffff, 0x0, 0x800, 0x200, 0x1, 0x0, ...})
	/Users/sasank/code/llama/ollama/llama/llama.go:141 +0x7c fp=0x140000bd2e0 sp=0x140000bd290 pc=0x100c82c2c
github.com/jmorganca/ollama/llama.New({0x140007fc310, 0x6a}, {0xffffffffffffffff, 0x0, 0x800, 0x200, 0x1, 0x0, 0x0, 0x1, ...})
	/Users/sasank/code/llama/ollama/llama/llama.go:141 +0x278 fp=0x140000bd4a0 sp=0x140000bd2e0 pc=0x100c829e8
github.com/jmorganca/ollama/server.generate(0x140000b4300)
	/Users/sasank/code/llama/ollama/server/routes.go:70 +0x700 fp=0x140000bd6e0 sp=0x140000bd4a0 pc=0x100c8d6b0
github.com/gin-gonic/gin.(*Context).Next(...)
	/Users/sasank/go/pkg/mod/github.com/gin-gonic/gin@v1.9.1/context.go:174
github.com/gin-gonic/gin.CustomRecoveryWithWriter.func1(0x140000b4300)
	/Users/sasank/go/pkg/mod/github.com/gin-gonic/gin@v1.9.1/recovery.go:102 +0x7c fp=0x140000bd730 sp=0x140000bd6e0 pc=0x100c7950c
github.com/gin-gonic/gin.(*Context).Next(...)
	/Users/sasank/go/pkg/mod/github.com/gin-gonic/gin@v1.9.1/context.go:174
github.com/gin-gonic/gin.LoggerWithConfig.func1(0x140000b4300)
	/Users/sasank/go/pkg/mod/github.com/gin-gonic/gin@v1.9.1/logger.go:240 +0xac fp=0x140000bd8e0 sp=0x140000bd730 pc=0x100c7878c
github.com/gin-gonic/gin.(*Context).Next(...)
	/Users/sasank/go/pkg/mod/github.com/gin-gonic/gin@v1.9.1/context.go:174
github.com/gin-gonic/gin.(*Engine).handleHTTPRequest(0x14000145ba0, 0x140000b4300)
	/Users/sasank/go/pkg/mod/github.com/gin-gonic/gin@v1.9.1/gin.go:620 +0x54c fp=0x140000bda70 sp=0x140000bd8e0 pc=0x100c7789c
github.com/gin-gonic/gin.(*Engine).ServeHTTP(0x14000145ba0, {0x100f019c0?, 0x140004ee1c0}, 0x140000b4200)
	/Users/sasank/go/pkg/mod/github.com/gin-gonic/gin@v1.9.1/gin.go:576 +0x1d4 fp=0x140000bdab0 sp=0x140000bda70 pc=0x100c771a4
net/http.serverHandler.ServeHTTP({0x100effa38?}, {0x100f019c0, 0x140004ee1c0}, 0x140000b4200)
	/opt/homebrew/Cellar/go/1.20.6/libexec/src/net/http/server.go:2936 +0x2d8 fp=0x140000bdb60 sp=0x140000bdab0 pc=0x100a152a8
net/http.(*conn).serve(0x1400017a900, {0x100f02038, 0x1400046e060})
	/opt/homebrew/Cellar/go/1.20.6/libexec/src/net/http/server.go:1995 +0x560 fp=0x140000bdfa0 sp=0x140000bdb60 pc=0x100a10fa0
net/http.(*Server).Serve.func3()
	/opt/homebrew/Cellar/go/1.20.6/libexec/src/net/http/server.go:3089 +0x30 fp=0x140000bdfd0 sp=0x140000bdfa0 pc=0x100a15ad0
runtime.goexit()
	/opt/homebrew/Cellar/go/1.20.6/libexec/src/runtime/asm_arm64.s:1172 +0x4 fp=0x140000bdfd0 sp=0x140000bdfd0 pc=0x1007fc324
created by net/http.(*Server).Serve
	/opt/homebrew/Cellar/go/1.20.6/libexec/src/net/http/server.go:3089 +0x520

goroutine 1 [IO wait, 14 minutes]:
runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?)
	/opt/homebrew/Cellar/go/1.20.6/libexec/src/runtime/proc.go:381 +0xe4 fp=0x1400011f860 sp=0x1400011f840 pc=0x1007ccaa4
runtime.netpollblock(0x1400031f8f8?, 0x87f1a4?, 0x1?)
	/opt/homebrew/Cellar/go/1.20.6/libexec/src/runtime/netpoll.go:527 +0x158 fp=0x1400011f8a0 sp=0x1400011f860 pc=0x1007c6138
internal/poll.runtime_pollWait(0x1289ada18, 0x72)
	/opt/homebrew/Cellar/go/1.20.6/libexec/src/runtime/netpoll.go:306 +0xa0 fp=0x1400011f8d0 sp=0x1400011f8a0 pc=0x1007f61b0
internal/poll.(*pollDesc).wait(0x1400044a580?, 0x0?, 0x0)
	/opt/homebrew/Cellar/go/1.20.6/libexec/src/internal/poll/fd_poll_runtime.go:84 +0x28 fp=0x1400011f900 sp=0x1400011f8d0 pc=0x10087a7e8
internal/poll.(*pollDesc).waitRead(...)
	/opt/homebrew/Cellar/go/1.20.6/libexec/src/internal/poll/fd_poll_runtime.go:89
internal/poll.(*FD).Accept(0x1400044a580)
	/opt/homebrew/Cellar/go/1.20.6/libexec/src/internal/poll/fd_unix.go:614 +0x250 fp=0x1400011f9b0 sp=0x1400011f900 pc=0x10087f290
net.(*netFD).accept(0x1400044a580)
	/opt/homebrew/Cellar/go/1.20.6/libexec/src/net/fd_unix.go:172 +0x28 fp=0x1400011fa70 sp=0x1400011f9b0 pc=0x1008be278
net.(*TCPListener).accept(0x1400000edb0)
	/opt/homebrew/Cellar/go/1.20.6/libexec/src/net/tcpsock_posix.go:148 +0x28 fp=0x1400011faa0 sp=0x1400011fa70 pc=0x1008d3878
net.(*TCPListener).Accept(0x1400000edb0)
	/opt/homebrew/Cellar/go/1.20.6/libexec/src/net/tcpsock.go:297 +0x2c fp=0x1400011fae0 sp=0x1400011faa0 pc=0x1008d29ec
net/http.(*onceCloseListener).Accept(0x1400017a900?)
	<autogenerated>:1 +0x30 fp=0x1400011fb00 sp=0x1400011fae0 pc=0x100a39250
net/http.(*Server).Serve(0x14000366ff0, {0x100f017b0, 0x1400000edb0})
	/opt/homebrew/Cellar/go/1.20.6/libexec/src/net/http/server.go:3059 +0x304 fp=0x1400011fc30 sp=0x1400011fb00 pc=0x100a15774
github.com/jmorganca/ollama/server.Serve({0x100f017b0, 0x1400000edb0})
	/Users/sasank/code/llama/ollama/server/routes.go:238 +0x250 fp=0x1400011fca0 sp=0x1400011fc30 pc=0x100c8f4e0
github.com/jmorganca/ollama/cmd.RunServer(0x14000419200?, {0x100ce1dcb?, 0x0?, 0x0?})
	/Users/sasank/code/llama/ollama/cmd/cmd.go:272 +0x114 fp=0x1400011fd20 sp=0x1400011fca0 pc=0x100c91454
github.com/spf13/cobra.(*Command).execute(0x14000419200, {0x101365c48, 0x0, 0x0})
	/Users/sasank/go/pkg/mod/github.com/spf13/cobra@v1.7.0/command.go:940 +0x5c8 fp=0x1400011fe60 sp=0x1400011fd20 pc=0x100aaf628
github.com/spf13/cobra.(*Command).ExecuteC(0x14000418900)
	/Users/sasank/go/pkg/mod/github.com/spf13/cobra@v1.7.0/command.go:1068 +0x35c fp=0x1400011ff20 sp=0x1400011fe60 pc=0x100aafd7c
github.com/spf13/cobra.(*Command).Execute(...)
	/Users/sasank/go/pkg/mod/github.com/spf13/cobra@v1.7.0/command.go:992
github.com/spf13/cobra.(*Command).ExecuteContext(0x14000054768?, {0x100f01fc8?, 0x140000280b0?})
	/Users/sasank/go/pkg/mod/github.com/spf13/cobra@v1.7.0/command.go:985 +0x50 fp=0x1400011ff40 sp=0x1400011ff20 pc=0x100aaf910
main.main()
	/Users/sasank/code/llama/ollama/main.go:10 +0x34 fp=0x1400011ff70 sp=0x1400011ff40 pc=0x100c91e94
runtime.main()
	/opt/homebrew/Cellar/go/1.20.6/libexec/src/runtime/proc.go:250 +0x248 fp=0x1400011ffd0 sp=0x1400011ff70 pc=0x1007cc678
runtime.goexit()
	/opt/homebrew/Cellar/go/1.20.6/libexec/src/runtime/asm_arm64.s:1172 +0x4 fp=0x1400011ffd0 sp=0x1400011ffd0 pc=0x1007fc324

goroutine 2 [force gc (idle), 14 minutes]:
runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?)
	/opt/homebrew/Cellar/go/1.20.6/libexec/src/runtime/proc.go:381 +0xe4 fp=0x14000054fa0 sp=0x14000054f80 pc=0x1007ccaa4
runtime.goparkunlock(...)
	/opt/homebrew/Cellar/go/1.20.6/libexec/src/runtime/proc.go:387
runtime.forcegchelper()
	/opt/homebrew/Cellar/go/1.20.6/libexec/src/runtime/proc.go:305 +0xb8 fp=0x14000054fd0 sp=0x14000054fa0 pc=0x1007cc8e8
runtime.goexit()
	/opt/homebrew/Cellar/go/1.20.6/libexec/src/runtime/asm_arm64.s:1172 +0x4 fp=0x14000054fd0 sp=0x14000054fd0 pc=0x1007fc324
created by runtime.init.6
	/opt/homebrew/Cellar/go/1.20.6/libexec/src/runtime/proc.go:293 +0x24

goroutine 3 [GC sweep wait]:
runtime.gopark(0x1?, 0x0?, 0x0?, 0x0?, 0x0?)
	/opt/homebrew/Cellar/go/1.20.6/libexec/src/runtime/proc.go:381 +0xe4 fp=0x14000055760 sp=0x14000055740 pc=0x1007ccaa4
runtime.goparkunlock(...)
	/opt/homebrew/Cellar/go/1.20.6/libexec/src/runtime/proc.go:387
runtime.bgsweep(0x0?)
	/opt/homebrew/Cellar/go/1.20.6/libexec/src/runtime/mgcsweep.go:319 +0x110 fp=0x140000557b0 sp=0x14000055760 pc=0x1007b9960
runtime.gcenable.func1()
	/opt/homebrew/Cellar/go/1.20.6/libexec/src/runtime/mgc.go:178 +0x28 fp=0x140000557d0 sp=0x140000557b0 pc=0x1007ae408
runtime.goexit()
	/opt/homebrew/Cellar/go/1.20.6/libexec/src/runtime/asm_arm64.s:1172 +0x4 fp=0x140000557d0 sp=0x140000557d0 pc=0x1007fc324
created by runtime.gcenable
	/opt/homebrew/Cellar/go/1.20.6/libexec/src/runtime/mgc.go:178 +0x74

goroutine 4 [GC scavenge wait]:
runtime.gopark(0x12b0f92?, 0x1291938?, 0x0?, 0x0?, 0x0?)
	/opt/homebrew/Cellar/go/1.20.6/libexec/src/runtime/proc.go:381 +0xe4 fp=0x14000055f50 sp=0x14000055f30 pc=0x1007ccaa4
runtime.goparkunlock(...)
	/opt/homebrew/Cellar/go/1.20.6/libexec/src/runtime/proc.go:387
runtime.(*scavengerState).park(0x1012aa960)
	/opt/homebrew/Cellar/go/1.20.6/libexec/src/runtime/mgcscavenge.go:400 +0x5c fp=0x14000055f80 sp=0x14000055f50 pc=0x1007b776c
runtime.bgscavenge(0x0?)
	/opt/homebrew/Cellar/go/1.20.6/libexec/src/runtime/mgcscavenge.go:633 +0xac fp=0x14000055fb0 sp=0x14000055f80 pc=0x1007b7d4c
runtime.gcenable.func2()
	/opt/homebrew/Cellar/go/1.20.6/libexec/src/runtime/mgc.go:179 +0x28 fp=0x14000055fd0 sp=0x14000055fb0 pc=0x1007ae3a8
runtime.goexit()
	/opt/homebrew/Cellar/go/1.20.6/libexec/src/runtime/asm_arm64.s:1172 +0x4 fp=0x14000055fd0 sp=0x14000055fd0 pc=0x1007fc324
created by runtime.gcenable
	/opt/homebrew/Cellar/go/1.20.6/libexec/src/runtime/mgc.go:179 +0xb8

goroutine 5 [finalizer wait, 12 minutes]:
runtime.gopark(0x0?, 0x1400048a138?, 0x20?, 0x1?, 0x1000000010?)
	/opt/homebrew/Cellar/go/1.20.6/libexec/src/runtime/proc.go:381 +0xe4 fp=0x14000065d80 sp=0x14000065d60 pc=0x1007ccaa4
runtime.runfinq()
	/opt/homebrew/Cellar/go/1.20.6/libexec/src/runtime/mfinal.go:193 +0x10c fp=0x14000065fd0 sp=0x14000065d80 pc=0x1007ad49c
runtime.goexit()
	/opt/homebrew/Cellar/go/1.20.6/libexec/src/runtime/asm_arm64.s:1172 +0x4 fp=0x14000065fd0 sp=0x14000065fd0 pc=0x1007fc324
created by runtime.createfing
	/opt/homebrew/Cellar/go/1.20.6/libexec/src/runtime/mfinal.go:163 +0x84

goroutine 26 [select]:
runtime.gopark(0x1400051ff80?, 0x2?, 0xa0?, 0x61?, 0x1400051ff24?)
	/opt/homebrew/Cellar/go/1.20.6/libexec/src/runtime/proc.go:381 +0xe4 fp=0x1400051fdb0 sp=0x1400051fd90 pc=0x1007ccaa4
runtime.selectgo(0x1400051ff80, 0x1400051ff20, 0x14000282680?, 0x0, 0x0?, 0x1)
	/opt/homebrew/Cellar/go/1.20.6/libexec/src/runtime/select.go:327 +0x690 fp=0x1400051fed0 sp=0x1400051fdb0 pc=0x1007dd1a0
net/http.(*persistConn).writeLoop(0x14000128d80)
	/opt/homebrew/Cellar/go/1.20.6/libexec/src/net/http/transport.go:2410 +0x9c fp=0x1400051ffb0 sp=0x1400051fed0 pc=0x100a2a74c
net/http.(*Transport).dialConn.func6()
	/opt/homebrew/Cellar/go/1.20.6/libexec/src/net/http/transport.go:1766 +0x28 fp=0x1400051ffd0 sp=0x1400051ffb0 pc=0x100a27458
runtime.goexit()
	/opt/homebrew/Cellar/go/1.20.6/libexec/src/runtime/asm_arm64.s:1172 +0x4 fp=0x1400051ffd0 sp=0x1400051ffd0 pc=0x1007fc324
created by net/http.(*Transport).dialConn
	/opt/homebrew/Cellar/go/1.20.6/libexec/src/net/http/transport.go:1766 +0x1214

goroutine 13 [GC worker (idle), 1 minutes]:
runtime.gopark(0x4f330c0464e0f?, 0x1?, 0x27?, 0xdf?, 0x0?)
	/opt/homebrew/Cellar/go/1.20.6/libexec/src/runtime/proc.go:381 +0xe4 fp=0x14000056f40 sp=0x14000056f20 pc=0x1007ccaa4
runtime.gcBgMarkWorker()
	/opt/homebrew/Cellar/go/1.20.6/libexec/src/runtime/mgc.go:1275 +0xec fp=0x14000056fd0 sp=0x14000056f40 pc=0x1007b034c
runtime.goexit()
	/opt/homebrew/Cellar/go/1.20.6/libexec/src/runtime/asm_arm64.s:1172 +0x4 fp=0x14000056fd0 sp=0x14000056fd0 pc=0x1007fc324
created by runtime.gcBgMarkStartWorkers
	/opt/homebrew/Cellar/go/1.20.6/libexec/src/runtime/mgc.go:1199 +0x28

goroutine 20 [GC worker (idle)]:
runtime.gopark(0x1013673a0?, 0x1?, 0x16?, 0xeb?, 0x0?)
	/opt/homebrew/Cellar/go/1.20.6/libexec/src/runtime/proc.go:381 +0xe4 fp=0x14000057740 sp=0x14000057720 pc=0x1007ccaa4
runtime.gcBgMarkWorker()
	/opt/homebrew/Cellar/go/1.20.6/libexec/src/runtime/mgc.go:1275 +0xec fp=0x140000577d0 sp=0x14000057740 pc=0x1007b034c
runtime.goexit()
	/opt/homebrew/Cellar/go/1.20.6/libexec/src/runtime/asm_arm64.s:1172 +0x4 fp=0x140000577d0 sp=0x140000577d0 pc=0x1007fc324
created by runtime.gcBgMarkStartWorkers
	/opt/homebrew/Cellar/go/1.20.6/libexec/src/runtime/mgc.go:1199 +0x28

goroutine 21 [GC worker (idle)]:
runtime.gopark(0x4f347a631f1b8?, 0x3?, 0xc3?, 0x8e?, 0x0?)
	/opt/homebrew/Cellar/go/1.20.6/libexec/src/runtime/proc.go:381 +0xe4 fp=0x14000050740 sp=0x14000050720 pc=0x1007ccaa4
runtime.gcBgMarkWorker()
	/opt/homebrew/Cellar/go/1.20.6/libexec/src/runtime/mgc.go:1275 +0xec fp=0x140000507d0 sp=0x14000050740 pc=0x1007b034c
runtime.goexit()
	/opt/homebrew/Cellar/go/1.20.6/libexec/src/runtime/asm_arm64.s:1172 +0x4 fp=0x140000507d0 sp=0x140000507d0 pc=0x1007fc324
created by runtime.gcBgMarkStartWorkers
	/opt/homebrew/Cellar/go/1.20.6/libexec/src/runtime/mgc.go:1199 +0x28

goroutine 14 [GC worker (idle)]:
runtime.gopark(0x4f347a634141b?, 0x3?, 0x77?, 0xc?, 0x0?)
	/opt/homebrew/Cellar/go/1.20.6/libexec/src/runtime/proc.go:381 +0xe4 fp=0x14000057f40 sp=0x14000057f20 pc=0x1007ccaa4
runtime.gcBgMarkWorker()
	/opt/homebrew/Cellar/go/1.20.6/libexec/src/runtime/mgc.go:1275 +0xec fp=0x14000057fd0 sp=0x14000057f40 pc=0x1007b034c
runtime.goexit()
	/opt/homebrew/Cellar/go/1.20.6/libexec/src/runtime/asm_arm64.s:1172 +0x4 fp=0x14000057fd0 sp=0x14000057fd0 pc=0x1007fc324
created by runtime.gcBgMarkStartWorkers
	/opt/homebrew/Cellar/go/1.20.6/libexec/src/runtime/mgc.go:1199 +0x28

goroutine 22 [GC worker (idle)]:
runtime.gopark(0x4f3473d29e65d?, 0x1?, 0x9f?, 0x19?, 0x0?)
	/opt/homebrew/Cellar/go/1.20.6/libexec/src/runtime/proc.go:381 +0xe4 fp=0x14000050f40 sp=0x14000050f20 pc=0x1007ccaa4
runtime.gcBgMarkWorker()
	/opt/homebrew/Cellar/go/1.20.6/libexec/src/runtime/mgc.go:1275 +0xec fp=0x14000050fd0 sp=0x14000050f40 pc=0x1007b034c
runtime.goexit()
	/opt/homebrew/Cellar/go/1.20.6/libexec/src/runtime/asm_arm64.s:1172 +0x4 fp=0x14000050fd0 sp=0x14000050fd0 pc=0x1007fc324
created by runtime.gcBgMarkStartWorkers
	/opt/homebrew/Cellar/go/1.20.6/libexec/src/runtime/mgc.go:1199 +0x28

goroutine 15 [GC worker (idle)]:
runtime.gopark(0x1013673a0?, 0x3?, 0x2?, 0x4c?, 0x0?)
	/opt/homebrew/Cellar/go/1.20.6/libexec/src/runtime/proc.go:381 +0xe4 fp=0x1400047c740 sp=0x1400047c720 pc=0x1007ccaa4
runtime.gcBgMarkWorker()
	/opt/homebrew/Cellar/go/1.20.6/libexec/src/runtime/mgc.go:1275 +0xec fp=0x1400047c7d0 sp=0x1400047c740 pc=0x1007b034c
runtime.goexit()
	/opt/homebrew/Cellar/go/1.20.6/libexec/src/runtime/asm_arm64.s:1172 +0x4 fp=0x1400047c7d0 sp=0x1400047c7d0 pc=0x1007fc324
created by runtime.gcBgMarkStartWorkers
	/opt/homebrew/Cellar/go/1.20.6/libexec/src/runtime/mgc.go:1199 +0x28

goroutine 23 [GC worker (idle)]:
runtime.gopark(0x4f3472b8156b1?, 0x3?, 0x93?, 0x2d?, 0x0?)
	/opt/homebrew/Cellar/go/1.20.6/libexec/src/runtime/proc.go:381 +0xe4 fp=0x14000051740 sp=0x14000051720 pc=0x1007ccaa4
runtime.gcBgMarkWorker()
	/opt/homebrew/Cellar/go/1.20.6/libexec/src/runtime/mgc.go:1275 +0xec fp=0x140000517d0 sp=0x14000051740 pc=0x1007b034c
runtime.goexit()
	/opt/homebrew/Cellar/go/1.20.6/libexec/src/runtime/asm_arm64.s:1172 +0x4 fp=0x140000517d0 sp=0x140000517d0 pc=0x1007fc324
created by runtime.gcBgMarkStartWorkers
	/opt/homebrew/Cellar/go/1.20.6/libexec/src/runtime/mgc.go:1199 +0x28

goroutine 16 [GC worker (idle)]:
runtime.gopark(0x4f3474e2b3524?, 0x3?, 0xe3?, 0x7b?, 0x0?)
	/opt/homebrew/Cellar/go/1.20.6/libexec/src/runtime/proc.go:381 +0xe4 fp=0x1400047cf40 sp=0x1400047cf20 pc=0x1007ccaa4
runtime.gcBgMarkWorker()
	/opt/homebrew/Cellar/go/1.20.6/libexec/src/runtime/mgc.go:1275 +0xec fp=0x1400047cfd0 sp=0x1400047cf40 pc=0x1007b034c
runtime.goexit()
	/opt/homebrew/Cellar/go/1.20.6/libexec/src/runtime/asm_arm64.s:1172 +0x4 fp=0x1400047cfd0 sp=0x1400047cfd0 pc=0x1007fc324
created by runtime.gcBgMarkStartWorkers
	/opt/homebrew/Cellar/go/1.20.6/libexec/src/runtime/mgc.go:1199 +0x28

goroutine 56 [IO wait]:
runtime.gopark(0xffffffffffffffff?, 0xffffffffffffffff?, 0x23?, 0x0?, 0x10080e540?)
	/opt/homebrew/Cellar/go/1.20.6/libexec/src/runtime/proc.go:381 +0xe4 fp=0x14000063580 sp=0x14000063560 pc=0x1007ccaa4
runtime.netpollblock(0x0?, 0x0?, 0x0?)
	/opt/homebrew/Cellar/go/1.20.6/libexec/src/runtime/netpoll.go:527 +0x158 fp=0x140000635c0 sp=0x14000063580 pc=0x1007c6138
internal/poll.runtime_pollWait(0x1289ad838, 0x72)
	/opt/homebrew/Cellar/go/1.20.6/libexec/src/runtime/netpoll.go:306 +0xa0 fp=0x140000635f0 sp=0x140000635c0 pc=0x1007f61b0
internal/poll.(*pollDesc).wait(0x1400064c000?, 0x140001c4800?, 0x0)
	/opt/homebrew/Cellar/go/1.20.6/libexec/src/internal/poll/fd_poll_runtime.go:84 +0x28 fp=0x14000063620 sp=0x140000635f0 pc=0x10087a7e8
internal/poll.(*pollDesc).waitRead(...)
	/opt/homebrew/Cellar/go/1.20.6/libexec/src/internal/poll/fd_poll_runtime.go:89
internal/poll.(*FD).Read(0x1400064c000, {0x140001c4800, 0x1800, 0x1800})
	/opt/homebrew/Cellar/go/1.20.6/libexec/src/internal/poll/fd_unix.go:167 +0x200 fp=0x140000636c0 sp=0x14000063620 pc=0x10087bb50
net.(*netFD).Read(0x1400064c000, {0x140001c4800?, 0x14000063878?, 0x100000ece?})
	/opt/homebrew/Cellar/go/1.20.6/libexec/src/net/fd_posix.go:55 +0x28 fp=0x14000063710 sp=0x140000636c0 pc=0x1008bc5d8
net.(*conn).Read(0x140004ba028, {0x140001c4800?, 0x140000637c8?, 0x1007a2304?})
	/opt/homebrew/Cellar/go/1.20.6/libexec/src/net/net.go:183 +0x34 fp=0x14000063760 sp=0x14000063710 pc=0x1008cabe4
net.(*TCPConn).Read(0x140000637d8?, {0x140001c4800?, 0x1400000e828?, 0x18?})
	<autogenerated>:1 +0x2c fp=0x14000063790 sp=0x14000063760 pc=0x1008dd12c
crypto/tls.(*atLeastReader).Read(0x1400000e828, {0x140001c4800?, 0x1400000e828?, 0x0?})
	/opt/homebrew/Cellar/go/1.20.6/libexec/src/crypto/tls/conn.go:788 +0x40 fp=0x140000637e0 sp=0x14000063790 pc=0x10096f760
bytes.(*Buffer).ReadFrom(0x140004aa290, {0x100efd580, 0x1400000e828})
	/opt/homebrew/Cellar/go/1.20.6/libexec/src/bytes/buffer.go:202 +0x90 fp=0x14000063840 sp=0x140000637e0 pc=0x100831860
crypto/tls.(*Conn).readFromUntil(0x140004aa000, {0x128a27fc8?, 0x140004ba028}, 0x1009c421c?)
	/opt/homebrew/Cellar/go/1.20.6/libexec/src/crypto/tls/conn.go:810 +0xd4 fp=0x14000063880 sp=0x14000063840 pc=0x10096f954
crypto/tls.(*Conn).readRecordOrCCS(0x140004aa000, 0x0)
	/opt/homebrew/Cellar/go/1.20.6/libexec/src/crypto/tls/conn.go:617 +0xd8 fp=0x14000063bf0 sp=0x14000063880 pc=0x10096d7a8
crypto/tls.(*Conn).readRecord(...)
	/opt/homebrew/Cellar/go/1.20.6/libexec/src/crypto/tls/conn.go:583
crypto/tls.(*Conn).Read(0x140004aa000, {0x140000a1000, 0x1000, 0x1009e1418?})
	/opt/homebrew/Cellar/go/1.20.6/libexec/src/crypto/tls/conn.go:1316 +0x178 fp=0x14000063c60 sp=0x14000063bf0 pc=0x1009726f8
bufio.(*Reader).Read(0x140006bc900, {0x14000420580, 0x9, 0x10079bfbc?})
	/opt/homebrew/Cellar/go/1.20.6/libexec/src/bufio/bufio.go:237 +0x1e0 fp=0x14000063ca0 sp=0x14000063c60 pc=0x10083e7b0
io.ReadAtLeast({0x100efd3e0, 0x140006bc900}, {0x14000420580, 0x9, 0x9}, 0x9)
	/opt/homebrew/Cellar/go/1.20.6/libexec/src/io/io.go:332 +0xa0 fp=0x14000063cf0 sp=0x14000063ca0 pc=0x100827fa0
io.ReadFull(...)
	/opt/homebrew/Cellar/go/1.20.6/libexec/src/io/io.go:351
net/http.http2readFrameHeader({0x14000420580?, 0x9?, 0x14000063d98?}, {0x100efd3e0?, 0x140006bc900?})
	/opt/homebrew/Cellar/go/1.20.6/libexec/src/net/http/h2_bundle.go:1567 +0x58 fp=0x14000063d40 sp=0x14000063cf0 pc=0x1009d8548
net/http.(*http2Framer).ReadFrame(0x14000420540)
	/opt/homebrew/Cellar/go/1.20.6/libexec/src/net/http/h2_bundle.go:1831 +0x84 fp=0x14000063df0 sp=0x14000063d40 pc=0x1009d8d44
net/http.(*http2clientConnReadLoop).run(0x14000063f88)
	/opt/homebrew/Cellar/go/1.20.6/libexec/src/net/http/h2_bundle.go:9187 +0xfc fp=0x14000063f40 sp=0x14000063df0 pc=0x1009fa06c
net/http.(*http2ClientConn).readLoop(0x14000175080)
	/opt/homebrew/Cellar/go/1.20.6/libexec/src/net/http/h2_bundle.go:9082 +0x5c fp=0x14000063fb0 sp=0x14000063f40 pc=0x1009f952c
net/http.(*http2Transport).newClientConn.func1()
	/opt/homebrew/Cellar/go/1.20.6/libexec/src/net/http/h2_bundle.go:7779 +0x28 fp=0x14000063fd0 sp=0x14000063fb0 pc=0x1009f26b8
runtime.goexit()
	/opt/homebrew/Cellar/go/1.20.6/libexec/src/runtime/asm_arm64.s:1172 +0x4 fp=0x14000063fd0 sp=0x14000063fd0 pc=0x1007fc324
created by net/http.(*http2Transport).newClientConn
	/opt/homebrew/Cellar/go/1.20.6/libexec/src/net/http/h2_bundle.go:7779 +0xad0

goroutine 39 [IO wait]:
runtime.gopark(0xffffffffffffffff?, 0xffffffffffffffff?, 0x23?, 0x0?, 0x10080e540?)
	/opt/homebrew/Cellar/go/1.20.6/libexec/src/runtime/proc.go:381 +0xe4 fp=0x1400047ad40 sp=0x1400047ad20 pc=0x1007ccaa4
runtime.netpollblock(0x0?, 0x0?, 0x0?)
	/opt/homebrew/Cellar/go/1.20.6/libexec/src/runtime/netpoll.go:527 +0x158 fp=0x1400047ad80 sp=0x1400047ad40 pc=0x1007c6138
internal/poll.runtime_pollWait(0x1289ad928, 0x72)
	/opt/homebrew/Cellar/go/1.20.6/libexec/src/runtime/netpoll.go:306 +0xa0 fp=0x1400047adb0 sp=0x1400047ad80 pc=0x1007f61b0
internal/poll.(*pollDesc).wait(0x1400044a600?, 0x1400046e161?, 0x0)
	/opt/homebrew/Cellar/go/1.20.6/libexec/src/internal/poll/fd_poll_runtime.go:84 +0x28 fp=0x1400047ade0 sp=0x1400047adb0 pc=0x10087a7e8
internal/poll.(*pollDesc).waitRead(...)
	/opt/homebrew/Cellar/go/1.20.6/libexec/src/internal/poll/fd_poll_runtime.go:89
internal/poll.(*FD).Read(0x1400044a600, {0x1400046e161, 0x1, 0x1})
	/opt/homebrew/Cellar/go/1.20.6/libexec/src/internal/poll/fd_unix.go:167 +0x200 fp=0x1400047ae80 sp=0x1400047ade0 pc=0x10087bb50
net.(*netFD).Read(0x1400044a600, {0x1400046e161?, 0x0?, 0x0?})
	/opt/homebrew/Cellar/go/1.20.6/libexec/src/net/fd_posix.go:55 +0x28 fp=0x1400047aed0 sp=0x1400047ae80 pc=0x1008bc5d8
net.(*conn).Read(0x14000010d10, {0x1400046e161?, 0x0?, 0x0?})
	/opt/homebrew/Cellar/go/1.20.6/libexec/src/net/net.go:183 +0x34 fp=0x1400047af20 sp=0x1400047aed0 pc=0x1008cabe4
net.(*TCPConn).Read(0x0?, {0x1400046e161?, 0x0?, 0x0?})
	<autogenerated>:1 +0x2c fp=0x1400047af50 sp=0x1400047af20 pc=0x1008dd12c
net/http.(*connReader).backgroundRead(0x1400046e150)
	/opt/homebrew/Cellar/go/1.20.6/libexec/src/net/http/server.go:674 +0x44 fp=0x1400047afb0 sp=0x1400047af50 pc=0x100a0b454
net/http.(*connReader).startBackgroundRead.func2()
	/opt/homebrew/Cellar/go/1.20.6/libexec/src/net/http/server.go:670 +0x28 fp=0x1400047afd0 sp=0x1400047afb0 pc=0x100a0b378
runtime.goexit()
	/opt/homebrew/Cellar/go/1.20.6/libexec/src/runtime/asm_arm64.s:1172 +0x4 fp=0x1400047afd0 sp=0x1400047afd0 pc=0x1007fc324
created by net/http.(*connReader).startBackgroundRead
	/opt/homebrew/Cellar/go/1.20.6/libexec/src/net/http/server.go:670 +0xcc

goroutine 25 [IO wait]:
runtime.gopark(0xffffffffffffffff?, 0xffffffffffffffff?, 0x23?, 0x0?, 0x10080e540?)
	/opt/homebrew/Cellar/go/1.20.6/libexec/src/runtime/proc.go:381 +0xe4 fp=0x14000062580 sp=0x14000062560 pc=0x1007ccaa4
runtime.netpollblock(0x0?, 0x0?, 0x0?)
	/opt/homebrew/Cellar/go/1.20.6/libexec/src/runtime/netpoll.go:527 +0x158 fp=0x140000625c0 sp=0x14000062580 pc=0x1007c6138
internal/poll.runtime_pollWait(0x1289ad748, 0x72)
	/opt/homebrew/Cellar/go/1.20.6/libexec/src/runtime/netpoll.go:306 +0xa0 fp=0x140000625f0 sp=0x140000625c0 pc=0x1007f61b0
internal/poll.(*pollDesc).wait(0x14000480200?, 0x140002d0000?, 0x0)
	/opt/homebrew/Cellar/go/1.20.6/libexec/src/internal/poll/fd_poll_runtime.go:84 +0x28 fp=0x14000062620 sp=0x140000625f0 pc=0x10087a7e8
internal/poll.(*pollDesc).waitRead(...)
	/opt/homebrew/Cellar/go/1.20.6/libexec/src/internal/poll/fd_poll_runtime.go:89
internal/poll.(*FD).Read(0x14000480200, {0x140002d0000, 0xa000, 0xa000})
	/opt/homebrew/Cellar/go/1.20.6/libexec/src/internal/poll/fd_unix.go:167 +0x200 fp=0x140000626c0 sp=0x14000062620 pc=0x10087bb50
net.(*netFD).Read(0x14000480200, {0x140002d0000?, 0x14000062878?, 0x10096df7c?})
	/opt/homebrew/Cellar/go/1.20.6/libexec/src/net/fd_posix.go:55 +0x28 fp=0x14000062710 sp=0x140000626c0 pc=0x1008bc5d8
net.(*conn).Read(0x140004ba000, {0x140002d0000?, 0x100ce6ad4?, 0x5?})
	/opt/homebrew/Cellar/go/1.20.6/libexec/src/net/net.go:183 +0x34 fp=0x14000062760 sp=0x14000062710 pc=0x1008cabe4
net.(*TCPConn).Read(0x140000627d8?, {0x140002d0000?, 0x140006d00d8?, 0x18?})
	<autogenerated>:1 +0x2c fp=0x14000062790 sp=0x14000062760 pc=0x1008dd12c
crypto/tls.(*atLeastReader).Read(0x140006d00d8, {0x140002d0000?, 0x140006d00d8?, 0x0?})
	/opt/homebrew/Cellar/go/1.20.6/libexec/src/crypto/tls/conn.go:788 +0x40 fp=0x140000627e0 sp=0x14000062790 pc=0x10096f760
bytes.(*Buffer).ReadFrom(0x14000452290, {0x100efd580, 0x140006d00d8})
	/opt/homebrew/Cellar/go/1.20.6/libexec/src/bytes/buffer.go:202 +0x90 fp=0x14000062840 sp=0x140000627e0 pc=0x100831860
crypto/tls.(*Conn).readFromUntil(0x14000452000, {0x128a27fc8?, 0x140004ba000}, 0x7fffffffffffffff?)
	/opt/homebrew/Cellar/go/1.20.6/libexec/src/crypto/tls/conn.go:810 +0xd4 fp=0x14000062880 sp=0x14000062840 pc=0x10096f954
crypto/tls.(*Conn).readRecordOrCCS(0x14000452000, 0x0)
	/opt/homebrew/Cellar/go/1.20.6/libexec/src/crypto/tls/conn.go:617 +0xd8 fp=0x14000062bf0 sp=0x14000062880 pc=0x10096d7a8
crypto/tls.(*Conn).readRecord(...)
	/opt/homebrew/Cellar/go/1.20.6/libexec/src/crypto/tls/conn.go:583
crypto/tls.(*Conn).Read(0x14000452000, {0x140004df000, 0x1000, 0x140003e8180?})
	/opt/homebrew/Cellar/go/1.20.6/libexec/src/crypto/tls/conn.go:1316 +0x178 fp=0x14000062c60 sp=0x14000062bf0 pc=0x1009726f8
net/http.(*persistConn).Read(0x14000128d80, {0x140004df000?, 0x10079b930?, 0x1400049e780?})
	/opt/homebrew/Cellar/go/1.20.6/libexec/src/net/http/transport.go:1943 +0x50 fp=0x14000062cc0 sp=0x14000062c60 pc=0x100a27e60
bufio.(*Reader).fill(0x140004fc4e0)
	/opt/homebrew/Cellar/go/1.20.6/libexec/src/bufio/bufio.go:106 +0xfc fp=0x14000062d00 sp=0x14000062cc0 pc=0x10083e18c
bufio.(*Reader).Peek(0x140004fc4e0, 0x1)
	/opt/homebrew/Cellar/go/1.20.6/libexec/src/bufio/bufio.go:144 +0x60 fp=0x14000062d20 sp=0x14000062d00 pc=0x10083e300
net/http.(*persistConn).readLoop(0x14000128d80)
	/opt/homebrew/Cellar/go/1.20.6/libexec/src/net/http/transport.go:2107 +0x144 fp=0x14000062fb0 sp=0x14000062d20 pc=0x100a28d14
net/http.(*Transport).dialConn.func5()
	/opt/homebrew/Cellar/go/1.20.6/libexec/src/net/http/transport.go:1765 +0x28 fp=0x14000062fd0 sp=0x14000062fb0 pc=0x100a274b8
runtime.goexit()
	/opt/homebrew/Cellar/go/1.20.6/libexec/src/runtime/asm_arm64.s:1172 +0x4 fp=0x14000062fd0 sp=0x14000062fd0 pc=0x1007fc324
created by net/http.(*Transport).dialConn
	/opt/homebrew/Cellar/go/1.20.6/libexec/src/net/http/transport.go:1765 +0x11c8

r0      0x0
r1      0x0
r2      0x0
r3      0x0
r4      0x0
r5      0x171672c10
r6      0xa
r7      0x0
r8      0x6b684de7b1e616cc
r9      0x6b684de6c08ea6cc
r10     0x2
r11     0xfffffffd
r12     0x10000000000
r13     0x0
r14     0x0
r15     0x0
r16     0x148
r17     0x1f292cf20
r18     0x0
r19     0x6
r20     0x17168b000
r21     0x1a03
r22     0x17168b0e0
r23     0x8
r24     0x7
r25     0x8
r26     0x1ede07460
r27     0x100cd3094
r28     0x100df50c0
r29     0x171672bc0
lr      0x1929a3c28
sp      0x171672ba0
pc      0x19296c724
fault   0x19296c724      
Originally created by @chsasank on GitHub (Jul 19, 2023). Original GitHub issue: https://github.com/ollama/ollama/issues/118 ```[GIN] 2023/07/19 - 11:58:16 | 200 | 13m51s | 127.0.0.1 | POST "/api/pull" llama.cpp: loading model from /Users/sasank/.ollama/models/blobs/sha256:8daa9615cce30c259a9555b1cc250d461d1bc69980a274b44d7eda0be78076d8 llama_model_load_internal: format = ggjt v3 (latest) llama_model_load_internal: n_vocab = 32000 llama_model_load_internal: n_ctx = 2048 llama_model_load_internal: n_embd = 4096 llama_model_load_internal: n_mult = 256 llama_model_load_internal: n_head = 32 llama_model_load_internal: n_layer = 32 llama_model_load_internal: n_rot = 128 llama_model_load_internal: ftype = 2 (mostly Q4_0) llama_model_load_internal: n_ff = 11008 llama_model_load_internal: model size = 7B llama_model_load_internal: ggml ctx size = 0.08 MB llama_model_load_internal: mem required = 5407.72 MB (+ 1026.00 MB per state) llama_new_context_with_model: kv self size = 1024.00 MB ggml_metal_init: allocating ggml_metal_init: using MPS ggml_metal_init: loading '/Users/sasank/code/llama/ollama/ggml-metal.metal' ggml_metal_init: loaded kernel_add 0x12aa075a0 ggml_metal_init: loaded kernel_mul 0x12ab05ee0 ggml_metal_init: loaded kernel_mul_row 0x12ab06530 ggml_metal_init: loaded kernel_scale 0x12aa07de0 ggml_metal_init: loaded kernel_silu 0x12aa08300 ggml_metal_init: loaded kernel_relu 0x12ab06930 ggml_metal_init: loaded kernel_gelu 0x12ab06e50 ggml_metal_init: loaded kernel_soft_max 0x12ab076b0 ggml_metal_init: loaded kernel_diag_mask_inf 0x12ab07d30 ggml_metal_init: loaded kernel_get_rows_f16 0x12aa089e0 ggml_metal_init: loaded kernel_get_rows_q4_0 0x12aa091a0 ggml_metal_init: loaded kernel_get_rows_q4_1 0x12aa09b30 ggml_metal_init: loaded kernel_get_rows_q2_K 0x12ab082b0 ggml_metal_init: loaded kernel_get_rows_q3_K 0x12ab08a70 ggml_metal_init: loaded kernel_get_rows_q4_K 0x12aa0a0b0 ggml_metal_init: loaded kernel_get_rows_q5_K 0x12aa0a8b0 ggml_metal_init: loaded kernel_get_rows_q6_K 0x12aa0af50 ggml_metal_init: loaded kernel_rms_norm 0x12ab09140 ggml_metal_init: loaded kernel_norm 0x12ab09920 ggml_metal_init: loaded kernel_mul_mat_f16_f32 0x12aa0b9f0 ggml_metal_init: loaded kernel_mul_mat_q4_0_f32 0x12aa0be30 ggml_metal_init: loaded kernel_mul_mat_q4_1_f32 0x12aa0c530 ggml_metal_init: loaded kernel_mul_mat_q2_K_f32 0x12ab0a350 ggml_metal_init: loaded kernel_mul_mat_q3_K_f32 0x12ab0af40 ggml_metal_init: loaded kernel_mul_mat_q4_K_f32 0x12ab0b5c0 ggml_metal_init: loaded kernel_mul_mat_q5_K_f32 0x12aa0c930 ggml_metal_init: loaded kernel_mul_mat_q6_K_f32 0x12ab0bba0 ggml_metal_init: loaded kernel_rope 0x12ab0ca80 ggml_metal_init: loaded kernel_alibi_f32 0x12ab0d360 ggml_metal_init: loaded kernel_cpy_f32_f16 0x12ab0dc10 ggml_metal_init: loaded kernel_cpy_f32_f32 0x12ab0e4c0 ggml_metal_init: loaded kernel_cpy_f16_f16 0x12aa0d550 ggml_metal_init: recommendedMaxWorkingSetSize = 5461.34 MB ggml_metal_init: hasUnifiedMemory = true ggml_metal_init: maxTransferRate = built-in GPU llama_new_context_with_model: max tensor size = 70.31 MB ggml_metal_add_buffer: allocated 'data ' buffer, size = 3616.08 MB, ( 3616.47 / 5461.34) ggml_metal_add_buffer: allocated 'eval ' buffer, size = 768.00 MB, ( 4384.47 / 5461.34) ggml_metal_add_buffer: allocated 'kv ' buffer, size = 1026.00 MB, ( 5410.47 / 5461.34) ggml_metal_add_buffer: allocated 'scr0 ' buffer, size = 512.00 MB, ( 5922.47 / 5461.34), warning: current allocated size is greater than the recommended max working set size ggml_metal_add_buffer: allocated 'scr1 ' buffer, size = 512.00 MB, ( 6434.47 / 5461.34), warning: current allocated size is greater than the recommended max working set size ggml_metal_graph_compute: command buffer 0 failed with status 5 GGML_ASSERT: ggml-metal.m:1013: false SIGABRT: abort PC=0x19296c724 m=5 sigcode=0 signal arrived during cgo execution goroutine 6 [syscall]: runtime.cgocall(0x100c920c0, 0x140000bd298) /opt/homebrew/Cellar/go/1.20.6/libexec/src/runtime/cgocall.go:157 +0x54 fp=0x140000bd260 sp=0x140000bd220 pc=0x100799994 github.com/jmorganca/ollama/llama._Cfunc_llama_eval(0x144008a00, 0x14000486c88, 0x1, 0x0, 0x8) _cgo_gotypes.go:208 +0x38 fp=0x140000bd290 sp=0x140000bd260 pc=0x100c81e18 github.com/jmorganca/ollama/llama.New.func4(0x99?, {0x14000486c88, 0x1, 0x14000178540?}, {0xffffffffffffffff, 0x0, 0x800, 0x200, 0x1, 0x0, ...}) /Users/sasank/code/llama/ollama/llama/llama.go:141 +0x7c fp=0x140000bd2e0 sp=0x140000bd290 pc=0x100c82c2c github.com/jmorganca/ollama/llama.New({0x140007fc310, 0x6a}, {0xffffffffffffffff, 0x0, 0x800, 0x200, 0x1, 0x0, 0x0, 0x1, ...}) /Users/sasank/code/llama/ollama/llama/llama.go:141 +0x278 fp=0x140000bd4a0 sp=0x140000bd2e0 pc=0x100c829e8 github.com/jmorganca/ollama/server.generate(0x140000b4300) /Users/sasank/code/llama/ollama/server/routes.go:70 +0x700 fp=0x140000bd6e0 sp=0x140000bd4a0 pc=0x100c8d6b0 github.com/gin-gonic/gin.(*Context).Next(...) /Users/sasank/go/pkg/mod/github.com/gin-gonic/gin@v1.9.1/context.go:174 github.com/gin-gonic/gin.CustomRecoveryWithWriter.func1(0x140000b4300) /Users/sasank/go/pkg/mod/github.com/gin-gonic/gin@v1.9.1/recovery.go:102 +0x7c fp=0x140000bd730 sp=0x140000bd6e0 pc=0x100c7950c github.com/gin-gonic/gin.(*Context).Next(...) /Users/sasank/go/pkg/mod/github.com/gin-gonic/gin@v1.9.1/context.go:174 github.com/gin-gonic/gin.LoggerWithConfig.func1(0x140000b4300) /Users/sasank/go/pkg/mod/github.com/gin-gonic/gin@v1.9.1/logger.go:240 +0xac fp=0x140000bd8e0 sp=0x140000bd730 pc=0x100c7878c github.com/gin-gonic/gin.(*Context).Next(...) /Users/sasank/go/pkg/mod/github.com/gin-gonic/gin@v1.9.1/context.go:174 github.com/gin-gonic/gin.(*Engine).handleHTTPRequest(0x14000145ba0, 0x140000b4300) /Users/sasank/go/pkg/mod/github.com/gin-gonic/gin@v1.9.1/gin.go:620 +0x54c fp=0x140000bda70 sp=0x140000bd8e0 pc=0x100c7789c github.com/gin-gonic/gin.(*Engine).ServeHTTP(0x14000145ba0, {0x100f019c0?, 0x140004ee1c0}, 0x140000b4200) /Users/sasank/go/pkg/mod/github.com/gin-gonic/gin@v1.9.1/gin.go:576 +0x1d4 fp=0x140000bdab0 sp=0x140000bda70 pc=0x100c771a4 net/http.serverHandler.ServeHTTP({0x100effa38?}, {0x100f019c0, 0x140004ee1c0}, 0x140000b4200) /opt/homebrew/Cellar/go/1.20.6/libexec/src/net/http/server.go:2936 +0x2d8 fp=0x140000bdb60 sp=0x140000bdab0 pc=0x100a152a8 net/http.(*conn).serve(0x1400017a900, {0x100f02038, 0x1400046e060}) /opt/homebrew/Cellar/go/1.20.6/libexec/src/net/http/server.go:1995 +0x560 fp=0x140000bdfa0 sp=0x140000bdb60 pc=0x100a10fa0 net/http.(*Server).Serve.func3() /opt/homebrew/Cellar/go/1.20.6/libexec/src/net/http/server.go:3089 +0x30 fp=0x140000bdfd0 sp=0x140000bdfa0 pc=0x100a15ad0 runtime.goexit() /opt/homebrew/Cellar/go/1.20.6/libexec/src/runtime/asm_arm64.s:1172 +0x4 fp=0x140000bdfd0 sp=0x140000bdfd0 pc=0x1007fc324 created by net/http.(*Server).Serve /opt/homebrew/Cellar/go/1.20.6/libexec/src/net/http/server.go:3089 +0x520 goroutine 1 [IO wait, 14 minutes]: runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?) /opt/homebrew/Cellar/go/1.20.6/libexec/src/runtime/proc.go:381 +0xe4 fp=0x1400011f860 sp=0x1400011f840 pc=0x1007ccaa4 runtime.netpollblock(0x1400031f8f8?, 0x87f1a4?, 0x1?) /opt/homebrew/Cellar/go/1.20.6/libexec/src/runtime/netpoll.go:527 +0x158 fp=0x1400011f8a0 sp=0x1400011f860 pc=0x1007c6138 internal/poll.runtime_pollWait(0x1289ada18, 0x72) /opt/homebrew/Cellar/go/1.20.6/libexec/src/runtime/netpoll.go:306 +0xa0 fp=0x1400011f8d0 sp=0x1400011f8a0 pc=0x1007f61b0 internal/poll.(*pollDesc).wait(0x1400044a580?, 0x0?, 0x0) /opt/homebrew/Cellar/go/1.20.6/libexec/src/internal/poll/fd_poll_runtime.go:84 +0x28 fp=0x1400011f900 sp=0x1400011f8d0 pc=0x10087a7e8 internal/poll.(*pollDesc).waitRead(...) /opt/homebrew/Cellar/go/1.20.6/libexec/src/internal/poll/fd_poll_runtime.go:89 internal/poll.(*FD).Accept(0x1400044a580) /opt/homebrew/Cellar/go/1.20.6/libexec/src/internal/poll/fd_unix.go:614 +0x250 fp=0x1400011f9b0 sp=0x1400011f900 pc=0x10087f290 net.(*netFD).accept(0x1400044a580) /opt/homebrew/Cellar/go/1.20.6/libexec/src/net/fd_unix.go:172 +0x28 fp=0x1400011fa70 sp=0x1400011f9b0 pc=0x1008be278 net.(*TCPListener).accept(0x1400000edb0) /opt/homebrew/Cellar/go/1.20.6/libexec/src/net/tcpsock_posix.go:148 +0x28 fp=0x1400011faa0 sp=0x1400011fa70 pc=0x1008d3878 net.(*TCPListener).Accept(0x1400000edb0) /opt/homebrew/Cellar/go/1.20.6/libexec/src/net/tcpsock.go:297 +0x2c fp=0x1400011fae0 sp=0x1400011faa0 pc=0x1008d29ec net/http.(*onceCloseListener).Accept(0x1400017a900?) <autogenerated>:1 +0x30 fp=0x1400011fb00 sp=0x1400011fae0 pc=0x100a39250 net/http.(*Server).Serve(0x14000366ff0, {0x100f017b0, 0x1400000edb0}) /opt/homebrew/Cellar/go/1.20.6/libexec/src/net/http/server.go:3059 +0x304 fp=0x1400011fc30 sp=0x1400011fb00 pc=0x100a15774 github.com/jmorganca/ollama/server.Serve({0x100f017b0, 0x1400000edb0}) /Users/sasank/code/llama/ollama/server/routes.go:238 +0x250 fp=0x1400011fca0 sp=0x1400011fc30 pc=0x100c8f4e0 github.com/jmorganca/ollama/cmd.RunServer(0x14000419200?, {0x100ce1dcb?, 0x0?, 0x0?}) /Users/sasank/code/llama/ollama/cmd/cmd.go:272 +0x114 fp=0x1400011fd20 sp=0x1400011fca0 pc=0x100c91454 github.com/spf13/cobra.(*Command).execute(0x14000419200, {0x101365c48, 0x0, 0x0}) /Users/sasank/go/pkg/mod/github.com/spf13/cobra@v1.7.0/command.go:940 +0x5c8 fp=0x1400011fe60 sp=0x1400011fd20 pc=0x100aaf628 github.com/spf13/cobra.(*Command).ExecuteC(0x14000418900) /Users/sasank/go/pkg/mod/github.com/spf13/cobra@v1.7.0/command.go:1068 +0x35c fp=0x1400011ff20 sp=0x1400011fe60 pc=0x100aafd7c github.com/spf13/cobra.(*Command).Execute(...) /Users/sasank/go/pkg/mod/github.com/spf13/cobra@v1.7.0/command.go:992 github.com/spf13/cobra.(*Command).ExecuteContext(0x14000054768?, {0x100f01fc8?, 0x140000280b0?}) /Users/sasank/go/pkg/mod/github.com/spf13/cobra@v1.7.0/command.go:985 +0x50 fp=0x1400011ff40 sp=0x1400011ff20 pc=0x100aaf910 main.main() /Users/sasank/code/llama/ollama/main.go:10 +0x34 fp=0x1400011ff70 sp=0x1400011ff40 pc=0x100c91e94 runtime.main() /opt/homebrew/Cellar/go/1.20.6/libexec/src/runtime/proc.go:250 +0x248 fp=0x1400011ffd0 sp=0x1400011ff70 pc=0x1007cc678 runtime.goexit() /opt/homebrew/Cellar/go/1.20.6/libexec/src/runtime/asm_arm64.s:1172 +0x4 fp=0x1400011ffd0 sp=0x1400011ffd0 pc=0x1007fc324 goroutine 2 [force gc (idle), 14 minutes]: runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?) /opt/homebrew/Cellar/go/1.20.6/libexec/src/runtime/proc.go:381 +0xe4 fp=0x14000054fa0 sp=0x14000054f80 pc=0x1007ccaa4 runtime.goparkunlock(...) /opt/homebrew/Cellar/go/1.20.6/libexec/src/runtime/proc.go:387 runtime.forcegchelper() /opt/homebrew/Cellar/go/1.20.6/libexec/src/runtime/proc.go:305 +0xb8 fp=0x14000054fd0 sp=0x14000054fa0 pc=0x1007cc8e8 runtime.goexit() /opt/homebrew/Cellar/go/1.20.6/libexec/src/runtime/asm_arm64.s:1172 +0x4 fp=0x14000054fd0 sp=0x14000054fd0 pc=0x1007fc324 created by runtime.init.6 /opt/homebrew/Cellar/go/1.20.6/libexec/src/runtime/proc.go:293 +0x24 goroutine 3 [GC sweep wait]: runtime.gopark(0x1?, 0x0?, 0x0?, 0x0?, 0x0?) /opt/homebrew/Cellar/go/1.20.6/libexec/src/runtime/proc.go:381 +0xe4 fp=0x14000055760 sp=0x14000055740 pc=0x1007ccaa4 runtime.goparkunlock(...) /opt/homebrew/Cellar/go/1.20.6/libexec/src/runtime/proc.go:387 runtime.bgsweep(0x0?) /opt/homebrew/Cellar/go/1.20.6/libexec/src/runtime/mgcsweep.go:319 +0x110 fp=0x140000557b0 sp=0x14000055760 pc=0x1007b9960 runtime.gcenable.func1() /opt/homebrew/Cellar/go/1.20.6/libexec/src/runtime/mgc.go:178 +0x28 fp=0x140000557d0 sp=0x140000557b0 pc=0x1007ae408 runtime.goexit() /opt/homebrew/Cellar/go/1.20.6/libexec/src/runtime/asm_arm64.s:1172 +0x4 fp=0x140000557d0 sp=0x140000557d0 pc=0x1007fc324 created by runtime.gcenable /opt/homebrew/Cellar/go/1.20.6/libexec/src/runtime/mgc.go:178 +0x74 goroutine 4 [GC scavenge wait]: runtime.gopark(0x12b0f92?, 0x1291938?, 0x0?, 0x0?, 0x0?) /opt/homebrew/Cellar/go/1.20.6/libexec/src/runtime/proc.go:381 +0xe4 fp=0x14000055f50 sp=0x14000055f30 pc=0x1007ccaa4 runtime.goparkunlock(...) /opt/homebrew/Cellar/go/1.20.6/libexec/src/runtime/proc.go:387 runtime.(*scavengerState).park(0x1012aa960) /opt/homebrew/Cellar/go/1.20.6/libexec/src/runtime/mgcscavenge.go:400 +0x5c fp=0x14000055f80 sp=0x14000055f50 pc=0x1007b776c runtime.bgscavenge(0x0?) /opt/homebrew/Cellar/go/1.20.6/libexec/src/runtime/mgcscavenge.go:633 +0xac fp=0x14000055fb0 sp=0x14000055f80 pc=0x1007b7d4c runtime.gcenable.func2() /opt/homebrew/Cellar/go/1.20.6/libexec/src/runtime/mgc.go:179 +0x28 fp=0x14000055fd0 sp=0x14000055fb0 pc=0x1007ae3a8 runtime.goexit() /opt/homebrew/Cellar/go/1.20.6/libexec/src/runtime/asm_arm64.s:1172 +0x4 fp=0x14000055fd0 sp=0x14000055fd0 pc=0x1007fc324 created by runtime.gcenable /opt/homebrew/Cellar/go/1.20.6/libexec/src/runtime/mgc.go:179 +0xb8 goroutine 5 [finalizer wait, 12 minutes]: runtime.gopark(0x0?, 0x1400048a138?, 0x20?, 0x1?, 0x1000000010?) /opt/homebrew/Cellar/go/1.20.6/libexec/src/runtime/proc.go:381 +0xe4 fp=0x14000065d80 sp=0x14000065d60 pc=0x1007ccaa4 runtime.runfinq() /opt/homebrew/Cellar/go/1.20.6/libexec/src/runtime/mfinal.go:193 +0x10c fp=0x14000065fd0 sp=0x14000065d80 pc=0x1007ad49c runtime.goexit() /opt/homebrew/Cellar/go/1.20.6/libexec/src/runtime/asm_arm64.s:1172 +0x4 fp=0x14000065fd0 sp=0x14000065fd0 pc=0x1007fc324 created by runtime.createfing /opt/homebrew/Cellar/go/1.20.6/libexec/src/runtime/mfinal.go:163 +0x84 goroutine 26 [select]: runtime.gopark(0x1400051ff80?, 0x2?, 0xa0?, 0x61?, 0x1400051ff24?) /opt/homebrew/Cellar/go/1.20.6/libexec/src/runtime/proc.go:381 +0xe4 fp=0x1400051fdb0 sp=0x1400051fd90 pc=0x1007ccaa4 runtime.selectgo(0x1400051ff80, 0x1400051ff20, 0x14000282680?, 0x0, 0x0?, 0x1) /opt/homebrew/Cellar/go/1.20.6/libexec/src/runtime/select.go:327 +0x690 fp=0x1400051fed0 sp=0x1400051fdb0 pc=0x1007dd1a0 net/http.(*persistConn).writeLoop(0x14000128d80) /opt/homebrew/Cellar/go/1.20.6/libexec/src/net/http/transport.go:2410 +0x9c fp=0x1400051ffb0 sp=0x1400051fed0 pc=0x100a2a74c net/http.(*Transport).dialConn.func6() /opt/homebrew/Cellar/go/1.20.6/libexec/src/net/http/transport.go:1766 +0x28 fp=0x1400051ffd0 sp=0x1400051ffb0 pc=0x100a27458 runtime.goexit() /opt/homebrew/Cellar/go/1.20.6/libexec/src/runtime/asm_arm64.s:1172 +0x4 fp=0x1400051ffd0 sp=0x1400051ffd0 pc=0x1007fc324 created by net/http.(*Transport).dialConn /opt/homebrew/Cellar/go/1.20.6/libexec/src/net/http/transport.go:1766 +0x1214 goroutine 13 [GC worker (idle), 1 minutes]: runtime.gopark(0x4f330c0464e0f?, 0x1?, 0x27?, 0xdf?, 0x0?) /opt/homebrew/Cellar/go/1.20.6/libexec/src/runtime/proc.go:381 +0xe4 fp=0x14000056f40 sp=0x14000056f20 pc=0x1007ccaa4 runtime.gcBgMarkWorker() /opt/homebrew/Cellar/go/1.20.6/libexec/src/runtime/mgc.go:1275 +0xec fp=0x14000056fd0 sp=0x14000056f40 pc=0x1007b034c runtime.goexit() /opt/homebrew/Cellar/go/1.20.6/libexec/src/runtime/asm_arm64.s:1172 +0x4 fp=0x14000056fd0 sp=0x14000056fd0 pc=0x1007fc324 created by runtime.gcBgMarkStartWorkers /opt/homebrew/Cellar/go/1.20.6/libexec/src/runtime/mgc.go:1199 +0x28 goroutine 20 [GC worker (idle)]: runtime.gopark(0x1013673a0?, 0x1?, 0x16?, 0xeb?, 0x0?) /opt/homebrew/Cellar/go/1.20.6/libexec/src/runtime/proc.go:381 +0xe4 fp=0x14000057740 sp=0x14000057720 pc=0x1007ccaa4 runtime.gcBgMarkWorker() /opt/homebrew/Cellar/go/1.20.6/libexec/src/runtime/mgc.go:1275 +0xec fp=0x140000577d0 sp=0x14000057740 pc=0x1007b034c runtime.goexit() /opt/homebrew/Cellar/go/1.20.6/libexec/src/runtime/asm_arm64.s:1172 +0x4 fp=0x140000577d0 sp=0x140000577d0 pc=0x1007fc324 created by runtime.gcBgMarkStartWorkers /opt/homebrew/Cellar/go/1.20.6/libexec/src/runtime/mgc.go:1199 +0x28 goroutine 21 [GC worker (idle)]: runtime.gopark(0x4f347a631f1b8?, 0x3?, 0xc3?, 0x8e?, 0x0?) /opt/homebrew/Cellar/go/1.20.6/libexec/src/runtime/proc.go:381 +0xe4 fp=0x14000050740 sp=0x14000050720 pc=0x1007ccaa4 runtime.gcBgMarkWorker() /opt/homebrew/Cellar/go/1.20.6/libexec/src/runtime/mgc.go:1275 +0xec fp=0x140000507d0 sp=0x14000050740 pc=0x1007b034c runtime.goexit() /opt/homebrew/Cellar/go/1.20.6/libexec/src/runtime/asm_arm64.s:1172 +0x4 fp=0x140000507d0 sp=0x140000507d0 pc=0x1007fc324 created by runtime.gcBgMarkStartWorkers /opt/homebrew/Cellar/go/1.20.6/libexec/src/runtime/mgc.go:1199 +0x28 goroutine 14 [GC worker (idle)]: runtime.gopark(0x4f347a634141b?, 0x3?, 0x77?, 0xc?, 0x0?) /opt/homebrew/Cellar/go/1.20.6/libexec/src/runtime/proc.go:381 +0xe4 fp=0x14000057f40 sp=0x14000057f20 pc=0x1007ccaa4 runtime.gcBgMarkWorker() /opt/homebrew/Cellar/go/1.20.6/libexec/src/runtime/mgc.go:1275 +0xec fp=0x14000057fd0 sp=0x14000057f40 pc=0x1007b034c runtime.goexit() /opt/homebrew/Cellar/go/1.20.6/libexec/src/runtime/asm_arm64.s:1172 +0x4 fp=0x14000057fd0 sp=0x14000057fd0 pc=0x1007fc324 created by runtime.gcBgMarkStartWorkers /opt/homebrew/Cellar/go/1.20.6/libexec/src/runtime/mgc.go:1199 +0x28 goroutine 22 [GC worker (idle)]: runtime.gopark(0x4f3473d29e65d?, 0x1?, 0x9f?, 0x19?, 0x0?) /opt/homebrew/Cellar/go/1.20.6/libexec/src/runtime/proc.go:381 +0xe4 fp=0x14000050f40 sp=0x14000050f20 pc=0x1007ccaa4 runtime.gcBgMarkWorker() /opt/homebrew/Cellar/go/1.20.6/libexec/src/runtime/mgc.go:1275 +0xec fp=0x14000050fd0 sp=0x14000050f40 pc=0x1007b034c runtime.goexit() /opt/homebrew/Cellar/go/1.20.6/libexec/src/runtime/asm_arm64.s:1172 +0x4 fp=0x14000050fd0 sp=0x14000050fd0 pc=0x1007fc324 created by runtime.gcBgMarkStartWorkers /opt/homebrew/Cellar/go/1.20.6/libexec/src/runtime/mgc.go:1199 +0x28 goroutine 15 [GC worker (idle)]: runtime.gopark(0x1013673a0?, 0x3?, 0x2?, 0x4c?, 0x0?) /opt/homebrew/Cellar/go/1.20.6/libexec/src/runtime/proc.go:381 +0xe4 fp=0x1400047c740 sp=0x1400047c720 pc=0x1007ccaa4 runtime.gcBgMarkWorker() /opt/homebrew/Cellar/go/1.20.6/libexec/src/runtime/mgc.go:1275 +0xec fp=0x1400047c7d0 sp=0x1400047c740 pc=0x1007b034c runtime.goexit() /opt/homebrew/Cellar/go/1.20.6/libexec/src/runtime/asm_arm64.s:1172 +0x4 fp=0x1400047c7d0 sp=0x1400047c7d0 pc=0x1007fc324 created by runtime.gcBgMarkStartWorkers /opt/homebrew/Cellar/go/1.20.6/libexec/src/runtime/mgc.go:1199 +0x28 goroutine 23 [GC worker (idle)]: runtime.gopark(0x4f3472b8156b1?, 0x3?, 0x93?, 0x2d?, 0x0?) /opt/homebrew/Cellar/go/1.20.6/libexec/src/runtime/proc.go:381 +0xe4 fp=0x14000051740 sp=0x14000051720 pc=0x1007ccaa4 runtime.gcBgMarkWorker() /opt/homebrew/Cellar/go/1.20.6/libexec/src/runtime/mgc.go:1275 +0xec fp=0x140000517d0 sp=0x14000051740 pc=0x1007b034c runtime.goexit() /opt/homebrew/Cellar/go/1.20.6/libexec/src/runtime/asm_arm64.s:1172 +0x4 fp=0x140000517d0 sp=0x140000517d0 pc=0x1007fc324 created by runtime.gcBgMarkStartWorkers /opt/homebrew/Cellar/go/1.20.6/libexec/src/runtime/mgc.go:1199 +0x28 goroutine 16 [GC worker (idle)]: runtime.gopark(0x4f3474e2b3524?, 0x3?, 0xe3?, 0x7b?, 0x0?) /opt/homebrew/Cellar/go/1.20.6/libexec/src/runtime/proc.go:381 +0xe4 fp=0x1400047cf40 sp=0x1400047cf20 pc=0x1007ccaa4 runtime.gcBgMarkWorker() /opt/homebrew/Cellar/go/1.20.6/libexec/src/runtime/mgc.go:1275 +0xec fp=0x1400047cfd0 sp=0x1400047cf40 pc=0x1007b034c runtime.goexit() /opt/homebrew/Cellar/go/1.20.6/libexec/src/runtime/asm_arm64.s:1172 +0x4 fp=0x1400047cfd0 sp=0x1400047cfd0 pc=0x1007fc324 created by runtime.gcBgMarkStartWorkers /opt/homebrew/Cellar/go/1.20.6/libexec/src/runtime/mgc.go:1199 +0x28 goroutine 56 [IO wait]: runtime.gopark(0xffffffffffffffff?, 0xffffffffffffffff?, 0x23?, 0x0?, 0x10080e540?) /opt/homebrew/Cellar/go/1.20.6/libexec/src/runtime/proc.go:381 +0xe4 fp=0x14000063580 sp=0x14000063560 pc=0x1007ccaa4 runtime.netpollblock(0x0?, 0x0?, 0x0?) /opt/homebrew/Cellar/go/1.20.6/libexec/src/runtime/netpoll.go:527 +0x158 fp=0x140000635c0 sp=0x14000063580 pc=0x1007c6138 internal/poll.runtime_pollWait(0x1289ad838, 0x72) /opt/homebrew/Cellar/go/1.20.6/libexec/src/runtime/netpoll.go:306 +0xa0 fp=0x140000635f0 sp=0x140000635c0 pc=0x1007f61b0 internal/poll.(*pollDesc).wait(0x1400064c000?, 0x140001c4800?, 0x0) /opt/homebrew/Cellar/go/1.20.6/libexec/src/internal/poll/fd_poll_runtime.go:84 +0x28 fp=0x14000063620 sp=0x140000635f0 pc=0x10087a7e8 internal/poll.(*pollDesc).waitRead(...) /opt/homebrew/Cellar/go/1.20.6/libexec/src/internal/poll/fd_poll_runtime.go:89 internal/poll.(*FD).Read(0x1400064c000, {0x140001c4800, 0x1800, 0x1800}) /opt/homebrew/Cellar/go/1.20.6/libexec/src/internal/poll/fd_unix.go:167 +0x200 fp=0x140000636c0 sp=0x14000063620 pc=0x10087bb50 net.(*netFD).Read(0x1400064c000, {0x140001c4800?, 0x14000063878?, 0x100000ece?}) /opt/homebrew/Cellar/go/1.20.6/libexec/src/net/fd_posix.go:55 +0x28 fp=0x14000063710 sp=0x140000636c0 pc=0x1008bc5d8 net.(*conn).Read(0x140004ba028, {0x140001c4800?, 0x140000637c8?, 0x1007a2304?}) /opt/homebrew/Cellar/go/1.20.6/libexec/src/net/net.go:183 +0x34 fp=0x14000063760 sp=0x14000063710 pc=0x1008cabe4 net.(*TCPConn).Read(0x140000637d8?, {0x140001c4800?, 0x1400000e828?, 0x18?}) <autogenerated>:1 +0x2c fp=0x14000063790 sp=0x14000063760 pc=0x1008dd12c crypto/tls.(*atLeastReader).Read(0x1400000e828, {0x140001c4800?, 0x1400000e828?, 0x0?}) /opt/homebrew/Cellar/go/1.20.6/libexec/src/crypto/tls/conn.go:788 +0x40 fp=0x140000637e0 sp=0x14000063790 pc=0x10096f760 bytes.(*Buffer).ReadFrom(0x140004aa290, {0x100efd580, 0x1400000e828}) /opt/homebrew/Cellar/go/1.20.6/libexec/src/bytes/buffer.go:202 +0x90 fp=0x14000063840 sp=0x140000637e0 pc=0x100831860 crypto/tls.(*Conn).readFromUntil(0x140004aa000, {0x128a27fc8?, 0x140004ba028}, 0x1009c421c?) /opt/homebrew/Cellar/go/1.20.6/libexec/src/crypto/tls/conn.go:810 +0xd4 fp=0x14000063880 sp=0x14000063840 pc=0x10096f954 crypto/tls.(*Conn).readRecordOrCCS(0x140004aa000, 0x0) /opt/homebrew/Cellar/go/1.20.6/libexec/src/crypto/tls/conn.go:617 +0xd8 fp=0x14000063bf0 sp=0x14000063880 pc=0x10096d7a8 crypto/tls.(*Conn).readRecord(...) /opt/homebrew/Cellar/go/1.20.6/libexec/src/crypto/tls/conn.go:583 crypto/tls.(*Conn).Read(0x140004aa000, {0x140000a1000, 0x1000, 0x1009e1418?}) /opt/homebrew/Cellar/go/1.20.6/libexec/src/crypto/tls/conn.go:1316 +0x178 fp=0x14000063c60 sp=0x14000063bf0 pc=0x1009726f8 bufio.(*Reader).Read(0x140006bc900, {0x14000420580, 0x9, 0x10079bfbc?}) /opt/homebrew/Cellar/go/1.20.6/libexec/src/bufio/bufio.go:237 +0x1e0 fp=0x14000063ca0 sp=0x14000063c60 pc=0x10083e7b0 io.ReadAtLeast({0x100efd3e0, 0x140006bc900}, {0x14000420580, 0x9, 0x9}, 0x9) /opt/homebrew/Cellar/go/1.20.6/libexec/src/io/io.go:332 +0xa0 fp=0x14000063cf0 sp=0x14000063ca0 pc=0x100827fa0 io.ReadFull(...) /opt/homebrew/Cellar/go/1.20.6/libexec/src/io/io.go:351 net/http.http2readFrameHeader({0x14000420580?, 0x9?, 0x14000063d98?}, {0x100efd3e0?, 0x140006bc900?}) /opt/homebrew/Cellar/go/1.20.6/libexec/src/net/http/h2_bundle.go:1567 +0x58 fp=0x14000063d40 sp=0x14000063cf0 pc=0x1009d8548 net/http.(*http2Framer).ReadFrame(0x14000420540) /opt/homebrew/Cellar/go/1.20.6/libexec/src/net/http/h2_bundle.go:1831 +0x84 fp=0x14000063df0 sp=0x14000063d40 pc=0x1009d8d44 net/http.(*http2clientConnReadLoop).run(0x14000063f88) /opt/homebrew/Cellar/go/1.20.6/libexec/src/net/http/h2_bundle.go:9187 +0xfc fp=0x14000063f40 sp=0x14000063df0 pc=0x1009fa06c net/http.(*http2ClientConn).readLoop(0x14000175080) /opt/homebrew/Cellar/go/1.20.6/libexec/src/net/http/h2_bundle.go:9082 +0x5c fp=0x14000063fb0 sp=0x14000063f40 pc=0x1009f952c net/http.(*http2Transport).newClientConn.func1() /opt/homebrew/Cellar/go/1.20.6/libexec/src/net/http/h2_bundle.go:7779 +0x28 fp=0x14000063fd0 sp=0x14000063fb0 pc=0x1009f26b8 runtime.goexit() /opt/homebrew/Cellar/go/1.20.6/libexec/src/runtime/asm_arm64.s:1172 +0x4 fp=0x14000063fd0 sp=0x14000063fd0 pc=0x1007fc324 created by net/http.(*http2Transport).newClientConn /opt/homebrew/Cellar/go/1.20.6/libexec/src/net/http/h2_bundle.go:7779 +0xad0 goroutine 39 [IO wait]: runtime.gopark(0xffffffffffffffff?, 0xffffffffffffffff?, 0x23?, 0x0?, 0x10080e540?) /opt/homebrew/Cellar/go/1.20.6/libexec/src/runtime/proc.go:381 +0xe4 fp=0x1400047ad40 sp=0x1400047ad20 pc=0x1007ccaa4 runtime.netpollblock(0x0?, 0x0?, 0x0?) /opt/homebrew/Cellar/go/1.20.6/libexec/src/runtime/netpoll.go:527 +0x158 fp=0x1400047ad80 sp=0x1400047ad40 pc=0x1007c6138 internal/poll.runtime_pollWait(0x1289ad928, 0x72) /opt/homebrew/Cellar/go/1.20.6/libexec/src/runtime/netpoll.go:306 +0xa0 fp=0x1400047adb0 sp=0x1400047ad80 pc=0x1007f61b0 internal/poll.(*pollDesc).wait(0x1400044a600?, 0x1400046e161?, 0x0) /opt/homebrew/Cellar/go/1.20.6/libexec/src/internal/poll/fd_poll_runtime.go:84 +0x28 fp=0x1400047ade0 sp=0x1400047adb0 pc=0x10087a7e8 internal/poll.(*pollDesc).waitRead(...) /opt/homebrew/Cellar/go/1.20.6/libexec/src/internal/poll/fd_poll_runtime.go:89 internal/poll.(*FD).Read(0x1400044a600, {0x1400046e161, 0x1, 0x1}) /opt/homebrew/Cellar/go/1.20.6/libexec/src/internal/poll/fd_unix.go:167 +0x200 fp=0x1400047ae80 sp=0x1400047ade0 pc=0x10087bb50 net.(*netFD).Read(0x1400044a600, {0x1400046e161?, 0x0?, 0x0?}) /opt/homebrew/Cellar/go/1.20.6/libexec/src/net/fd_posix.go:55 +0x28 fp=0x1400047aed0 sp=0x1400047ae80 pc=0x1008bc5d8 net.(*conn).Read(0x14000010d10, {0x1400046e161?, 0x0?, 0x0?}) /opt/homebrew/Cellar/go/1.20.6/libexec/src/net/net.go:183 +0x34 fp=0x1400047af20 sp=0x1400047aed0 pc=0x1008cabe4 net.(*TCPConn).Read(0x0?, {0x1400046e161?, 0x0?, 0x0?}) <autogenerated>:1 +0x2c fp=0x1400047af50 sp=0x1400047af20 pc=0x1008dd12c net/http.(*connReader).backgroundRead(0x1400046e150) /opt/homebrew/Cellar/go/1.20.6/libexec/src/net/http/server.go:674 +0x44 fp=0x1400047afb0 sp=0x1400047af50 pc=0x100a0b454 net/http.(*connReader).startBackgroundRead.func2() /opt/homebrew/Cellar/go/1.20.6/libexec/src/net/http/server.go:670 +0x28 fp=0x1400047afd0 sp=0x1400047afb0 pc=0x100a0b378 runtime.goexit() /opt/homebrew/Cellar/go/1.20.6/libexec/src/runtime/asm_arm64.s:1172 +0x4 fp=0x1400047afd0 sp=0x1400047afd0 pc=0x1007fc324 created by net/http.(*connReader).startBackgroundRead /opt/homebrew/Cellar/go/1.20.6/libexec/src/net/http/server.go:670 +0xcc goroutine 25 [IO wait]: runtime.gopark(0xffffffffffffffff?, 0xffffffffffffffff?, 0x23?, 0x0?, 0x10080e540?) /opt/homebrew/Cellar/go/1.20.6/libexec/src/runtime/proc.go:381 +0xe4 fp=0x14000062580 sp=0x14000062560 pc=0x1007ccaa4 runtime.netpollblock(0x0?, 0x0?, 0x0?) /opt/homebrew/Cellar/go/1.20.6/libexec/src/runtime/netpoll.go:527 +0x158 fp=0x140000625c0 sp=0x14000062580 pc=0x1007c6138 internal/poll.runtime_pollWait(0x1289ad748, 0x72) /opt/homebrew/Cellar/go/1.20.6/libexec/src/runtime/netpoll.go:306 +0xa0 fp=0x140000625f0 sp=0x140000625c0 pc=0x1007f61b0 internal/poll.(*pollDesc).wait(0x14000480200?, 0x140002d0000?, 0x0) /opt/homebrew/Cellar/go/1.20.6/libexec/src/internal/poll/fd_poll_runtime.go:84 +0x28 fp=0x14000062620 sp=0x140000625f0 pc=0x10087a7e8 internal/poll.(*pollDesc).waitRead(...) /opt/homebrew/Cellar/go/1.20.6/libexec/src/internal/poll/fd_poll_runtime.go:89 internal/poll.(*FD).Read(0x14000480200, {0x140002d0000, 0xa000, 0xa000}) /opt/homebrew/Cellar/go/1.20.6/libexec/src/internal/poll/fd_unix.go:167 +0x200 fp=0x140000626c0 sp=0x14000062620 pc=0x10087bb50 net.(*netFD).Read(0x14000480200, {0x140002d0000?, 0x14000062878?, 0x10096df7c?}) /opt/homebrew/Cellar/go/1.20.6/libexec/src/net/fd_posix.go:55 +0x28 fp=0x14000062710 sp=0x140000626c0 pc=0x1008bc5d8 net.(*conn).Read(0x140004ba000, {0x140002d0000?, 0x100ce6ad4?, 0x5?}) /opt/homebrew/Cellar/go/1.20.6/libexec/src/net/net.go:183 +0x34 fp=0x14000062760 sp=0x14000062710 pc=0x1008cabe4 net.(*TCPConn).Read(0x140000627d8?, {0x140002d0000?, 0x140006d00d8?, 0x18?}) <autogenerated>:1 +0x2c fp=0x14000062790 sp=0x14000062760 pc=0x1008dd12c crypto/tls.(*atLeastReader).Read(0x140006d00d8, {0x140002d0000?, 0x140006d00d8?, 0x0?}) /opt/homebrew/Cellar/go/1.20.6/libexec/src/crypto/tls/conn.go:788 +0x40 fp=0x140000627e0 sp=0x14000062790 pc=0x10096f760 bytes.(*Buffer).ReadFrom(0x14000452290, {0x100efd580, 0x140006d00d8}) /opt/homebrew/Cellar/go/1.20.6/libexec/src/bytes/buffer.go:202 +0x90 fp=0x14000062840 sp=0x140000627e0 pc=0x100831860 crypto/tls.(*Conn).readFromUntil(0x14000452000, {0x128a27fc8?, 0x140004ba000}, 0x7fffffffffffffff?) /opt/homebrew/Cellar/go/1.20.6/libexec/src/crypto/tls/conn.go:810 +0xd4 fp=0x14000062880 sp=0x14000062840 pc=0x10096f954 crypto/tls.(*Conn).readRecordOrCCS(0x14000452000, 0x0) /opt/homebrew/Cellar/go/1.20.6/libexec/src/crypto/tls/conn.go:617 +0xd8 fp=0x14000062bf0 sp=0x14000062880 pc=0x10096d7a8 crypto/tls.(*Conn).readRecord(...) /opt/homebrew/Cellar/go/1.20.6/libexec/src/crypto/tls/conn.go:583 crypto/tls.(*Conn).Read(0x14000452000, {0x140004df000, 0x1000, 0x140003e8180?}) /opt/homebrew/Cellar/go/1.20.6/libexec/src/crypto/tls/conn.go:1316 +0x178 fp=0x14000062c60 sp=0x14000062bf0 pc=0x1009726f8 net/http.(*persistConn).Read(0x14000128d80, {0x140004df000?, 0x10079b930?, 0x1400049e780?}) /opt/homebrew/Cellar/go/1.20.6/libexec/src/net/http/transport.go:1943 +0x50 fp=0x14000062cc0 sp=0x14000062c60 pc=0x100a27e60 bufio.(*Reader).fill(0x140004fc4e0) /opt/homebrew/Cellar/go/1.20.6/libexec/src/bufio/bufio.go:106 +0xfc fp=0x14000062d00 sp=0x14000062cc0 pc=0x10083e18c bufio.(*Reader).Peek(0x140004fc4e0, 0x1) /opt/homebrew/Cellar/go/1.20.6/libexec/src/bufio/bufio.go:144 +0x60 fp=0x14000062d20 sp=0x14000062d00 pc=0x10083e300 net/http.(*persistConn).readLoop(0x14000128d80) /opt/homebrew/Cellar/go/1.20.6/libexec/src/net/http/transport.go:2107 +0x144 fp=0x14000062fb0 sp=0x14000062d20 pc=0x100a28d14 net/http.(*Transport).dialConn.func5() /opt/homebrew/Cellar/go/1.20.6/libexec/src/net/http/transport.go:1765 +0x28 fp=0x14000062fd0 sp=0x14000062fb0 pc=0x100a274b8 runtime.goexit() /opt/homebrew/Cellar/go/1.20.6/libexec/src/runtime/asm_arm64.s:1172 +0x4 fp=0x14000062fd0 sp=0x14000062fd0 pc=0x1007fc324 created by net/http.(*Transport).dialConn /opt/homebrew/Cellar/go/1.20.6/libexec/src/net/http/transport.go:1765 +0x11c8 r0 0x0 r1 0x0 r2 0x0 r3 0x0 r4 0x0 r5 0x171672c10 r6 0xa r7 0x0 r8 0x6b684de7b1e616cc r9 0x6b684de6c08ea6cc r10 0x2 r11 0xfffffffd r12 0x10000000000 r13 0x0 r14 0x0 r15 0x0 r16 0x148 r17 0x1f292cf20 r18 0x0 r19 0x6 r20 0x17168b000 r21 0x1a03 r22 0x17168b0e0 r23 0x8 r24 0x7 r25 0x8 r26 0x1ede07460 r27 0x100cd3094 r28 0x100df50c0 r29 0x171672bc0 lr 0x1929a3c28 sp 0x171672ba0 pc 0x19296c724 fault 0x19296c724 ```
GiteaMirror added the bug label 2026-04-27 22:58:41 -05:00
Author
Owner

@chsasank commented on GitHub (Jul 19, 2023):

Mostly because it's out of memory?

<!-- gh-comment-id:1641496359 --> @chsasank commented on GitHub (Jul 19, 2023): Mostly because it's out of memory?
Author
Owner

@mchiang0610 commented on GitHub (Jul 19, 2023):

Thanks @chsasank for submitting this. May I ask which model you were running? It does look like there isn't enough memory, and Ollama tried to allocate more memory

<!-- gh-comment-id:1641498941 --> @mchiang0610 commented on GitHub (Jul 19, 2023): Thanks @chsasank for submitting this. May I ask which model you were running? It does look like there isn't enough memory, and Ollama tried to allocate more memory
Author
Owner

@chsasank commented on GitHub (Jul 19, 2023):

Ran llama2

<!-- gh-comment-id:1641500163 --> @chsasank commented on GitHub (Jul 19, 2023): Ran llama2
Author
Owner

@chsasank commented on GitHub (Jul 19, 2023):

Orca worked fine though.

llama_model_load_internal: format     = ggjt v3 (latest)
llama_model_load_internal: n_vocab    = 32000
llama_model_load_internal: n_ctx      = 2048
llama_model_load_internal: n_embd     = 3200
llama_model_load_internal: n_mult     = 240
llama_model_load_internal: n_head     = 32
llama_model_load_internal: n_layer    = 26
llama_model_load_internal: n_rot      = 100
llama_model_load_internal: ftype      = 2 (mostly Q4_0)
llama_model_load_internal: n_ff       = 8640
llama_model_load_internal: model size = 3B
llama_model_load_internal: ggml ctx size =    0.06 MB
llama_model_load_internal: mem required  = 2862.72 MB (+  682.00 MB per state)
llama_new_context_with_model: kv self size  =  650.00 MB
ggml_metal_init: allocating
ggml_metal_init: using MPS
ggml_metal_init: loading '/Users/sasank/code/llama/ollama/ggml-metal.metal'
ggml_metal_init: loaded kernel_add                            0x157107260
ggml_metal_init: loaded kernel_mul                            0x157107900
ggml_metal_init: loaded kernel_mul_row                        0x157107f30
ggml_metal_init: loaded kernel_scale                          0x157108450
ggml_metal_init: loaded kernel_silu                           0x157108970
ggml_metal_init: loaded kernel_relu                           0x157108e90
ggml_metal_init: loaded kernel_gelu                           0x1571093b0
ggml_metal_init: loaded kernel_soft_max                       0x157109a60
ggml_metal_init: loaded kernel_diag_mask_inf                  0x15710a0c0
ggml_metal_init: loaded kernel_get_rows_f16                   0x15710a740
ggml_metal_init: loaded kernel_get_rows_q4_0                  0x15710adc0
ggml_metal_init: loaded kernel_get_rows_q4_1                  0x15710b5b0
ggml_metal_init: loaded kernel_get_rows_q2_K                  0x15710bc30
ggml_metal_init: loaded kernel_get_rows_q3_K                  0x15710c2b0
ggml_metal_init: loaded kernel_get_rows_q4_K                  0x15710c930
ggml_metal_init: loaded kernel_get_rows_q5_K                  0x15710cfb0
ggml_metal_init: loaded kernel_get_rows_q6_K                  0x155f04a60
ggml_metal_init: loaded kernel_rms_norm                       0x155f05310
ggml_metal_init: loaded kernel_norm                           0x155f059c0
ggml_metal_init: loaded kernel_mul_mat_f16_f32                0x155f064b0
ggml_metal_init: loaded kernel_mul_mat_q4_0_f32               0x15710d570
ggml_metal_init: loaded kernel_mul_mat_q4_1_f32               0x15710dc50
ggml_metal_init: loaded kernel_mul_mat_q2_K_f32               0x15710e330
ggml_metal_init: loaded kernel_mul_mat_q3_K_f32               0x15710ebb0
ggml_metal_init: loaded kernel_mul_mat_q4_K_f32               0x15710f290
ggml_metal_init: loaded kernel_mul_mat_q5_K_f32               0x15710f970
ggml_metal_init: loaded kernel_mul_mat_q6_K_f32               0x157110050
ggml_metal_init: loaded kernel_rope                           0x157110b40
ggml_metal_init: loaded kernel_alibi_f32                      0x157111400
ggml_metal_init: loaded kernel_cpy_f32_f16                    0x157111c90
ggml_metal_init: loaded kernel_cpy_f32_f32                    0x157112520
ggml_metal_init: loaded kernel_cpy_f16_f16                    0x157112db0
ggml_metal_init: recommendedMaxWorkingSetSize =  5461.34 MB
ggml_metal_init: hasUnifiedMemory             = true
ggml_metal_init: maxTransferRate              = built-in GPU
llama_new_context_with_model: max tensor size =    54.93 MB
ggml_metal_add_buffer: allocated 'data            ' buffer, size =  1839.12 MB, ( 1839.52 /  5461.34)
ggml_metal_add_buffer: allocated 'eval            ' buffer, size =   512.00 MB, ( 2351.52 /  5461.34)
ggml_metal_add_buffer: allocated 'kv              ' buffer, size =   652.00 MB, ( 3003.52 /  5461.34)
ggml_metal_add_buffer: allocated 'scr0            ' buffer, size =   256.00 MB, ( 3259.52 /  5461.34)
ggml_metal_add_buffer: allocated 'scr1            ' buffer, size =   256.00 MB, ( 3515.52 /  5461.34)

llama_print_timings:        load time =  5199.96 ms
llama_print_timings:      sample time =     6.94 ms /    31 runs   (    0.22 ms per token,  4465.57 tokens per second)
llama_print_timings: prompt eval time =  1579.53 ms /    39 tokens (   40.50 ms per token,    24.69 tokens per second)
llama_print_timings:        eval time =  1119.88 ms /    30 runs   (   37.33 ms per token,    26.79 tokens per second)
llama_print_timings:       total time =  2748.18 ms
ggml_metal_free: deallocating
[GIN] 2023/07/19 - 12:09:46 | 200 |  7.978695084s |       127.0.0.1 | POST     "/api/generate"
<!-- gh-comment-id:1641504324 --> @chsasank commented on GitHub (Jul 19, 2023): Orca worked fine though. ```llama.cpp: loading model from /Users/sasank/.ollama/models/blobs/sha256:e84705205f71dd55be7b24a778f248f0eda9999a125d313358c087e092d83148 llama_model_load_internal: format = ggjt v3 (latest) llama_model_load_internal: n_vocab = 32000 llama_model_load_internal: n_ctx = 2048 llama_model_load_internal: n_embd = 3200 llama_model_load_internal: n_mult = 240 llama_model_load_internal: n_head = 32 llama_model_load_internal: n_layer = 26 llama_model_load_internal: n_rot = 100 llama_model_load_internal: ftype = 2 (mostly Q4_0) llama_model_load_internal: n_ff = 8640 llama_model_load_internal: model size = 3B llama_model_load_internal: ggml ctx size = 0.06 MB llama_model_load_internal: mem required = 2862.72 MB (+ 682.00 MB per state) llama_new_context_with_model: kv self size = 650.00 MB ggml_metal_init: allocating ggml_metal_init: using MPS ggml_metal_init: loading '/Users/sasank/code/llama/ollama/ggml-metal.metal' ggml_metal_init: loaded kernel_add 0x157107260 ggml_metal_init: loaded kernel_mul 0x157107900 ggml_metal_init: loaded kernel_mul_row 0x157107f30 ggml_metal_init: loaded kernel_scale 0x157108450 ggml_metal_init: loaded kernel_silu 0x157108970 ggml_metal_init: loaded kernel_relu 0x157108e90 ggml_metal_init: loaded kernel_gelu 0x1571093b0 ggml_metal_init: loaded kernel_soft_max 0x157109a60 ggml_metal_init: loaded kernel_diag_mask_inf 0x15710a0c0 ggml_metal_init: loaded kernel_get_rows_f16 0x15710a740 ggml_metal_init: loaded kernel_get_rows_q4_0 0x15710adc0 ggml_metal_init: loaded kernel_get_rows_q4_1 0x15710b5b0 ggml_metal_init: loaded kernel_get_rows_q2_K 0x15710bc30 ggml_metal_init: loaded kernel_get_rows_q3_K 0x15710c2b0 ggml_metal_init: loaded kernel_get_rows_q4_K 0x15710c930 ggml_metal_init: loaded kernel_get_rows_q5_K 0x15710cfb0 ggml_metal_init: loaded kernel_get_rows_q6_K 0x155f04a60 ggml_metal_init: loaded kernel_rms_norm 0x155f05310 ggml_metal_init: loaded kernel_norm 0x155f059c0 ggml_metal_init: loaded kernel_mul_mat_f16_f32 0x155f064b0 ggml_metal_init: loaded kernel_mul_mat_q4_0_f32 0x15710d570 ggml_metal_init: loaded kernel_mul_mat_q4_1_f32 0x15710dc50 ggml_metal_init: loaded kernel_mul_mat_q2_K_f32 0x15710e330 ggml_metal_init: loaded kernel_mul_mat_q3_K_f32 0x15710ebb0 ggml_metal_init: loaded kernel_mul_mat_q4_K_f32 0x15710f290 ggml_metal_init: loaded kernel_mul_mat_q5_K_f32 0x15710f970 ggml_metal_init: loaded kernel_mul_mat_q6_K_f32 0x157110050 ggml_metal_init: loaded kernel_rope 0x157110b40 ggml_metal_init: loaded kernel_alibi_f32 0x157111400 ggml_metal_init: loaded kernel_cpy_f32_f16 0x157111c90 ggml_metal_init: loaded kernel_cpy_f32_f32 0x157112520 ggml_metal_init: loaded kernel_cpy_f16_f16 0x157112db0 ggml_metal_init: recommendedMaxWorkingSetSize = 5461.34 MB ggml_metal_init: hasUnifiedMemory = true ggml_metal_init: maxTransferRate = built-in GPU llama_new_context_with_model: max tensor size = 54.93 MB ggml_metal_add_buffer: allocated 'data ' buffer, size = 1839.12 MB, ( 1839.52 / 5461.34) ggml_metal_add_buffer: allocated 'eval ' buffer, size = 512.00 MB, ( 2351.52 / 5461.34) ggml_metal_add_buffer: allocated 'kv ' buffer, size = 652.00 MB, ( 3003.52 / 5461.34) ggml_metal_add_buffer: allocated 'scr0 ' buffer, size = 256.00 MB, ( 3259.52 / 5461.34) ggml_metal_add_buffer: allocated 'scr1 ' buffer, size = 256.00 MB, ( 3515.52 / 5461.34) llama_print_timings: load time = 5199.96 ms llama_print_timings: sample time = 6.94 ms / 31 runs ( 0.22 ms per token, 4465.57 tokens per second) llama_print_timings: prompt eval time = 1579.53 ms / 39 tokens ( 40.50 ms per token, 24.69 tokens per second) llama_print_timings: eval time = 1119.88 ms / 30 runs ( 37.33 ms per token, 26.79 tokens per second) llama_print_timings: total time = 2748.18 ms ggml_metal_free: deallocating [GIN] 2023/07/19 - 12:09:46 | 200 | 7.978695084s | 127.0.0.1 | POST "/api/generate" ```
Author
Owner

@vilaB commented on GitHub (Jul 23, 2023):

Same happening to me at M2 Air 8GB. Probably needs more ram.

./ollama serve
[GIN-debug] [WARNING] Creating an Engine instance with the Logger and Recovery middleware already attached.

[GIN-debug] [WARNING] Running in "debug" mode. Switch to "release" mode in production.
 - using env:   export GIN_MODE=release
 - using code:  gin.SetMode(gin.ReleaseMode)

[GIN-debug] GET    /                         --> github.com/jmorganca/ollama/server.Serve.func1 (4 handlers)
[GIN-debug] POST   /api/pull                 --> github.com/jmorganca/ollama/server.PullModelHandler (4 handlers)
[GIN-debug] POST   /api/generate             --> github.com/jmorganca/ollama/server.GenerateHandler (4 handlers)
[GIN-debug] POST   /api/create               --> github.com/jmorganca/ollama/server.CreateModelHandler (4 handlers)
[GIN-debug] POST   /api/push                 --> github.com/jmorganca/ollama/server.PushModelHandler (4 handlers)
[GIN-debug] GET    /api/tags                 --> github.com/jmorganca/ollama/server.ListModelsHandler (4 handlers)
[GIN-debug] DELETE /api/delete               --> github.com/jmorganca/ollama/server.DeleteModelHandler (4 handlers)
2023/07/23 13:59:25 routes.go:260: Listening on 127.0.0.1:11434
llama.cpp: loading model from /Users/gabriel/.ollama/models/blobs/sha256:8daa9615cce30c259a9555b1cc250d461d1bc69980a274b44d7eda0be78076d8
llama_model_load_internal: format     = ggjt v3 (latest)
llama_model_load_internal: n_vocab    = 32000
llama_model_load_internal: n_ctx      = 2048
llama_model_load_internal: n_embd     = 4096
llama_model_load_internal: n_mult     = 256
llama_model_load_internal: n_head     = 32
llama_model_load_internal: n_layer    = 32
llama_model_load_internal: n_rot      = 128
llama_model_load_internal: freq_base  = 10000.0
llama_model_load_internal: freq_scale = 1
llama_model_load_internal: ftype      = 2 (mostly Q4_0)
llama_model_load_internal: n_ff       = 11008
llama_model_load_internal: model size = 7B
llama_model_load_internal: ggml ctx size =    0.08 MB
llama_model_load_internal: mem required  = 5287.72 MB (+ 1026.00 MB per state)
llama_new_context_with_model: kv self size  = 1024.00 MB
ggml_metal_init: allocating
ggml_metal_init: using MPS
ggml_metal_init: loading '/Users/gabriel/Documents/Llama2/ollama/ggml-metal.metal'
ggml_metal_init: loaded kernel_add                            0x13c610890
ggml_metal_init: loaded kernel_mul                            0x13c611de0
ggml_metal_init: loaded kernel_mul_row                        0x13c613300
ggml_metal_init: loaded kernel_scale                          0x13c613600
ggml_metal_init: loaded kernel_silu                           0x13c613e00
ggml_metal_init: loaded kernel_relu                           0x13c612380
ggml_metal_init: loaded kernel_gelu                           0x13c6147d0
ggml_metal_init: loaded kernel_soft_max                       0x13c615750
ggml_metal_init: loaded kernel_diag_mask_inf                  0x13c616ab0
ggml_metal_init: loaded kernel_get_rows_f16                   0x13c616d10
ggml_metal_init: loaded kernel_get_rows_q4_0                  0x13c615fc0
ggml_metal_init: loaded kernel_get_rows_q4_1                  0x13c6176d0
ggml_metal_init: loaded kernel_get_rows_q2_K                  0x13c618a40
ggml_metal_init: loaded kernel_get_rows_q3_K                  0x13c617f20
ggml_metal_init: loaded kernel_get_rows_q4_K                  0x13c619180
ggml_metal_init: loaded kernel_get_rows_q5_K                  0x13c619ac0
ggml_metal_init: loaded kernel_get_rows_q6_K                  0x13c61a400
ggml_metal_init: loaded kernel_rms_norm                       0x13c61aee0
ggml_metal_init: loaded kernel_norm                           0x13c61b9b0
ggml_metal_init: loaded kernel_mul_mat_f16_f32                0x13c61cc80
ggml_metal_init: loaded kernel_mul_mat_q4_0_f32               0x13c61d680
ggml_metal_init: loaded kernel_mul_mat_q4_1_f32               0x13c61e080
ggml_metal_init: loaded kernel_mul_mat_q2_K_f32               0x13c61ea90
ggml_metal_init: loaded kernel_mul_mat_q3_K_f32               0x13c61f4b0
ggml_metal_init: loaded kernel_mul_mat_q4_K_f32               0x13c61ffd0
ggml_metal_init: loaded kernel_mul_mat_q5_K_f32               0x13c620ab0
ggml_metal_init: loaded kernel_mul_mat_q6_K_f32               0x13c6214a0
ggml_metal_init: loaded kernel_rope                           0x13c621c90
ggml_metal_init: loaded kernel_alibi_f32                      0x13c6229c0
ggml_metal_init: loaded kernel_cpy_f32_f16                    0x13c623820
ggml_metal_init: loaded kernel_cpy_f32_f32                    0x13c6243a0
ggml_metal_init: loaded kernel_cpy_f16_f16                    0x13c624f00
ggml_metal_init: recommendedMaxWorkingSetSize =  5461.34 MB
ggml_metal_init: hasUnifiedMemory             = true
ggml_metal_init: maxTransferRate              = built-in GPU
llama_new_context_with_model: max tensor size =    70.31 MB
ggml_metal_add_buffer: allocated 'data            ' buffer, size =  3616.08 MB, ( 3616.53 /  5461.34)
ggml_metal_add_buffer: allocated 'eval            ' buffer, size =   776.00 MB, ( 4392.53 /  5461.34)
ggml_metal_add_buffer: allocated 'kv              ' buffer, size =  1026.00 MB, ( 5418.53 /  5461.34)
ggml_metal_add_buffer: allocated 'scr0            ' buffer, size =   384.00 MB, ( 5802.53 /  5461.34), warning: current allocated size is greater than the recommended max working set size
ggml_metal_add_buffer: allocated 'scr1            ' buffer, size =   512.00 MB, ( 6314.53 /  5461.34), warning: current allocated size is greater than the recommended max working set size
ggml_metal_graph_compute: command buffer 0 failed with status 5
GGML_ASSERT: ggml-metal.m:1023: false
SIGABRT: abort
PC=0x198a84724 m=3 sigcode=0
signal arrived during cgo execution

goroutine 19 [syscall]:
runtime.cgocall(0x10286948c, 0x14000123278)
        /usr/local/go/src/runtime/cgocall.go:157 +0x54 fp=0x14000123240 sp=0x14000123200 pc=0x1023556a4
github.com/jmorganca/ollama/llama._Cfunc_llama_eval(0x13d01b000, 0x14000409ef8, 0x1, 0x0, 0x8)
        _cgo_gotypes.go:210 +0x38 fp=0x14000123270 sp=0x14000123240 pc=0x102856388
github.com/jmorganca/ollama/llama.New.func4(0x102a92f00?, {0x14000409ef8, 0x1, 0x0?}, {0xffffffffffffffff, 0x0, 0x800, 0x200, 0x1, 0x0, ...})
        /Users/gabriel/Documents/Llama2/ollama/llama/llama.go:141 +0x7c fp=0x140001232c0 sp=0x14000123270 pc=0x1028571ac
github.com/jmorganca/ollama/llama.New({0x14000224e00, 0x6b}, {0xffffffffffffffff, 0x0, 0x800, 0x200, 0x1, 0x0, 0x0, 0x1, ...})
        /Users/gabriel/Documents/Llama2/ollama/llama/llama.go:141 +0x288 fp=0x14000123480 sp=0x140001232c0 pc=0x102856f68
github.com/jmorganca/ollama/server.GenerateHandler(0x14000432500)
        /Users/gabriel/Documents/Llama2/ollama/server/routes.go:56 +0x5c0 fp=0x140001236e0 sp=0x14000123480 pc=0x102862c40
github.com/gin-gonic/gin.(*Context).Next(...)
        /Users/gabriel/go/pkg/mod/github.com/gin-gonic/gin@v1.9.1/context.go:174
github.com/gin-gonic/gin.CustomRecoveryWithWriter.func1(0x14000432500)
        /Users/gabriel/go/pkg/mod/github.com/gin-gonic/gin@v1.9.1/recovery.go:102 +0x7c fp=0x14000123730 sp=0x140001236e0 pc=0x10284b40c
github.com/gin-gonic/gin.(*Context).Next(...)
        /Users/gabriel/go/pkg/mod/github.com/gin-gonic/gin@v1.9.1/context.go:174
github.com/gin-gonic/gin.LoggerWithConfig.func1(0x14000432500)
        /Users/gabriel/go/pkg/mod/github.com/gin-gonic/gin@v1.9.1/logger.go:240 +0xac fp=0x140001238e0 sp=0x14000123730 pc=0x10284a68c
github.com/gin-gonic/gin.(*Context).Next(...)
        /Users/gabriel/go/pkg/mod/github.com/gin-gonic/gin@v1.9.1/context.go:174
github.com/gin-gonic/gin.(*Engine).handleHTTPRequest(0x140003e4d00, 0x14000432500)
        /Users/gabriel/go/pkg/mod/github.com/gin-gonic/gin@v1.9.1/gin.go:620 +0x54c fp=0x14000123a70 sp=0x140001238e0 pc=0x10284979c
github.com/gin-gonic/gin.(*Engine).ServeHTTP(0x140003e4d00, {0x102ae52a0?, 0x140003ed420}, 0x14000432400)
        /Users/gabriel/go/pkg/mod/github.com/gin-gonic/gin@v1.9.1/gin.go:576 +0x1d4 fp=0x14000123ab0 sp=0x14000123a70 pc=0x1028490a4
net/http.serverHandler.ServeHTTP({0x102ae3230?}, {0x102ae52a0, 0x140003ed420}, 0x14000432400)
        /usr/local/go/src/net/http/server.go:2936 +0x2d8 fp=0x14000123b60 sp=0x14000123ab0 pc=0x1025d2dd8
net/http.(*conn).serve(0x140001387e0, {0x102ae5918, 0x14000434240})
        /usr/local/go/src/net/http/server.go:1995 +0x560 fp=0x14000123fa0 sp=0x14000123b60 pc=0x1025cead0
net/http.(*Server).Serve.func3()
        /usr/local/go/src/net/http/server.go:3089 +0x30 fp=0x14000123fd0 sp=0x14000123fa0 pc=0x1025d3600
runtime.goexit()
        /usr/local/go/src/runtime/asm_arm64.s:1172 +0x4 fp=0x14000123fd0 sp=0x14000123fd0 pc=0x1023b8b24
created by net/http.(*Server).Serve
        /usr/local/go/src/net/http/server.go:3089 +0x520

goroutine 1 [IO wait]:
runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?)
        /usr/local/go/src/runtime/proc.go:381 +0xe4 fp=0x1400031f700 sp=0x1400031f6e0 pc=0x102388924
runtime.netpollblock(0x1400031f798?, 0x243c754?, 0x1?)
        /usr/local/go/src/runtime/netpoll.go:527 +0x158 fp=0x1400031f740 sp=0x1400031f700 pc=0x102381e48
internal/poll.runtime_pollWait(0x12a568b18, 0x72)
        /usr/local/go/src/runtime/netpoll.go:306 +0xa0 fp=0x1400031f770 sp=0x1400031f740 pc=0x1023b26f0
internal/poll.(*pollDesc).wait(0x14000412600?, 0x0?, 0x0)
        /usr/local/go/src/internal/poll/fd_poll_runtime.go:84 +0x28 fp=0x1400031f7a0 sp=0x1400031f770 pc=0x102437d98
internal/poll.(*pollDesc).waitRead(...)
        /usr/local/go/src/internal/poll/fd_poll_runtime.go:89
internal/poll.(*FD).Accept(0x14000412600)
        /usr/local/go/src/internal/poll/fd_unix.go:614 +0x250 fp=0x1400031f850 sp=0x1400031f7a0 pc=0x10243c840
net.(*netFD).accept(0x14000412600)
        /usr/local/go/src/net/fd_unix.go:172 +0x28 fp=0x1400031f910 sp=0x1400031f850 pc=0x10247bda8
net.(*TCPListener).accept(0x140000c6d38)
        /usr/local/go/src/net/tcpsock_posix.go:148 +0x28 fp=0x1400031f940 sp=0x1400031f910 pc=0x1024913a8
net.(*TCPListener).Accept(0x140000c6d38)
        /usr/local/go/src/net/tcpsock.go:297 +0x2c fp=0x1400031f980 sp=0x1400031f940 pc=0x10249051c
net/http.(*onceCloseListener).Accept(0x140001387e0?)
        <autogenerated>:1 +0x30 fp=0x1400031f9a0 sp=0x1400031f980 pc=0x1025f6d80
net/http.(*Server).Serve(0x14000338ff0, {0x102ae5090, 0x140000c6d38})
        /usr/local/go/src/net/http/server.go:3059 +0x304 fp=0x1400031fad0 sp=0x1400031f9a0 pc=0x1025d32a4
github.com/jmorganca/ollama/server.Serve({0x102ae5090, 0x140000c6d38})
        /Users/gabriel/Documents/Llama2/ollama/server/routes.go:265 +0x4e0 fp=0x1400031fca0 sp=0x1400031fad0 pc=0x102864e40
github.com/jmorganca/ollama/cmd.RunServer(0x140003c7200?, {0x1028ba248?, 0x0?, 0x0?})
        /Users/gabriel/Documents/Llama2/ollama/cmd/cmd.go:406 +0x114 fp=0x1400031fd20 sp=0x1400031fca0 pc=0x1028685f4
github.com/spf13/cobra.(*Command).execute(0x140003c7200, {0x102f5e450, 0x0, 0x0})
        /Users/gabriel/go/pkg/mod/github.com/spf13/cobra@v1.7.0/command.go:940 +0x5c8 fp=0x1400031fe60 sp=0x1400031fd20 pc=0x102679528
github.com/spf13/cobra.(*Command).ExecuteC(0x140003c6900)
        /Users/gabriel/go/pkg/mod/github.com/spf13/cobra@v1.7.0/command.go:1068 +0x35c fp=0x1400031ff20 sp=0x1400031fe60 pc=0x102679c7c
github.com/spf13/cobra.(*Command).Execute(...)
        /Users/gabriel/go/pkg/mod/github.com/spf13/cobra@v1.7.0/command.go:992
github.com/spf13/cobra.(*Command).ExecuteContext(0x1400005c768?, {0x102ae58a8?, 0x140000b8010?})
        /Users/gabriel/go/pkg/mod/github.com/spf13/cobra@v1.7.0/command.go:985 +0x50 fp=0x1400031ff40 sp=0x1400031ff20 pc=0x102679810
main.main()
        /Users/gabriel/Documents/Llama2/ollama/main.go:10 +0x34 fp=0x1400031ff70 sp=0x1400031ff40 pc=0x102869254
runtime.main()
        /usr/local/go/src/runtime/proc.go:250 +0x248 fp=0x1400031ffd0 sp=0x1400031ff70 pc=0x1023884f8
runtime.goexit()
        /usr/local/go/src/runtime/asm_arm64.s:1172 +0x4 fp=0x1400031ffd0 sp=0x1400031ffd0 pc=0x1023b8b24

goroutine 2 [force gc (idle)]:
runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?)
        /usr/local/go/src/runtime/proc.go:381 +0xe4 fp=0x1400005cfa0 sp=0x1400005cf80 pc=0x102388924
runtime.goparkunlock(...)
        /usr/local/go/src/runtime/proc.go:387
runtime.forcegchelper()
        /usr/local/go/src/runtime/proc.go:305 +0xb8 fp=0x1400005cfd0 sp=0x1400005cfa0 pc=0x102388768
runtime.goexit()
        /usr/local/go/src/runtime/asm_arm64.s:1172 +0x4 fp=0x1400005cfd0 sp=0x1400005cfd0 pc=0x1023b8b24
created by runtime.init.6
        /usr/local/go/src/runtime/proc.go:293 +0x24

goroutine 3 [GC sweep wait]:
runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?)
        /usr/local/go/src/runtime/proc.go:381 +0xe4 fp=0x1400005d760 sp=0x1400005d740 pc=0x102388924
runtime.goparkunlock(...)
        /usr/local/go/src/runtime/proc.go:387
runtime.bgsweep(0x0?)
        /usr/local/go/src/runtime/mgcsweep.go:278 +0xa4 fp=0x1400005d7b0 sp=0x1400005d760 pc=0x102375604
runtime.gcenable.func1()
        /usr/local/go/src/runtime/mgc.go:178 +0x28 fp=0x1400005d7d0 sp=0x1400005d7b0 pc=0x10236a118
runtime.goexit()
        /usr/local/go/src/runtime/asm_arm64.s:1172 +0x4 fp=0x1400005d7d0 sp=0x1400005d7d0 pc=0x1023b8b24
created by runtime.gcenable
        /usr/local/go/src/runtime/mgc.go:178 +0x74

goroutine 4 [GC scavenge wait]:
runtime.gopark(0x14000038070?, 0x1029b8678?, 0x1?, 0x0?, 0x0?)
        /usr/local/go/src/runtime/proc.go:381 +0xe4 fp=0x1400005df50 sp=0x1400005df30 pc=0x102388924
runtime.goparkunlock(...)
        /usr/local/go/src/runtime/proc.go:387
runtime.(*scavengerState).park(0x102ea2f20)
        /usr/local/go/src/runtime/mgcscavenge.go:400 +0x5c fp=0x1400005df80 sp=0x1400005df50 pc=0x10237347c
runtime.bgscavenge(0x0?)
        /usr/local/go/src/runtime/mgcscavenge.go:628 +0x44 fp=0x1400005dfb0 sp=0x1400005df80 pc=0x1023739f4
runtime.gcenable.func2()
        /usr/local/go/src/runtime/mgc.go:179 +0x28 fp=0x1400005dfd0 sp=0x1400005dfb0 pc=0x10236a0b8
runtime.goexit()
        /usr/local/go/src/runtime/asm_arm64.s:1172 +0x4 fp=0x1400005dfd0 sp=0x1400005dfd0 pc=0x1023b8b24
created by runtime.gcenable
        /usr/local/go/src/runtime/mgc.go:179 +0xb8

goroutine 18 [finalizer wait]:
runtime.gopark(0x1a0?, 0x102ea3960?, 0x80?, 0x26?, 0x0?)
        /usr/local/go/src/runtime/proc.go:381 +0xe4 fp=0x1400005c580 sp=0x1400005c560 pc=0x102388924
runtime.runfinq()
        /usr/local/go/src/runtime/mfinal.go:193 +0x10c fp=0x1400005c7d0 sp=0x1400005c580 pc=0x1023691ac
runtime.goexit()
        /usr/local/go/src/runtime/asm_arm64.s:1172 +0x4 fp=0x1400005c7d0 sp=0x1400005c7d0 pc=0x1023b8b24
created by runtime.createfing
        /usr/local/go/src/runtime/mfinal.go:163 +0x84

goroutine 20 [IO wait]:
runtime.gopark(0xffffffffffffffff?, 0xffffffffffffffff?, 0x23?, 0x0?, 0x1023cb340?)
        /usr/local/go/src/runtime/proc.go:381 +0xe4 fp=0x14000058540 sp=0x14000058520 pc=0x102388924
runtime.netpollblock(0x0?, 0x0?, 0x0?)
        /usr/local/go/src/runtime/netpoll.go:527 +0x158 fp=0x14000058580 sp=0x14000058540 pc=0x102381e48
internal/poll.runtime_pollWait(0x12a568a28, 0x72)
        /usr/local/go/src/runtime/netpoll.go:306 +0xa0 fp=0x140000585b0 sp=0x14000058580 pc=0x1023b26f0
internal/poll.(*pollDesc).wait(0x14000412800?, 0x14000434341?, 0x0)
        /usr/local/go/src/internal/poll/fd_poll_runtime.go:84 +0x28 fp=0x140000585e0 sp=0x140000585b0 pc=0x102437d98
internal/poll.(*pollDesc).waitRead(...)
        /usr/local/go/src/internal/poll/fd_poll_runtime.go:89
internal/poll.(*FD).Read(0x14000412800, {0x14000434341, 0x1, 0x1})
        /usr/local/go/src/internal/poll/fd_unix.go:167 +0x200 fp=0x14000058680 sp=0x140000585e0 pc=0x102439100
net.(*netFD).Read(0x14000412800, {0x14000434341?, 0x0?, 0x0?})
        /usr/local/go/src/net/fd_posix.go:55 +0x28 fp=0x140000586d0 sp=0x14000058680 pc=0x10247a108
net.(*conn).Read(0x140000c8d08, {0x14000434341?, 0x0?, 0x0?})
        /usr/local/go/src/net/net.go:183 +0x34 fp=0x14000058720 sp=0x140000586d0 pc=0x102488714
net.(*TCPConn).Read(0x0?, {0x14000434341?, 0x0?, 0x0?})
        <autogenerated>:1 +0x2c fp=0x14000058750 sp=0x14000058720 pc=0x10249ac5c
net/http.(*connReader).backgroundRead(0x14000434330)
        /usr/local/go/src/net/http/server.go:674 +0x44 fp=0x140000587b0 sp=0x14000058750 pc=0x1025c8f84
net/http.(*connReader).startBackgroundRead.func2()
        /usr/local/go/src/net/http/server.go:670 +0x28 fp=0x140000587d0 sp=0x140000587b0 pc=0x1025c8ea8
runtime.goexit()
        /usr/local/go/src/runtime/asm_arm64.s:1172 +0x4 fp=0x140000587d0 sp=0x140000587d0 pc=0x1023b8b24
created by net/http.(*connReader).startBackgroundRead
        /usr/local/go/src/net/http/server.go:670 +0xcc

r0      0x0
r1      0x0
r2      0x0
r3      0x0
r4      0x0
r5      0x16eaa2c00
r6      0xa
r7      0x0
r8      0x58a7f816d970080f
r9      0x58a7f817b7dbb80f
r10     0x2
r11     0xfffffffd
r12     0x10000000000
r13     0x0
r14     0x0
r15     0x0
r16     0x148
r17     0x1f85b8f60
r18     0x0
r19     0x6
r20     0x16eabb000
r21     0x1903
r22     0x16eabb0e0
r23     0x8
r24     0x7
r25     0x8
r26     0x1f3a97720
r27     0x1028aafc0
r28     0x1029d10f0
r29     0x16eaa2bb0
lr      0x198abbc28
sp      0x16eaa2b90
pc      0x198a84724
fault   0x198a84724

I don´t think this is something that can get fixed. I built the exeutable using the readme instructions.

Maybe a Warning message could be good, since someone arriving at the repo and trying this model at fisrt could lost a lot of time trying to see what is the problem.

<!-- gh-comment-id:1646822832 --> @vilaB commented on GitHub (Jul 23, 2023): Same happening to me at M2 Air 8GB. Probably needs more ram. ``` ./ollama serve [GIN-debug] [WARNING] Creating an Engine instance with the Logger and Recovery middleware already attached. [GIN-debug] [WARNING] Running in "debug" mode. Switch to "release" mode in production. - using env: export GIN_MODE=release - using code: gin.SetMode(gin.ReleaseMode) [GIN-debug] GET / --> github.com/jmorganca/ollama/server.Serve.func1 (4 handlers) [GIN-debug] POST /api/pull --> github.com/jmorganca/ollama/server.PullModelHandler (4 handlers) [GIN-debug] POST /api/generate --> github.com/jmorganca/ollama/server.GenerateHandler (4 handlers) [GIN-debug] POST /api/create --> github.com/jmorganca/ollama/server.CreateModelHandler (4 handlers) [GIN-debug] POST /api/push --> github.com/jmorganca/ollama/server.PushModelHandler (4 handlers) [GIN-debug] GET /api/tags --> github.com/jmorganca/ollama/server.ListModelsHandler (4 handlers) [GIN-debug] DELETE /api/delete --> github.com/jmorganca/ollama/server.DeleteModelHandler (4 handlers) 2023/07/23 13:59:25 routes.go:260: Listening on 127.0.0.1:11434 llama.cpp: loading model from /Users/gabriel/.ollama/models/blobs/sha256:8daa9615cce30c259a9555b1cc250d461d1bc69980a274b44d7eda0be78076d8 llama_model_load_internal: format = ggjt v3 (latest) llama_model_load_internal: n_vocab = 32000 llama_model_load_internal: n_ctx = 2048 llama_model_load_internal: n_embd = 4096 llama_model_load_internal: n_mult = 256 llama_model_load_internal: n_head = 32 llama_model_load_internal: n_layer = 32 llama_model_load_internal: n_rot = 128 llama_model_load_internal: freq_base = 10000.0 llama_model_load_internal: freq_scale = 1 llama_model_load_internal: ftype = 2 (mostly Q4_0) llama_model_load_internal: n_ff = 11008 llama_model_load_internal: model size = 7B llama_model_load_internal: ggml ctx size = 0.08 MB llama_model_load_internal: mem required = 5287.72 MB (+ 1026.00 MB per state) llama_new_context_with_model: kv self size = 1024.00 MB ggml_metal_init: allocating ggml_metal_init: using MPS ggml_metal_init: loading '/Users/gabriel/Documents/Llama2/ollama/ggml-metal.metal' ggml_metal_init: loaded kernel_add 0x13c610890 ggml_metal_init: loaded kernel_mul 0x13c611de0 ggml_metal_init: loaded kernel_mul_row 0x13c613300 ggml_metal_init: loaded kernel_scale 0x13c613600 ggml_metal_init: loaded kernel_silu 0x13c613e00 ggml_metal_init: loaded kernel_relu 0x13c612380 ggml_metal_init: loaded kernel_gelu 0x13c6147d0 ggml_metal_init: loaded kernel_soft_max 0x13c615750 ggml_metal_init: loaded kernel_diag_mask_inf 0x13c616ab0 ggml_metal_init: loaded kernel_get_rows_f16 0x13c616d10 ggml_metal_init: loaded kernel_get_rows_q4_0 0x13c615fc0 ggml_metal_init: loaded kernel_get_rows_q4_1 0x13c6176d0 ggml_metal_init: loaded kernel_get_rows_q2_K 0x13c618a40 ggml_metal_init: loaded kernel_get_rows_q3_K 0x13c617f20 ggml_metal_init: loaded kernel_get_rows_q4_K 0x13c619180 ggml_metal_init: loaded kernel_get_rows_q5_K 0x13c619ac0 ggml_metal_init: loaded kernel_get_rows_q6_K 0x13c61a400 ggml_metal_init: loaded kernel_rms_norm 0x13c61aee0 ggml_metal_init: loaded kernel_norm 0x13c61b9b0 ggml_metal_init: loaded kernel_mul_mat_f16_f32 0x13c61cc80 ggml_metal_init: loaded kernel_mul_mat_q4_0_f32 0x13c61d680 ggml_metal_init: loaded kernel_mul_mat_q4_1_f32 0x13c61e080 ggml_metal_init: loaded kernel_mul_mat_q2_K_f32 0x13c61ea90 ggml_metal_init: loaded kernel_mul_mat_q3_K_f32 0x13c61f4b0 ggml_metal_init: loaded kernel_mul_mat_q4_K_f32 0x13c61ffd0 ggml_metal_init: loaded kernel_mul_mat_q5_K_f32 0x13c620ab0 ggml_metal_init: loaded kernel_mul_mat_q6_K_f32 0x13c6214a0 ggml_metal_init: loaded kernel_rope 0x13c621c90 ggml_metal_init: loaded kernel_alibi_f32 0x13c6229c0 ggml_metal_init: loaded kernel_cpy_f32_f16 0x13c623820 ggml_metal_init: loaded kernel_cpy_f32_f32 0x13c6243a0 ggml_metal_init: loaded kernel_cpy_f16_f16 0x13c624f00 ggml_metal_init: recommendedMaxWorkingSetSize = 5461.34 MB ggml_metal_init: hasUnifiedMemory = true ggml_metal_init: maxTransferRate = built-in GPU llama_new_context_with_model: max tensor size = 70.31 MB ggml_metal_add_buffer: allocated 'data ' buffer, size = 3616.08 MB, ( 3616.53 / 5461.34) ggml_metal_add_buffer: allocated 'eval ' buffer, size = 776.00 MB, ( 4392.53 / 5461.34) ggml_metal_add_buffer: allocated 'kv ' buffer, size = 1026.00 MB, ( 5418.53 / 5461.34) ggml_metal_add_buffer: allocated 'scr0 ' buffer, size = 384.00 MB, ( 5802.53 / 5461.34), warning: current allocated size is greater than the recommended max working set size ggml_metal_add_buffer: allocated 'scr1 ' buffer, size = 512.00 MB, ( 6314.53 / 5461.34), warning: current allocated size is greater than the recommended max working set size ggml_metal_graph_compute: command buffer 0 failed with status 5 GGML_ASSERT: ggml-metal.m:1023: false SIGABRT: abort PC=0x198a84724 m=3 sigcode=0 signal arrived during cgo execution goroutine 19 [syscall]: runtime.cgocall(0x10286948c, 0x14000123278) /usr/local/go/src/runtime/cgocall.go:157 +0x54 fp=0x14000123240 sp=0x14000123200 pc=0x1023556a4 github.com/jmorganca/ollama/llama._Cfunc_llama_eval(0x13d01b000, 0x14000409ef8, 0x1, 0x0, 0x8) _cgo_gotypes.go:210 +0x38 fp=0x14000123270 sp=0x14000123240 pc=0x102856388 github.com/jmorganca/ollama/llama.New.func4(0x102a92f00?, {0x14000409ef8, 0x1, 0x0?}, {0xffffffffffffffff, 0x0, 0x800, 0x200, 0x1, 0x0, ...}) /Users/gabriel/Documents/Llama2/ollama/llama/llama.go:141 +0x7c fp=0x140001232c0 sp=0x14000123270 pc=0x1028571ac github.com/jmorganca/ollama/llama.New({0x14000224e00, 0x6b}, {0xffffffffffffffff, 0x0, 0x800, 0x200, 0x1, 0x0, 0x0, 0x1, ...}) /Users/gabriel/Documents/Llama2/ollama/llama/llama.go:141 +0x288 fp=0x14000123480 sp=0x140001232c0 pc=0x102856f68 github.com/jmorganca/ollama/server.GenerateHandler(0x14000432500) /Users/gabriel/Documents/Llama2/ollama/server/routes.go:56 +0x5c0 fp=0x140001236e0 sp=0x14000123480 pc=0x102862c40 github.com/gin-gonic/gin.(*Context).Next(...) /Users/gabriel/go/pkg/mod/github.com/gin-gonic/gin@v1.9.1/context.go:174 github.com/gin-gonic/gin.CustomRecoveryWithWriter.func1(0x14000432500) /Users/gabriel/go/pkg/mod/github.com/gin-gonic/gin@v1.9.1/recovery.go:102 +0x7c fp=0x14000123730 sp=0x140001236e0 pc=0x10284b40c github.com/gin-gonic/gin.(*Context).Next(...) /Users/gabriel/go/pkg/mod/github.com/gin-gonic/gin@v1.9.1/context.go:174 github.com/gin-gonic/gin.LoggerWithConfig.func1(0x14000432500) /Users/gabriel/go/pkg/mod/github.com/gin-gonic/gin@v1.9.1/logger.go:240 +0xac fp=0x140001238e0 sp=0x14000123730 pc=0x10284a68c github.com/gin-gonic/gin.(*Context).Next(...) /Users/gabriel/go/pkg/mod/github.com/gin-gonic/gin@v1.9.1/context.go:174 github.com/gin-gonic/gin.(*Engine).handleHTTPRequest(0x140003e4d00, 0x14000432500) /Users/gabriel/go/pkg/mod/github.com/gin-gonic/gin@v1.9.1/gin.go:620 +0x54c fp=0x14000123a70 sp=0x140001238e0 pc=0x10284979c github.com/gin-gonic/gin.(*Engine).ServeHTTP(0x140003e4d00, {0x102ae52a0?, 0x140003ed420}, 0x14000432400) /Users/gabriel/go/pkg/mod/github.com/gin-gonic/gin@v1.9.1/gin.go:576 +0x1d4 fp=0x14000123ab0 sp=0x14000123a70 pc=0x1028490a4 net/http.serverHandler.ServeHTTP({0x102ae3230?}, {0x102ae52a0, 0x140003ed420}, 0x14000432400) /usr/local/go/src/net/http/server.go:2936 +0x2d8 fp=0x14000123b60 sp=0x14000123ab0 pc=0x1025d2dd8 net/http.(*conn).serve(0x140001387e0, {0x102ae5918, 0x14000434240}) /usr/local/go/src/net/http/server.go:1995 +0x560 fp=0x14000123fa0 sp=0x14000123b60 pc=0x1025cead0 net/http.(*Server).Serve.func3() /usr/local/go/src/net/http/server.go:3089 +0x30 fp=0x14000123fd0 sp=0x14000123fa0 pc=0x1025d3600 runtime.goexit() /usr/local/go/src/runtime/asm_arm64.s:1172 +0x4 fp=0x14000123fd0 sp=0x14000123fd0 pc=0x1023b8b24 created by net/http.(*Server).Serve /usr/local/go/src/net/http/server.go:3089 +0x520 goroutine 1 [IO wait]: runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?) /usr/local/go/src/runtime/proc.go:381 +0xe4 fp=0x1400031f700 sp=0x1400031f6e0 pc=0x102388924 runtime.netpollblock(0x1400031f798?, 0x243c754?, 0x1?) /usr/local/go/src/runtime/netpoll.go:527 +0x158 fp=0x1400031f740 sp=0x1400031f700 pc=0x102381e48 internal/poll.runtime_pollWait(0x12a568b18, 0x72) /usr/local/go/src/runtime/netpoll.go:306 +0xa0 fp=0x1400031f770 sp=0x1400031f740 pc=0x1023b26f0 internal/poll.(*pollDesc).wait(0x14000412600?, 0x0?, 0x0) /usr/local/go/src/internal/poll/fd_poll_runtime.go:84 +0x28 fp=0x1400031f7a0 sp=0x1400031f770 pc=0x102437d98 internal/poll.(*pollDesc).waitRead(...) /usr/local/go/src/internal/poll/fd_poll_runtime.go:89 internal/poll.(*FD).Accept(0x14000412600) /usr/local/go/src/internal/poll/fd_unix.go:614 +0x250 fp=0x1400031f850 sp=0x1400031f7a0 pc=0x10243c840 net.(*netFD).accept(0x14000412600) /usr/local/go/src/net/fd_unix.go:172 +0x28 fp=0x1400031f910 sp=0x1400031f850 pc=0x10247bda8 net.(*TCPListener).accept(0x140000c6d38) /usr/local/go/src/net/tcpsock_posix.go:148 +0x28 fp=0x1400031f940 sp=0x1400031f910 pc=0x1024913a8 net.(*TCPListener).Accept(0x140000c6d38) /usr/local/go/src/net/tcpsock.go:297 +0x2c fp=0x1400031f980 sp=0x1400031f940 pc=0x10249051c net/http.(*onceCloseListener).Accept(0x140001387e0?) <autogenerated>:1 +0x30 fp=0x1400031f9a0 sp=0x1400031f980 pc=0x1025f6d80 net/http.(*Server).Serve(0x14000338ff0, {0x102ae5090, 0x140000c6d38}) /usr/local/go/src/net/http/server.go:3059 +0x304 fp=0x1400031fad0 sp=0x1400031f9a0 pc=0x1025d32a4 github.com/jmorganca/ollama/server.Serve({0x102ae5090, 0x140000c6d38}) /Users/gabriel/Documents/Llama2/ollama/server/routes.go:265 +0x4e0 fp=0x1400031fca0 sp=0x1400031fad0 pc=0x102864e40 github.com/jmorganca/ollama/cmd.RunServer(0x140003c7200?, {0x1028ba248?, 0x0?, 0x0?}) /Users/gabriel/Documents/Llama2/ollama/cmd/cmd.go:406 +0x114 fp=0x1400031fd20 sp=0x1400031fca0 pc=0x1028685f4 github.com/spf13/cobra.(*Command).execute(0x140003c7200, {0x102f5e450, 0x0, 0x0}) /Users/gabriel/go/pkg/mod/github.com/spf13/cobra@v1.7.0/command.go:940 +0x5c8 fp=0x1400031fe60 sp=0x1400031fd20 pc=0x102679528 github.com/spf13/cobra.(*Command).ExecuteC(0x140003c6900) /Users/gabriel/go/pkg/mod/github.com/spf13/cobra@v1.7.0/command.go:1068 +0x35c fp=0x1400031ff20 sp=0x1400031fe60 pc=0x102679c7c github.com/spf13/cobra.(*Command).Execute(...) /Users/gabriel/go/pkg/mod/github.com/spf13/cobra@v1.7.0/command.go:992 github.com/spf13/cobra.(*Command).ExecuteContext(0x1400005c768?, {0x102ae58a8?, 0x140000b8010?}) /Users/gabriel/go/pkg/mod/github.com/spf13/cobra@v1.7.0/command.go:985 +0x50 fp=0x1400031ff40 sp=0x1400031ff20 pc=0x102679810 main.main() /Users/gabriel/Documents/Llama2/ollama/main.go:10 +0x34 fp=0x1400031ff70 sp=0x1400031ff40 pc=0x102869254 runtime.main() /usr/local/go/src/runtime/proc.go:250 +0x248 fp=0x1400031ffd0 sp=0x1400031ff70 pc=0x1023884f8 runtime.goexit() /usr/local/go/src/runtime/asm_arm64.s:1172 +0x4 fp=0x1400031ffd0 sp=0x1400031ffd0 pc=0x1023b8b24 goroutine 2 [force gc (idle)]: runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?) /usr/local/go/src/runtime/proc.go:381 +0xe4 fp=0x1400005cfa0 sp=0x1400005cf80 pc=0x102388924 runtime.goparkunlock(...) /usr/local/go/src/runtime/proc.go:387 runtime.forcegchelper() /usr/local/go/src/runtime/proc.go:305 +0xb8 fp=0x1400005cfd0 sp=0x1400005cfa0 pc=0x102388768 runtime.goexit() /usr/local/go/src/runtime/asm_arm64.s:1172 +0x4 fp=0x1400005cfd0 sp=0x1400005cfd0 pc=0x1023b8b24 created by runtime.init.6 /usr/local/go/src/runtime/proc.go:293 +0x24 goroutine 3 [GC sweep wait]: runtime.gopark(0x0?, 0x0?, 0x0?, 0x0?, 0x0?) /usr/local/go/src/runtime/proc.go:381 +0xe4 fp=0x1400005d760 sp=0x1400005d740 pc=0x102388924 runtime.goparkunlock(...) /usr/local/go/src/runtime/proc.go:387 runtime.bgsweep(0x0?) /usr/local/go/src/runtime/mgcsweep.go:278 +0xa4 fp=0x1400005d7b0 sp=0x1400005d760 pc=0x102375604 runtime.gcenable.func1() /usr/local/go/src/runtime/mgc.go:178 +0x28 fp=0x1400005d7d0 sp=0x1400005d7b0 pc=0x10236a118 runtime.goexit() /usr/local/go/src/runtime/asm_arm64.s:1172 +0x4 fp=0x1400005d7d0 sp=0x1400005d7d0 pc=0x1023b8b24 created by runtime.gcenable /usr/local/go/src/runtime/mgc.go:178 +0x74 goroutine 4 [GC scavenge wait]: runtime.gopark(0x14000038070?, 0x1029b8678?, 0x1?, 0x0?, 0x0?) /usr/local/go/src/runtime/proc.go:381 +0xe4 fp=0x1400005df50 sp=0x1400005df30 pc=0x102388924 runtime.goparkunlock(...) /usr/local/go/src/runtime/proc.go:387 runtime.(*scavengerState).park(0x102ea2f20) /usr/local/go/src/runtime/mgcscavenge.go:400 +0x5c fp=0x1400005df80 sp=0x1400005df50 pc=0x10237347c runtime.bgscavenge(0x0?) /usr/local/go/src/runtime/mgcscavenge.go:628 +0x44 fp=0x1400005dfb0 sp=0x1400005df80 pc=0x1023739f4 runtime.gcenable.func2() /usr/local/go/src/runtime/mgc.go:179 +0x28 fp=0x1400005dfd0 sp=0x1400005dfb0 pc=0x10236a0b8 runtime.goexit() /usr/local/go/src/runtime/asm_arm64.s:1172 +0x4 fp=0x1400005dfd0 sp=0x1400005dfd0 pc=0x1023b8b24 created by runtime.gcenable /usr/local/go/src/runtime/mgc.go:179 +0xb8 goroutine 18 [finalizer wait]: runtime.gopark(0x1a0?, 0x102ea3960?, 0x80?, 0x26?, 0x0?) /usr/local/go/src/runtime/proc.go:381 +0xe4 fp=0x1400005c580 sp=0x1400005c560 pc=0x102388924 runtime.runfinq() /usr/local/go/src/runtime/mfinal.go:193 +0x10c fp=0x1400005c7d0 sp=0x1400005c580 pc=0x1023691ac runtime.goexit() /usr/local/go/src/runtime/asm_arm64.s:1172 +0x4 fp=0x1400005c7d0 sp=0x1400005c7d0 pc=0x1023b8b24 created by runtime.createfing /usr/local/go/src/runtime/mfinal.go:163 +0x84 goroutine 20 [IO wait]: runtime.gopark(0xffffffffffffffff?, 0xffffffffffffffff?, 0x23?, 0x0?, 0x1023cb340?) /usr/local/go/src/runtime/proc.go:381 +0xe4 fp=0x14000058540 sp=0x14000058520 pc=0x102388924 runtime.netpollblock(0x0?, 0x0?, 0x0?) /usr/local/go/src/runtime/netpoll.go:527 +0x158 fp=0x14000058580 sp=0x14000058540 pc=0x102381e48 internal/poll.runtime_pollWait(0x12a568a28, 0x72) /usr/local/go/src/runtime/netpoll.go:306 +0xa0 fp=0x140000585b0 sp=0x14000058580 pc=0x1023b26f0 internal/poll.(*pollDesc).wait(0x14000412800?, 0x14000434341?, 0x0) /usr/local/go/src/internal/poll/fd_poll_runtime.go:84 +0x28 fp=0x140000585e0 sp=0x140000585b0 pc=0x102437d98 internal/poll.(*pollDesc).waitRead(...) /usr/local/go/src/internal/poll/fd_poll_runtime.go:89 internal/poll.(*FD).Read(0x14000412800, {0x14000434341, 0x1, 0x1}) /usr/local/go/src/internal/poll/fd_unix.go:167 +0x200 fp=0x14000058680 sp=0x140000585e0 pc=0x102439100 net.(*netFD).Read(0x14000412800, {0x14000434341?, 0x0?, 0x0?}) /usr/local/go/src/net/fd_posix.go:55 +0x28 fp=0x140000586d0 sp=0x14000058680 pc=0x10247a108 net.(*conn).Read(0x140000c8d08, {0x14000434341?, 0x0?, 0x0?}) /usr/local/go/src/net/net.go:183 +0x34 fp=0x14000058720 sp=0x140000586d0 pc=0x102488714 net.(*TCPConn).Read(0x0?, {0x14000434341?, 0x0?, 0x0?}) <autogenerated>:1 +0x2c fp=0x14000058750 sp=0x14000058720 pc=0x10249ac5c net/http.(*connReader).backgroundRead(0x14000434330) /usr/local/go/src/net/http/server.go:674 +0x44 fp=0x140000587b0 sp=0x14000058750 pc=0x1025c8f84 net/http.(*connReader).startBackgroundRead.func2() /usr/local/go/src/net/http/server.go:670 +0x28 fp=0x140000587d0 sp=0x140000587b0 pc=0x1025c8ea8 runtime.goexit() /usr/local/go/src/runtime/asm_arm64.s:1172 +0x4 fp=0x140000587d0 sp=0x140000587d0 pc=0x1023b8b24 created by net/http.(*connReader).startBackgroundRead /usr/local/go/src/net/http/server.go:670 +0xcc r0 0x0 r1 0x0 r2 0x0 r3 0x0 r4 0x0 r5 0x16eaa2c00 r6 0xa r7 0x0 r8 0x58a7f816d970080f r9 0x58a7f817b7dbb80f r10 0x2 r11 0xfffffffd r12 0x10000000000 r13 0x0 r14 0x0 r15 0x0 r16 0x148 r17 0x1f85b8f60 r18 0x0 r19 0x6 r20 0x16eabb000 r21 0x1903 r22 0x16eabb0e0 r23 0x8 r24 0x7 r25 0x8 r26 0x1f3a97720 r27 0x1028aafc0 r28 0x1029d10f0 r29 0x16eaa2bb0 lr 0x198abbc28 sp 0x16eaa2b90 pc 0x198a84724 fault 0x198a84724 ``` I don´t think this is something that can get fixed. I built the exeutable using the readme instructions. Maybe a Warning message could be good, since someone arriving at the repo and trying this model at fisrt could lost a lot of time trying to see what is the problem.
Author
Owner

@jmorganca commented on GitHub (Aug 23, 2023):

This should be fixed on main and the latest versions – note that running larger than 7GB models on will still mostly likely encounter an issue on 8GB Macs

<!-- gh-comment-id:1690375753 --> @jmorganca commented on GitHub (Aug 23, 2023): This should be fixed on `main` and the latest versions – note that running larger than 7GB models on will still mostly likely encounter an issue on 8GB Macs
Sign in to join this conversation.
1 Participants
Notifications
Due Date
No due date set.
Dependencies

No dependencies set.

Reference: github-starred/ollama#46549