[GH-ISSUE #2869] Ollama doesn't use Radeon RX 6600 #27511

Closed
opened 2026-04-22 04:54:21 -05:00 by GiteaMirror · 22 comments

Originally created by @nameiwillforget on GitHub (Mar 1, 2024).
Original GitHub issue: https://github.com/ollama/ollama/issues/2869

Originally assigned to: @dhiltgen on GitHub.

I'm using Arch Linux with the latest updates installed and ollama installed from its AUR package. When I use the Smaug model, it uses my CPU considerably but my GPU not at all:
![amdgpu](https://github.com/ollama/ollama/assets/81373487/be629472-a4eb-4f31-b8e9-726e2f9a8c21)
I put the output of `ollama serve` and of ollama running Smaug into files:
[ollama.txt](https://github.com/ollama/ollama/files/14466737/ollama.txt)
[smaug.txt](https://github.com/ollama/ollama/files/14466741/smaug.txt)
I installed CUDA because I thought for a moment it was needed, but I don't think that's the reason it doesn't work.
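For reference, a rough way to confirm the GPU really is idle while the model generates (a sketch, assuming the ROCm userspace tools such as `rocm-smi` are installed) is to watch VRAM use and GPU load in a second terminal:

```
# Assumes rocm-smi is installed; refresh every second while the model is running.
watch -n 1 rocm-smi --showuse --showmeminfo vram
```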


@dhiltgen commented on GitHub (Mar 1, 2024):

Can you run the server with `OLLAMA_DEBUG=1` set so we can see some more diagnostic information about why it wasn't able to initialize the GPU?


@nameiwillforget commented on GitHub (Mar 1, 2024):

Alright:
[ollama.txt](https://github.com/ollama/ollama/files/14466947/ollama.txt)


@dhiltgen commented on GitHub (Mar 1, 2024):

The attached log doesn't seem to have debug enabled. Try...

```
sudo systemctl stop ollama
OLLAMA_DEBUG=1 ollama serve
```
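If the server is normally managed by systemd, a persistent override is another option. This is a sketch based on the approach described in Ollama's FAQ; the unit name assumes the standard Linux install:

```
# Add the debug flag to the systemd unit instead of running ollama serve by hand.
sudo systemctl edit ollama.service
# In the editor that opens, add:
#   [Service]
#   Environment="OLLAMA_DEBUG=1"
sudo systemctl restart ollama
journalctl -u ollama -f   # follow the server log with debug output enabled
```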

@tannisroot commented on GitHub (Mar 2, 2024):

If this is the model you are trying to run:
https://ollama.com/sammcj/smaug
note that it is 44GB in size.
The RX 6600 has only 8GB of VRAM.
I've found that Ollama won't use the GPU (at least on Linux) if the model can't fit entirely into the GPU's VRAM; it falls back to the CPU instead.
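A quick way to sanity-check the fit before expecting GPU offload (a sketch, assuming the ROCm tools are installed) is to compare the model's on-disk size with the card's VRAM:

```
ollama list                    # shows each pulled model and its size on disk
rocm-smi --showmeminfo vram    # shows total and used VRAM on the Radeon card
```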


@nameiwillforget commented on GitHub (Mar 2, 2024):

Alright, here it is again:
[ollama.txt](https://github.com/ollama/ollama/files/14467373/ollama.txt)
Looks the same to me though.


@nameiwillforget commented on GitHub (Mar 2, 2024):

> If this is the model you are trying to run: https://ollama.com/sammcj/smaug note that it is 44GB in size. The RX 6600 has only 8GB of VRAM. I've found that Ollama won't use the GPU (at least on Linux) if the model can't fit entirely into the GPU's VRAM; it falls back to the CPU instead.

Oh, I see. Is this intended behavior?


@dhiltgen commented on GitHub (Mar 2, 2024):

Hmm... your output doesn't look like what I'm expecting to see as ollama starts up when we're doing initial GPU discovery. Here's what I see with 0.1.27 on a system with an RX 7600:

```
% OLLAMA_DEBUG=1 ollama serve
time=2024-03-02T17:03:56.859Z level=INFO source=images.go:710 msg="total blobs: 11"
time=2024-03-02T17:03:56.860Z level=INFO source=images.go:717 msg="total unused blobs removed: 0"
time=2024-03-02T17:03:56.861Z level=INFO source=routes.go:1019 msg="Listening on 127.0.0.1:11434 (version 0.1.27)"
time=2024-03-02T17:03:56.861Z level=INFO source=payload_common.go:107 msg="Extracting dynamic libraries..."
time=2024-03-02T17:03:59.554Z level=INFO source=payload_common.go:146 msg="Dynamic LLM libraries [cpu_avx2 cuda_v11 rocm_v6 rocm_v5 cpu cpu_avx]"
time=2024-03-02T17:03:59.554Z level=DEBUG source=payload_common.go:147 msg="Override detection logic by setting OLLAMA_LLM_LIBRARY"
time=2024-03-02T17:03:59.554Z level=INFO source=gpu.go:94 msg="Detecting GPU type"
time=2024-03-02T17:03:59.554Z level=INFO source=gpu.go:265 msg="Searching for GPU management library libnvidia-ml.so"
time=2024-03-02T17:03:59.554Z level=DEBUG source=gpu.go:283 msg="gpu management search paths: [/usr/local/cuda/lib64/libnvidia-ml.so* /usr/lib/x86_64-linux-gnu/nvidia/current/libnvidia-ml.so* /usr/lib/x86_64-linux-gnu/libnvidia-ml.so* /usr/lib/wsl/lib/libnvidia-ml.so* /usr/lib/wsl/drivers/*/libnvidia-ml.so* /opt/cuda/lib64/libnvidia-ml.so* /usr/lib*/libnvidia-ml.so* /usr/local/lib*/libnvidia-ml.so* /usr/lib/aarch64-linux-gnu/nvidia/current/libnvidia-ml.so* /usr/lib/aarch64-linux-gnu/libnvidia-ml.so* /opt/cuda/targets/x86_64-linux/lib/stubs/libnvidia-ml.so* /home/daniel/libnvidia-ml.so*]"
time=2024-03-02T17:03:59.555Z level=INFO source=gpu.go:311 msg="Discovered GPU libraries: []"
time=2024-03-02T17:03:59.555Z level=INFO source=gpu.go:265 msg="Searching for GPU management library librocm_smi64.so"
time=2024-03-02T17:03:59.555Z level=DEBUG source=gpu.go:283 msg="gpu management search paths: [/opt/rocm*/lib*/librocm_smi64.so* /home/daniel/librocm_smi64.so*]"
time=2024-03-02T17:03:59.555Z level=INFO source=gpu.go:311 msg="Discovered GPU libraries: [/opt/rocm/lib/librocm_smi64.so.6.0.60002 /opt/rocm-6.0.2/lib/librocm_smi64.so.6.0.60002]"
wiring rocm management library functions in /opt/rocm/lib/librocm_smi64.so.6.0.60002
dlsym: rsmi_init
dlsym: rsmi_shut_down
dlsym: rsmi_dev_memory_total_get
dlsym: rsmi_dev_memory_usage_get
dlsym: rsmi_version_get
dlsym: rsmi_num_monitor_devices
dlsym: rsmi_dev_id_get
dlsym: rsmi_dev_name_get
dlsym: rsmi_dev_brand_get
dlsym: rsmi_dev_vendor_name_get
dlsym: rsmi_dev_vram_vendor_get
dlsym: rsmi_dev_serial_number_get
dlsym: rsmi_dev_subsystem_name_get
dlsym: rsmi_dev_vbios_version_get
time=2024-03-02T17:03:59.558Z level=INFO source=gpu.go:109 msg="Radeon GPU detected"
time=2024-03-02T17:03:59.558Z level=INFO source=cpu_common.go:11 msg="CPU has AVX2"
time=2024-03-02T17:03:59.558Z level=INFO source=gpu.go:155 msg="AMD Driver: 6.3.6"
time=2024-03-02T17:03:59.558Z level=DEBUG source=amd.go:76 msg="malformed gfx_target_version 0"
discovered 1 ROCm GPU Devices
[0] ROCm device name: Navi 33 [Radeon RX 7700S/7600/7600S/7600M XT/PRO W7600]
[0] ROCm brand: Navi 33 [Radeon RX 7700S/7600/7600S/7600M XT/PRO W7600]
[0] ROCm vendor: Advanced Micro Devices, Inc. [AMD/ATI]
[0] ROCm VRAM vendor: samsung
rsmi_dev_serial_number_get failed: 2
[0] ROCm subsystem name: RX 7600 Challenger OC
[0] ROCm vbios version: 113-D7451000-0001
[0] ROCm totalMem 8573157376
[0] ROCm usedMem 27176960
time=2024-03-02T17:03:59.561Z level=DEBUG source=gpu.go:254 msg="rocm detected 1 devices with 7126M available memory"
```

That said, yes, if you're attempting to load a 44G model into an 8G GPU, then most of the work is being done by the CPU.
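For what it's worth, the number of layers offloaded can also be pinned explicitly through the API's `num_gpu` option, which makes it easier to see how much of a too-large model actually lands on the GPU. This is only an illustrative request; the model name and layer count are placeholders:

```
curl http://127.0.0.1:11434/api/generate -d '{
  "model": "llama2",
  "prompt": "Hello",
  "options": { "num_gpu": 10 }
}'
```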


@Jaspix commented on GitHub (Mar 5, 2024):

I'm running ollama from the official Arch package and facing the same issue. I got this log, and all I can see is that both of my GPUs are discovered; however, whenever I run a model, even small ones, it defaults to the CPU.

```
discovered 2 ROCm GPU Devices
[0] ROCm device name: Navi 22 [Radeon RX 6700/6700 XT/6750 XT / 6800M/6850M XT]
[0] ROCm brand: Navi 22 [Radeon RX 6700/6700 XT/6750 XT / 6800M/6850M XT]
[0] ROCm vendor: Advanced Micro Devices, Inc. [AMD/ATI]
[0] ROCm VRAM vendor: samsung
rsmi_dev_serial_number_get failed: 2
[0] ROCm subsystem name: Radeon RX 6800M
[0] ROCm vbios version: SWBRT79208.001
[0] ROCm totalMem 12868124672
[0] ROCm usedMem 16650240
[1] ROCm device name: Cezanne [Radeon Vega Series / Radeon Vega Mobile Series]
[1] ROCm brand: Cezanne [Radeon Vega Series / Radeon Vega Mobile Series]
[1] ROCm vendor: Advanced Micro Devices, Inc. [AMD/ATI]
rsmi_dev_vram_vendor_get failed: 2
rsmi_dev_serial_number_get failed: 2
[1] ROCm subsystem name: Radeon Vega 8
[1] ROCm vbios version: 113-CEZANNE-018
[1] ROCm totalMem 536870912
[1] ROCm usedMem 524304384
[1] ROCm integrated GPU
time=2024-03-04T20:33:33.692-05:00 level=INFO source=gpu.go:199 msg="ROCm integrated GPU detected - ROCR_VISIBLE_DEVICES=0"
time=2024-03-04T20:33:33.692-05:00 level=DEBUG source=gpu.go:254 msg="rocm detected 2 devices with 10208M available memory"
```

@tannisroot commented on GitHub (Mar 5, 2024):

@Jaspix this is just a guess, but could it be that it tries to use the integrated graphics first, runs out of memory, and falls back to the CPU?


@Jaspix commented on GitHub (Mar 5, 2024):

> @Jaspix this is just a guess, but could it be that it tries to use the integrated graphics first, runs out of memory, and falls back to the CPU?

Possibly, but that would mean the program is confusing the dedicated GPU with the integrated one, since `ROCR_VISIBLE_DEVICES=0` means it's using device 0, I suppose?
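One way to test that guess is to hide the integrated GPU from the ROCm runtime and see whether the behavior changes. A sketch using the standard `ROCR_VISIBLE_DEVICES` variable; the device index assumes the ordering in the log above:

```
rocminfo | grep -E "Marketing Name|gfx"    # confirm which agent is the Navi 22 card
sudo systemctl stop ollama
ROCR_VISIBLE_DEVICES=0 OLLAMA_DEBUG=1 ollama serve
```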


@totterman commented on GitHub (Mar 5, 2024):

I'm running ollama from the official Arch package and facing the same issue: the RX 7600 is not detected. Perhaps because the GPU libraries are not discovered?

```
$ OLLAMA_DEBUG=1 ollama serve
time=2024-03-05T10:18:23.527Z level=INFO source=images.go:710 msg="total blobs: 0"
time=2024-03-05T10:18:23.527Z level=INFO source=images.go:717 msg="total unused blobs removed: 0"
[GIN-debug] [WARNING] Creating an Engine instance with the Logger and Recovery middleware already attached.

[GIN-debug] [WARNING] Running in "debug" mode. Switch to "release" mode in production.
 - using env:	export GIN_MODE=release
 - using code:	gin.SetMode(gin.ReleaseMode)

[GIN-debug] POST   /api/pull                 --> github.com/jmorganca/ollama/server.PullModelHandler (5 handlers)
[GIN-debug] POST   /api/generate             --> github.com/jmorganca/ollama/server.GenerateHandler (5 handlers)
[GIN-debug] POST   /api/chat                 --> github.com/jmorganca/ollama/server.ChatHandler (5 handlers)
[GIN-debug] POST   /api/embeddings           --> github.com/jmorganca/ollama/server.EmbeddingHandler (5 handlers)
[GIN-debug] POST   /api/create               --> github.com/jmorganca/ollama/server.CreateModelHandler (5 handlers)
[GIN-debug] POST   /api/push                 --> github.com/jmorganca/ollama/server.PushModelHandler (5 handlers)
[GIN-debug] POST   /api/copy                 --> github.com/jmorganca/ollama/server.CopyModelHandler (5 handlers)
[GIN-debug] DELETE /api/delete               --> github.com/jmorganca/ollama/server.DeleteModelHandler (5 handlers)
[GIN-debug] POST   /api/show                 --> github.com/jmorganca/ollama/server.ShowModelHandler (5 handlers)
[GIN-debug] POST   /api/blobs/:digest        --> github.com/jmorganca/ollama/server.CreateBlobHandler (5 handlers)
[GIN-debug] HEAD   /api/blobs/:digest        --> github.com/jmorganca/ollama/server.HeadBlobHandler (5 handlers)
[GIN-debug] POST   /v1/chat/completions      --> github.com/jmorganca/ollama/server.ChatHandler (6 handlers)
[GIN-debug] GET    /                         --> github.com/jmorganca/ollama/server.(*Server).GenerateRoutes.func2 (5 handlers)
[GIN-debug] GET    /api/tags                 --> github.com/jmorganca/ollama/server.ListModelsHandler (5 handlers)
[GIN-debug] GET    /api/version              --> github.com/jmorganca/ollama/server.(*Server).GenerateRoutes.func3 (5 handlers)
[GIN-debug] HEAD   /                         --> github.com/jmorganca/ollama/server.(*Server).GenerateRoutes.func2 (5 handlers)
[GIN-debug] HEAD   /api/tags                 --> github.com/jmorganca/ollama/server.ListModelsHandler (5 handlers)
[GIN-debug] HEAD   /api/version              --> github.com/jmorganca/ollama/server.(*Server).GenerateRoutes.func3 (5 handlers)
time=2024-03-05T10:18:23.527Z level=INFO source=routes.go:1019 msg="Listening on 127.0.0.1:11434 (version 0.1.27)"
time=2024-03-05T10:18:23.528Z level=INFO source=payload_common.go:107 msg="Extracting dynamic libraries..."
time=2024-03-05T10:18:23.649Z level=INFO source=payload_common.go:146 msg="Dynamic LLM libraries [cpu_avx cpu_avx2 cpu]"
time=2024-03-05T10:18:23.649Z level=DEBUG source=payload_common.go:147 msg="Override detection logic by setting OLLAMA_LLM_LIBRARY"
time=2024-03-05T10:18:23.650Z level=INFO source=gpu.go:94 msg="Detecting GPU type"
time=2024-03-05T10:18:23.650Z level=INFO source=gpu.go:265 msg="Searching for GPU management library libnvidia-ml.so"
time=2024-03-05T10:18:23.650Z level=DEBUG source=gpu.go:283 msg="gpu management search paths: [/usr/local/cuda/lib64/libnvidia-ml.so* /usr/lib/x86_64-linux-gnu/nvidia/current/libnvidia-ml.so* /usr/lib/x86_64-linux-gnu/libnvidia-ml.so* /usr/lib/wsl/lib/libnvidia-ml.so* /usr/lib/wsl/drivers/*/libnvidia-ml.so* /opt/cuda/lib64/libnvidia-ml.so* /usr/lib*/libnvidia-ml.so* /usr/local/lib*/libnvidia-ml.so* /usr/lib/aarch64-linux-gnu/nvidia/current/libnvidia-ml.so* /usr/lib/aarch64-linux-gnu/libnvidia-ml.so* /opt/cuda/targets/x86_64-linux/lib/stubs/libnvidia-ml.so* /home/pbt/src/ai/poro/libnvidia-ml.so*]"
time=2024-03-05T10:18:23.656Z level=INFO source=gpu.go:311 msg="Discovered GPU libraries: []"
time=2024-03-05T10:18:23.656Z level=INFO source=gpu.go:265 msg="Searching for GPU management library librocm_smi64.so"
time=2024-03-05T10:18:23.656Z level=DEBUG source=gpu.go:283 msg="gpu management search paths: [/opt/rocm*/lib*/librocm_smi64.so* /home/pbt/src/ai/poro/librocm_smi64.so*]"
time=2024-03-05T10:18:23.656Z level=INFO source=gpu.go:311 msg="Discovered GPU libraries: []"
time=2024-03-05T10:18:23.656Z level=INFO source=cpu_common.go:11 msg="CPU has AVX2"
time=2024-03-05T10:18:23.656Z level=INFO source=routes.go:1042 msg="no GPU detected"
```

@jmorganca commented on GitHub (Mar 12, 2024):

Hi there, would it be possible to:

* Try the new 0.1.29 pre-release with AMD Preview: https://github.com/ollama/ollama/releases/tag/v0.1.29
* The RX 6600 isn't officially supported by AMD ROCm, but you can override this by setting `HSA_OVERRIDE_GFX_VERSION="10.3.0"` (you can see how to set this [here](https://github.com/ollama/ollama/blob/main/docs/faq.md#how-do-i-configure-ollama-server)).

This should provide you GPU acceleration on AMD. Let me know if that doesn't work for any reason!
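For a systemd-managed Linux install, the override from the FAQ link can be made persistent roughly like this (a sketch; the GFX version string is the one suggested above):

```
sudo systemctl edit ollama.service
# add under [Service]:
#   Environment="HSA_OVERRIDE_GFX_VERSION=10.3.0"
sudo systemctl restart ollama

# or, for a one-off foreground run:
HSA_OVERRIDE_GFX_VERSION="10.3.0" ollama serve
```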


@segg21 commented on GitHub (Mar 12, 2024):

> Hi there, would it be possible to:
>
> * Try the new 0.1.29 pre-release with AMD Preview: https://github.com/ollama/ollama/releases/tag/v0.1.29
> * The RX 6600 isn't officially supported by AMD ROCm but you can override this by setting `HSA_OVERRIDE_GFX_VERSION="10.3.0"` (you can see how to set this [here](https://github.com/ollama/ollama/blob/main/docs/faq.md#how-do-i-configure-ollama-server)).
>
> This should provide you GPU acceleration on AMD. Let me know if that doesn't work for any reason!

Setting the environment variable didn't work. :/


@dhiltgen commented on GitHub (Mar 13, 2024):

@totterman your logs indicate the ollama binary was compiled without GPU support: `Dynamic LLM libraries [cpu_avx cpu_avx2 cpu]`. It's missing CUDA and ROCm. The official builds we host on GitHub contain all CPU types and GPU types in a single release.

@segg21 I just fixed some defects with the iGPU detection logic which might be related to your problem. We'll be updating the binaries for 0.1.29 (still in pre-release) later today to pick up that fix. Please give that a try, and if you're still seeing problems, please share the server log so we can see what's going on.
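A quick way to confirm which backends a given binary was built with is to look for the "Dynamic LLM libraries" line in the startup log (a sketch, assuming the server runs under systemd):

```
journalctl -u ollama --no-pager | grep "Dynamic LLM libraries"
# An official release build lists rocm_v5/rocm_v6 and cuda_v11 alongside the cpu variants.
```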


@dhiltgen commented on GitHub (Mar 18, 2024):

@segg21 0.1.29 is now the latest official release https://github.com/ollama/ollama/releases
If you installed one of the earlier pre-release builds, please re-install.


@dhiltgen commented on GitHub (Mar 19, 2024):

@segg21 can you share your server log?


@segg21 commented on GitHub (Mar 19, 2024):

> @segg21 can you share your server log?

Sorry for the delay. I meant to provide it with my previous message and forgot.
I'm running into another issue after I uninstalled to attempt again.

I'm attempting to use the llama2 model, which I run with `ollama run llama2`. I haven't had this issue before, and I've restarted my PC. `netstat` also doesn't show the port being in use, but I'm now getting the error
`Error: Post "http://127.0.0.1:11434/api/chat": read tcp 127.0.0.1:50125->127.0.0.1:11434: wsarecv: An existing connection was forcibly closed by the remote host.`

Here's the [server.log](https://github.com/ollama/ollama/files/14654783/server.log)

Additionally, I noticed that it's trying to find the file `C:\Users\****\AppData\Local\Programs\Ollama\rocm\/rocblas/library/TensileLibrary.dat`, which doesn't seem to exist in this folder :/


@dhiltgen commented on GitHub (Mar 20, 2024):

Unfortunately the ROCm library does not yet support your GPU (gfx1032), and the override mechanism is only possible on Linux (see #3107).

The system should detect this and fallback to CPU mode. Is it possible you're running an older pre-release of 0.1.29? Can you uninstall and re-install the latest binaries from https://github.com/ollama/ollama/releases/tag/v0.1.29 just to make sure? If you still see a crash instead of falling back to CPU, that's a bug we want to fix.


@ftoppi commented on GitHub (Mar 20, 2024):

> Unfortunately the ROCm library does not yet support your GPU (gfx1032), and the override mechanism is only possible on Linux (see #3107).

Hi @dhiltgen, I'm trying to understand AMD GPU support. gfx1032 has "runtime support" according to the [AMD website](https://rocm.docs.amd.com/en/docs-5.7.1/release/windows_support.html). Does it only work with cards that have "HIP SDK" support?

Thanks for your work :)


@dhiltgen commented on GitHub (Mar 20, 2024):

@ftoppi yes, that's correct. The HIP SDK math libraries are what make LLMs work on GPUs.


@muhammedaligurdal commented on GitHub (Jul 26, 2024):

> @ftoppi yes, that's correct. The HIP SDK math libraries are what make LLMs work on GPUs.

My graphics card is an RX 6600. It saddens me that this graphics card isn't supported. I tried many methods and failed.


@diogovalada commented on GitHub (Sep 6, 2024):

I am trying on my laptop, with an AMD RX 6700S and Windows 11, but it also doesn't use my GPU, only the CPU.
Ollama 0.3.9

When I set the debug environment variable and run `ollama run qwen2-math`, I get:

```
[GIN-debug] [WARNING] Creating an Engine instance with the Logger and Recovery middleware already attached.

[GIN-debug] [WARNING] Running in "debug" mode. Switch to "release" mode in production.

  • using env: export GIN_MODE=release
  • using code: gin.SetMode(gin.ReleaseMode)
[GIN-debug] POST   /api/pull                 --> github.com/jmorganca/ollama/server.PullModelHandler (5 handlers)
[GIN-debug] POST   /api/generate             --> github.com/jmorganca/ollama/server.GenerateHandler (5 handlers)
[GIN-debug] POST   /api/chat                 --> github.com/jmorganca/ollama/server.ChatHandler (5 handlers)
[GIN-debug] POST   /api/embeddings           --> github.com/jmorganca/ollama/server.EmbeddingHandler (5 handlers)
[GIN-debug] POST   /api/create               --> github.com/jmorganca/ollama/server.CreateModelHandler (5 handlers)
[GIN-debug] POST   /api/push                 --> github.com/jmorganca/ollama/server.PushModelHandler (5 handlers)
[GIN-debug] POST   /api/copy                 --> github.com/jmorganca/ollama/server.CopyModelHandler (5 handlers)
[GIN-debug] DELETE /api/delete               --> github.com/jmorganca/ollama/server.DeleteModelHandler (5 handlers)
[GIN-debug] POST   /api/show                 --> github.com/jmorganca/ollama/server.ShowModelHandler (5 handlers)
[GIN-debug] POST   /api/blobs/:digest        --> github.com/jmorganca/ollama/server.CreateBlobHandler (5 handlers)
[GIN-debug] HEAD   /api/blobs/:digest        --> github.com/jmorganca/ollama/server.HeadBlobHandler (5 handlers)
[GIN-debug] POST   /v1/chat/completions      --> github.com/jmorganca/ollama/server.ChatHandler (6 handlers)
[GIN-debug] GET    /                         --> github.com/jmorganca/ollama/server.(*Server).GenerateRoutes.func2 (5 handlers)
[GIN-debug] GET    /api/tags                 --> github.com/jmorganca/ollama/server.ListModelsHandler (5 handlers)
[GIN-debug] GET    /api/version              --> github.com/jmorganca/ollama/server.(*Server).GenerateRoutes.func3 (5 handlers)
[GIN-debug] HEAD   /                         --> github.com/jmorganca/ollama/server.(*Server).GenerateRoutes.func2 (5 handlers)
[GIN-debug] HEAD   /api/tags                 --> github.com/jmorganca/ollama/server.ListModelsHandler (5 handlers)
[GIN-debug] HEAD   /api/version              --> github.com/jmorganca/ollama/server.(*Server).GenerateRoutes.func3 (5 handlers)
[GIN] 2024/03/02 - 01:40:27 | 200 |      27.997µs |       127.0.0.1 | HEAD     "/"
[GIN] 2024/03/02 - 01:40:27 | 200 |     407.172µs |       127.0.0.1 | POST     "/api/show"
[GIN] 2024/03/02 - 01:40:27 | 200 |     119.568µs |       127.0.0.1 | POST     "/api/show"
[GIN] 2024/03/02 - 01:41:01 | 200 |  33.44071703s |       127.0.0.1 | POST     "/api/chat"
[GIN] 2024/03/02 - 01:42:42 | 200 |         1m21s |       127.0.0.1 | POST     "/api/chat"
[GIN] 2024/03/02 - 01:42:42 | 400 |          1m4s |       127.0.0.1 | POST     "/api/chat"
```

Do I need to install the ROCm HIP SDK, or do the necessary resources already come bundled with Ollama?
Any ideas?
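On Windows, a rough equivalent of the Linux debug steps above is to quit the tray app and run the server in the foreground with debug logging, so the GPU discovery section shows up. A PowerShell sketch; the log location assumes the default install:

```
$env:OLLAMA_DEBUG = "1"
ollama serve
# Existing logs are under %LOCALAPPDATA%\Ollama (app.log, server.log).
```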
