[GH-ISSUE #11546] ollama serve freeze #7618

Closed
opened 2026-04-12 19:42:40 -05:00 by GiteaMirror · 3 comments

Originally created by @terminalskid on GitHub (Jul 27, 2025).
Original GitHub issue: https://github.com/ollama/ollama/issues/11546

┌──(kay㉿kali)-[~]
└─$ ollama serve
time=2025-07-27T08:42:57.311-05:00 level=INFO source=routes.go:1235 msg="server config" env="map[CUDA_VISIBLE_DEVICES: GPU_DEVICE_ORDINAL: HIP_VISIBLE_DEVICES: HSA_OVERRIDE_GFX_VERSION: HTTPS_PROXY: HTTP_PROXY: NO_PROXY: OLLAMA_CONTEXT_LENGTH:4096 OLLAMA_DEBUG:INFO OLLAMA_FLASH_ATTENTION:false OLLAMA_GPU_OVERHEAD:0 OLLAMA_HOST:http://127.0.0.1:11434 OLLAMA_INTEL_GPU:false OLLAMA_KEEP_ALIVE:5m0s OLLAMA_KV_CACHE_TYPE: OLLAMA_LLM_LIBRARY: OLLAMA_LOAD_TIMEOUT:5m0s OLLAMA_MAX_LOADED_MODELS:0 OLLAMA_MAX_QUEUE:512 OLLAMA_MODELS:/home/kay/.ollama/models OLLAMA_MULTIUSER_CACHE:false OLLAMA_NEW_ENGINE:false OLLAMA_NOHISTORY:false OLLAMA_NOPRUNE:false OLLAMA_NUM_PARALLEL:0 OLLAMA_ORIGINS:[http://localhost https://localhost http://localhost:* https://localhost:* http://127.0.0.1 https://127.0.0.1 http://127.0.0.1:* https://127.0.0.1:* http://0.0.0.0 https://0.0.0.0 http://0.0.0.0:* https://0.0.0.0:* app://* file://* tauri://* vscode-webview://* vscode-file://*] OLLAMA_SCHED_SPREAD:false ROCR_VISIBLE_DEVICES: http_proxy: https_proxy: no_proxy:]"
time=2025-07-27T08:42:57.312-05:00 level=INFO source=images.go:476 msg="total blobs: 0"
time=2025-07-27T08:42:57.312-05:00 level=INFO source=images.go:483 msg="total unused blobs removed: 0"
time=2025-07-27T08:42:57.312-05:00 level=INFO source=routes.go:1288 msg="Listening on 127.0.0.1:11434 (version 0.9.6)"
time=2025-07-27T08:42:57.313-05:00 level=INFO source=gpu.go:217 msg="looking for compatible GPUs"
time=2025-07-27T08:42:57.328-05:00 level=INFO source=gpu.go:377 msg="no compatible GPUs were discovered"
time=2025-07-27T08:42:57.328-05:00 level=INFO source=types.go:130 msg="inference compute" id=0 library=cpu variant="" compute="" driver=0.0 name="" total="3.8 GiB" available="2.0 GiB"

@rick-github commented on GitHub (Jul 27, 2025):

It's not frozen, it's waiting to be told which model to load. Open a new terminal and run `ollama run llama3.2`.

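A minimal sketch of the two-terminal workflow this comment describes (assuming a local install; `llama3.2` is just the model name used in the comment):

# Terminal 1: start the server; it stays in the foreground and keeps logging.
ollama serve

# Terminal 2: run the client against that server; it pulls the model on first use.
ollama run llama3.2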

@terminalskid commented on GitHub (Jul 27, 2025):

> It's not frozen, it's waiting to be told which model to load. Open a new terminal and run `ollama run llama3.2`.

Thank you, but it didn't work:

┌──(kay㉿kali)-[~]
└─$ ollama run llama3.2
Error: ollama server not responding - could not connect to ollama server, run 'ollama serve' to start it

┌──(kay㉿kali)-[~]
└─$
The title was just to explain the problem simply; I'm unsure what the actual issue is.
OS: Kali Linux

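A quick check that may help narrow this down (a suggestion, not from the original thread): with `ollama serve` still running in its own terminal, verify from a second terminal that something is actually listening on the default port.

# Should return a small JSON payload with the server version if it is reachable.
curl http://127.0.0.1:11434/api/version

# Shows which process, if any, owns port 11434.
ss -ltnp | grep 11434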

@rick-github commented on GitHub (Jul 27, 2025):

You have to run the server (`ollama serve`) and the client (`ollama run llama3.2`) at the same time.

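If keeping two terminals open is awkward, one alternative sketch (assuming a manual Linux install without the packaged systemd service) is to background the server in the same shell:

# Start the server in the background, logging to a file, then run the client.
nohup ollama serve > ~/ollama-serve.log 2>&1 &
ollama run llama3.2
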
Reference: github-starred/ollama#7618