[GH-ISSUE #15195] Parallel task managing #56240

Open
opened 2026-04-29 10:28:08 -05:00 by GiteaMirror · 1 comment
Owner

Originally created by @n4s3r on GitHub (Apr 1, 2026).
Original GitHub issue: https://github.com/ollama/ollama/issues/15195

What is the issue?

https://github.com/lfnovo/open-notebook/issues/711

I don't want parallelism in my setup, so I have this config but doesn't work:

powershell -NoExit -Command "$host.UI.RawUI.WindowTitle = 'Ollama LLMs';$env:OLLAMA_GPU='1';$env:OLLAMA_NUM_THREADS='10';$env:OMP_NUM_THREADS='10';$env:OLLAMA_GPU_MEMORY_FRACTION='1';$env:OLLAMA_MAX_LOADED_MODELS='1';$env:OLLAMA_NUM_PARALLEL='1';$env:OLLAMA_KV_CACHE_TYPE='q8_0';$env:OLLAMA_NEW_ENGINE='1';$env:OLLAMA_VULKAN='0';$env:OLLAMA_MAX_QUEUE='1';$env:OLLAMA_HOST='127.0.0.1:11435';$env:OLLAMA_SCHED_SPREAD='1';$env:OLLAMA_FLASH_ATTENTION='1';ollama serve"

Relevant log output

[GIN] 2026/03/31 - 19:55:44 | 200 | 84.3503ms | 127.0.0.1 | HEAD "/"
[GIN] 2026/03/31 - 19:55:45 | 200 | 111.554ms | 127.0.0.1 | GET "/api/ps"
[GIN] 2026/03/31 - 20:13:45 | 200 | 1.4590673s | 127.0.0.1 | GET "/api/tags"
[GIN] 2026/03/31 - 20:14:56 | 200 | 412.5284ms | 127.0.0.1 | GET "/api/tags"
[GIN] 2026/03/31 - 20:15:08 | 200 | 194.5318ms | 127.0.0.1 | GET "/api/tags"
[GIN] 2026/03/31 - 20:15:11 | 200 | 267.7857ms | 127.0.0.1 | GET "/api/tags"
[GIN] 2026/03/31 - 20:15:14 | 200 | 530.0361ms | 127.0.0.1 | GET "/api/tags"
[GIN] 2026/03/31 - 20:15:56 | 200 | 635.156ms | 127.0.0.1 | GET "/api/tags"
[GIN] 2026/03/31 - 20:16:04 | 200 | 301.8981ms | 127.0.0.1 | GET "/api/tags"
[GIN] 2026/03/31 - 20:19:23 | 200 | 4.327851s | 127.0.0.1 | GET "/api/tags"
[GIN] 2026/03/31 - 20:31:18 | 200 | 127.1235ms | 127.0.0.1 | HEAD "/"
[GIN] 2026/03/31 - 20:31:18 | 200 | 32.8794ms | 127.0.0.1 | GET "/api/ps"
[GIN] 2026/03/31 - 22:17:53 | 200 | 2h24m6s | 127.0.0.1 | POST "/api/chat"
[GIN] 2026/04/01 - 00:19:53 | 200 | 1.2835584s | 127.0.0.1 | HEAD "/"
[GIN] 2026/04/01 - 00:19:56 | 200 | 888.8263ms | 127.0.0.1 | GET "/api/ps"
[GIN] 2026/04/01 - 00:43:01 | 200 | 4h48m46s | 127.0.0.1 | POST "/api/chat"
[GIN] 2026/04/01 - 01:03:51 | 200 | 7.1998ms | 127.0.0.1 | HEAD "/"
[GIN] 2026/04/01 - 01:03:51 | 200 | 13.6305ms | 127.0.0.1 | GET "/api/ps"
[GIN] 2026/04/01 - 02:22:52 | 200 | 6h28m36s | 127.0.0.1 | POST "/api/chat"
[GIN] 2026/04/01 - 02:30:31 | 200 | 6h36m16s | 127.0.0.1 | POST "/api/chat"
[GIN] 2026/04/01 - 02:42:20 | 200 | 6h48m5s | 127.0.0.1 | POST "/api/chat"
[GIN] 2026/04/01 - 02:51:54 | 200 | 6h58m7s | 127.0.0.1 | POST "/api/chat"
[GIN] 2026/04/01 - 03:35:48 | 200 | 7h41m33s | 127.0.0.1 | POST "/api/chat"
[GIN] 2026/04/01 - 03:40:55 | 200 | 7h46m40s | 127.0.0.1 | POST "/api/chat"
[GIN] 2026/04/01 - 03:52:03 | 200 | 7h57m47s | 127.0.0.1 | POST "/api/chat"
[GIN] 2026/04/01 - 04:52:13 | 200 | 8h57m57s | 127.0.0.1 | POST "/api/chat"
[GIN] 2026/04/01 - 05:12:38 | 200 | 9h18m22s | 127.0.0.1 | POST "/api/chat"
[GIN] 2026/04/01 - 05:52:32 | 200 | 9h58m5s | 127.0.0.1 | POST "/api/chat"
[GIN] 2026/04/01 - 06:15:38 | 200 | 10h21m28s | 127.0.0.1 | POST "/api/chat"
[GIN] 2026/04/01 - 06:34:57 | 200 | 10h40m29s | 127.0.0.1 | POST "/api/chat"

OS

Windows

GPU

Nvidia

CPU

Intel

Ollama version

0.18.3

Originally created by @n4s3r on GitHub (Apr 1, 2026). Original GitHub issue: https://github.com/ollama/ollama/issues/15195 ### What is the issue? https://github.com/lfnovo/open-notebook/issues/711 I don't want parallelism in my setup, so I have this config but doesn't work: powershell -NoExit -Command "$host.UI.RawUI.WindowTitle = 'Ollama LLMs';$env:OLLAMA_GPU='1';$env:OLLAMA_NUM_THREADS='10';$env:OMP_NUM_THREADS='10';$env:OLLAMA_GPU_MEMORY_FRACTION='1';$env:OLLAMA_MAX_LOADED_MODELS='1';$env:OLLAMA_NUM_PARALLEL='1';$env:OLLAMA_KV_CACHE_TYPE='q8_0';$env:OLLAMA_NEW_ENGINE='1';$env:OLLAMA_VULKAN='0';$env:OLLAMA_MAX_QUEUE='1';$env:OLLAMA_HOST='127.0.0.1:11435';$env:OLLAMA_SCHED_SPREAD='1';$env:OLLAMA_FLASH_ATTENTION='1';ollama serve" ### Relevant log output ```shell [GIN] 2026/03/31 - 19:55:44 | 200 | 84.3503ms | 127.0.0.1 | HEAD "/" [GIN] 2026/03/31 - 19:55:45 | 200 | 111.554ms | 127.0.0.1 | GET "/api/ps" [GIN] 2026/03/31 - 20:13:45 | 200 | 1.4590673s | 127.0.0.1 | GET "/api/tags" [GIN] 2026/03/31 - 20:14:56 | 200 | 412.5284ms | 127.0.0.1 | GET "/api/tags" [GIN] 2026/03/31 - 20:15:08 | 200 | 194.5318ms | 127.0.0.1 | GET "/api/tags" [GIN] 2026/03/31 - 20:15:11 | 200 | 267.7857ms | 127.0.0.1 | GET "/api/tags" [GIN] 2026/03/31 - 20:15:14 | 200 | 530.0361ms | 127.0.0.1 | GET "/api/tags" [GIN] 2026/03/31 - 20:15:56 | 200 | 635.156ms | 127.0.0.1 | GET "/api/tags" [GIN] 2026/03/31 - 20:16:04 | 200 | 301.8981ms | 127.0.0.1 | GET "/api/tags" [GIN] 2026/03/31 - 20:19:23 | 200 | 4.327851s | 127.0.0.1 | GET "/api/tags" [GIN] 2026/03/31 - 20:31:18 | 200 | 127.1235ms | 127.0.0.1 | HEAD "/" [GIN] 2026/03/31 - 20:31:18 | 200 | 32.8794ms | 127.0.0.1 | GET "/api/ps" [GIN] 2026/03/31 - 22:17:53 | 200 | 2h24m6s | 127.0.0.1 | POST "/api/chat" [GIN] 2026/04/01 - 00:19:53 | 200 | 1.2835584s | 127.0.0.1 | HEAD "/" [GIN] 2026/04/01 - 00:19:56 | 200 | 888.8263ms | 127.0.0.1 | GET "/api/ps" [GIN] 2026/04/01 - 00:43:01 | 200 | 4h48m46s | 127.0.0.1 | POST "/api/chat" [GIN] 2026/04/01 - 01:03:51 | 200 | 7.1998ms | 127.0.0.1 | HEAD "/" [GIN] 2026/04/01 - 01:03:51 | 200 | 13.6305ms | 127.0.0.1 | GET "/api/ps" [GIN] 2026/04/01 - 02:22:52 | 200 | 6h28m36s | 127.0.0.1 | POST "/api/chat" [GIN] 2026/04/01 - 02:30:31 | 200 | 6h36m16s | 127.0.0.1 | POST "/api/chat" [GIN] 2026/04/01 - 02:42:20 | 200 | 6h48m5s | 127.0.0.1 | POST "/api/chat" [GIN] 2026/04/01 - 02:51:54 | 200 | 6h58m7s | 127.0.0.1 | POST "/api/chat" [GIN] 2026/04/01 - 03:35:48 | 200 | 7h41m33s | 127.0.0.1 | POST "/api/chat" [GIN] 2026/04/01 - 03:40:55 | 200 | 7h46m40s | 127.0.0.1 | POST "/api/chat" [GIN] 2026/04/01 - 03:52:03 | 200 | 7h57m47s | 127.0.0.1 | POST "/api/chat" [GIN] 2026/04/01 - 04:52:13 | 200 | 8h57m57s | 127.0.0.1 | POST "/api/chat" [GIN] 2026/04/01 - 05:12:38 | 200 | 9h18m22s | 127.0.0.1 | POST "/api/chat" [GIN] 2026/04/01 - 05:52:32 | 200 | 9h58m5s | 127.0.0.1 | POST "/api/chat" [GIN] 2026/04/01 - 06:15:38 | 200 | 10h21m28s | 127.0.0.1 | POST "/api/chat" [GIN] 2026/04/01 - 06:34:57 | 200 | 10h40m29s | 127.0.0.1 | POST "/api/chat" ``` ### OS Windows ### GPU Nvidia ### CPU Intel ### Ollama version 0.18.3
GiteaMirror added the bug label 2026-04-29 10:28:08 -05:00
Author
Owner

@PureBlissAK commented on GitHub (Apr 18, 2026):

🤖 Automated Triage & Analysis Report

Issue: #15195
Analyzed: 2026-04-18T18:22:54.011801

Analysis

  • Type: unknown
  • Severity: medium
  • Components: unknown

Implementation Plan

  • Effort: medium
  • Steps:

This issue has been triaged and marked for implementation.

<!-- gh-comment-id:4274310855 --> @PureBlissAK commented on GitHub (Apr 18, 2026): <!-- ollama-issue-orchestrator:v1 issue:15195 --> ## 🤖 Automated Triage & Analysis Report **Issue**: #15195 **Analyzed**: 2026-04-18T18:22:54.011801 ### Analysis - **Type**: unknown - **Severity**: medium - **Components**: unknown ### Implementation Plan - **Effort**: medium - **Steps**: *This issue has been triaged and marked for implementation.*
Sign in to join this conversation.
1 Participants
Notifications
Due Date
No due date set.
Dependencies

No dependencies set.

Reference: github-starred/ollama#56240