[GH-ISSUE #11752] Does the Ollama /api/chat endpoint support setting Reasoning Effort? #33548

Closed
opened 2026-04-22 16:23:35 -05:00 by GiteaMirror · 5 comments

Originally created by @EntropyYue on GitHub (Aug 6, 2025).
Original GitHub issue: https://github.com/ollama/ollama/issues/11752

Issue #11738 mentions that the OpenAI-compatible endpoints support this parameter. Does the native /api/chat endpoint support it as well?


@drifkin commented on GitHub (Aug 6, 2025):

Yes! For the `gpt-oss` models, `think` can now be "high", "medium", or "low" in addition to true and false.

curl http://localhost:11434/api/chat -d '{
  "stream": false,
  "think": "low",
  "model": "gpt-oss",
  "messages": [
    { "role": "user", "content": "Tell me an interesting fact" }
  ]
}' | jq .

We'll be adding documentation and updating the SDKs soon.

We also support this from the CLI, e.g. via `--think=low`, and via `/set think low` within interactive CLI sessions.
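
For anyone calling the endpoint from code rather than curl, the same request can be sketched with only the Python standard library. This is a minimal sketch, not an official client: the helper names `build_chat_request` and `chat` are hypothetical, and the host and model are simply the ones from the curl example above.

```python
import json
import urllib.request

# String levels described above for gpt-oss; booleans are also accepted.
THINK_LEVELS = ("low", "medium", "high")


def build_chat_request(prompt, think="low", model="gpt-oss",
                       host="http://localhost:11434"):
    """Build (but do not send) a POST request for /api/chat.

    Hypothetical helper: mirrors the JSON body of the curl example.
    """
    if not (isinstance(think, bool) or think in THINK_LEVELS):
        raise ValueError(f"think must be a bool or one of {THINK_LEVELS}")
    payload = {
        "model": model,
        "stream": False,
        "think": think,
        "messages": [{"role": "user", "content": prompt}],
    }
    return urllib.request.Request(
        f"{host}/api/chat",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def chat(prompt, think="low", **kwargs):
    """Send the request to a running Ollama server and decode the JSON reply."""
    request = build_chat_request(prompt, think, **kwargs)
    with urllib.request.urlopen(request) as response:
        return json.load(response)

# Usage (requires a local Ollama server with gpt-oss pulled):
#   reply = chat("Tell me an interesting fact", think="low")
```

Validating `think` on the client side is a design choice of this sketch; the server may accept or reject other values on its own terms.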


@njbrake commented on GitHub (Aug 7, 2025):

Just a note that I think this behavior is broken in the Ollama Python SDK: https://github.com/ollama/ollama-python/issues/552


@drifkin commented on GitHub (Aug 7, 2025):

thanks, yes we need to update that, will try to get to that asap


@drifkin commented on GitHub (Aug 7, 2025):

Python SDK updated (v0.5.3); JavaScript is coming next. Example at https://github.com/ollama/ollama-python/blob/main/examples/thinking-levels.py
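
As a rough illustration of what using the updated SDK might look like (this is not the linked example file), the sketch below calls a chat function once per level and collects the results. It assumes that `ollama.chat` in v0.5.3 or later accepts `think="low"`, `"medium"`, or `"high"`, and that the response exposes `message.thinking` and `message.content`. The helper name `compare_think_levels` is hypothetical, and the chat function is passed in as a parameter so the helper itself has no hard dependency on the package.

```python
def compare_think_levels(chat, prompt, model="gpt-oss",
                         levels=("low", "medium", "high")):
    """Call `chat` once per thinking level and collect trace and answer.

    `chat` is expected to behave like `ollama.chat` (an assumption):
    it takes model, messages, and think keyword arguments.
    """
    results = {}
    for level in levels:
        response = chat(
            model=model,
            messages=[{"role": "user", "content": prompt}],
            think=level,
        )
        message = response.message
        results[level] = {
            # The thinking trace may be absent, so fall back to "".
            "thinking": getattr(message, "thinking", None) or "",
            "content": message.content,
        }
    return results

# Usage, assuming `pip install "ollama>=0.5.3"` and a running Ollama server:
#   import ollama
#   results = compare_think_levels(ollama.chat, "Tell me an interesting fact")
#   for level, out in results.items():
#       print(level, len(out["thinking"]), out["content"][:80])
```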


@chigkim commented on GitHub (Mar 1, 2026):

Is this possible through the OpenAI-compatible API by setting reasoning effort?
If so, can you set reasoning effort to "none" to disable thinking entirely for other models like qwen3.5?

Reference: github-starred/ollama#33548