[GH-ISSUE #12907] "/set nothink" not disabling reasoning output in Qwen3:4B #8555

Closed
opened 2026-04-12 21:16:21 -05:00 by GiteaMirror · 2 comments
Owner

Originally created by @Saleh7127 on GitHub (Nov 2, 2025).
Original GitHub issue: https://github.com/ollama/ollama/issues/12907

What is the issue?

When running the Qwen3:4B model, the model still outputs its internal reasoning (“thinking”) even when the thinking mode is explicitly disabled.

I have tried both methods:

  • Setting "think": false in the /api/chat JSON payload
  • Using the CLI command "/set nothink" before the prompt

Neither disables the reasoning traces; the model still prints intermediate reasoning steps before the final answer.

Image Image

Relevant log output


OS

macOS

GPU

Apple

CPU

Apple

Ollama version

0.12.9

Originally created by @Saleh7127 on GitHub (Nov 2, 2025). Original GitHub issue: https://github.com/ollama/ollama/issues/12907 ### What is the issue? When running the Qwen3:4B model, the model still outputs its internal reasoning (“thinking”) even when the thinking mode is explicitly disabled. I have tried both methods: - Setting "think": false in the /api/chat JSON payload - Using the CLI command "/set nothink" before the prompt Neither disables the reasoning traces; the model still prints intermediate reasoning steps before the final answer. <img width="546" height="307" alt="Image" src="https://github.com/user-attachments/assets/346f5e7e-9ab4-437b-ab4f-78e0b5b95947" /> <img width="852" height="397" alt="Image" src="https://github.com/user-attachments/assets/f9af4d68-642e-4d3c-8e20-bd2970d6496b" /> ### Relevant log output ```shell ``` ### OS macOS ### GPU Apple ### CPU Apple ### Ollama version 0.12.9
GiteaMirror added the bug label 2026-04-12 21:16:21 -05:00
Author
Owner

@ronething commented on GitHub (Nov 2, 2025):

i found qwen3-vl:8b also had this issue.

Image
<!-- gh-comment-id:3478103492 --> @ronething commented on GitHub (Nov 2, 2025): i found `qwen3-vl:8b` also had this issue. <img width="1216" height="322" alt="Image" src="https://github.com/user-attachments/assets/576c1445-debd-42d3-ad8a-d8fb75663bdd" />
Author
Owner

@jmorganca commented on GitHub (Nov 2, 2025):

Hi all, similar to qwen3, qwen3-vl has different versions for whether the model can think or not. For the non-thinking version, use ollama run qwen3-vl:8b-instruct

<!-- gh-comment-id:3478500546 --> @jmorganca commented on GitHub (Nov 2, 2025): Hi all, similar to `qwen3`, `qwen3-vl` has different versions for whether the model can think or not. For the non-thinking version, use `ollama run qwen3-vl:8b-instruct`
Sign in to join this conversation.
1 Participants
Notifications
Due Date
No due date set.
Dependencies

No dependencies set.

Reference: github-starred/ollama#8555