mirror of
https://github.com/open-webui/open-webui.git
synced 2026-05-06 10:58:17 -05:00
[PR #23843] [CLOSED] fix: skip <think> wrap for native reasoning_content round-trip #66260
Reference in New Issue
Block a user
Delete Branch "%!s()"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
📋 Pull Request Information
Original PR: https://github.com/open-webui/open-webui/pull/23843
Author: @nulltea
Created: 4/17/2026
Status: ❌ Closed
Base:
dev← Head:fix/reasoning-content-roundtrip-native-path📝 Commits (1)
b3a8fa6fix: skip wrap for native reasoning_content round-trip📊 Changes
1 file changed (+9 additions, -4 deletions)
View changed files
📝
backend/open_webui/utils/misc.py(+9 -4)📄 Description
Chat templates that don't strip (e.g. Gemma 4) leak the tags into the prompt, causing reasoning markup to bleed into the visible reply after a tool call. Only add the wrap for inline-tag-detected reasoning; native-field reasoning ships as reasoning_content alone.
Pull Request Checklist
Note to first-time contributors: Please open a discussion post in Discussions to discuss your idea/fix with the community before creating a pull request, and describe your changes before submitting a pull request.
This is to ensure large feature PRs are discussed with the community first, before starting work on it. If the community does not want this feature or it is not relevant for Open WebUI as a project, it can be identified in the discussion before working on the feature and submitting the PR.
Before submitting, make sure you've checked the following:
devbranch. PRs targetingmainwill be immediately closed.devto ensure no unrelated commits (e.g. frommain) are included. Push updates to the existing PR branch instead of closing and reopening.Changelog Entry
Description
convert_output_to_messagesemits reasoning received via native provider fields (delta.reasoning_content/.reasoning/.thinking) only as the structuredreasoning_contentfield on the assistant message, without the additional<think>…</think>wrap insidecontent. Reasoning detected from inline tags is unchanged. Fixes a regression introduced by PR #23742 where chat templates that don't strip<think>(Gemma 4'sstrip_thinkingonly removes<|channel>…<channel|>, same pattern for gpt-oss harmony and Qwen3-thinking) passed the foreign tags into the prompt after a tool round-trip, leaking literalthought/<channel|>markup into the visible reply.Added
Changed
attributes.type == 'reasoning_content'are no longer duplicated as a<think>…</think>wrap incontentwhen reconstructing assistant history for the backend; only the structuredreasoning_contentfield is sent. Inline-tag-detected reasoning keeps the existing behavior.Deprecated
Removed
Fixed
thought/<channel|>leak in replies after native tool calls on reasoning-capable models whose chat templates consumereasoning_contentnatively but do not strip<think>fromcontent(Gemma 4, gpt-oss harmony, Qwen3-thinking). Regression of PR #23742.Security
Breaking Changes
Additional Information
Problem. The partial fix in
3dd825581introducedpending_reasoningso assistant tool-call messages include areasoning_contentfield (needed by Kimi K2.5). It still also emits a<think>…</think>wrap incontent. Gemma 4'sstrip_thinkingmacro only strips<|channel>…<channel|>, so the<think>tags pass into the prompt. The model's own channel markers then either get rendered as empty special tokens or bleed intodelta.content, producing visiblethought\n<channel|>…leakage.Fix. Branch on the existing provenance marker
attributes.type == 'reasoning_content'(set atmiddleware.py:3774only for reasoning from native provider fields, and already used atmiddleware.py:3852by theinside_tag_blockguard). For those items, emit only the structured field; skip the<think>wrap. Inline-tag-detected items keep the wrap since those models emit and expect inline tags.Scope. One function, one branch. No changes to DB schema, frontend rendering, export, or other call sites. No new config.
Manual verification. Direct API curl: before,
delta.contentstarts with literalthought/\n/<channel|>; after, reasoning streams indelta.reasoning_contentand content stays clean. End-to-end via OpenWebUI + Gemma 4 + a Python tool: reply renders a clean "Thought for N seconds" collapsible.Screenshots or Videos
Before:


After:
Contributor License Agreement
🔄 This issue represents a GitHub Pull Request. It cannot be merged through Gitea due to API limitations.