[GH-ISSUE #14561] issue: thinking not showing up for Openrouter models anymore after recent update #104295

New Issue

2026-05-18T01:57:21-05:00

GiteaMirror commented

2026-05-18 01:57:21 -05:00

Originally created by @amanat361 on GitHub (May 31, 2025).
Original GitHub issue: https://github.com/open-webui/open-webui/issues/14561

Check Existing Issues

I have searched the existing issues and discussions.
I am using the latest version of Open WebUI.

Installation Method

Git Clone

Open WebUI Version

0.6.13

Ollama Version (if applicable)

No response

Operating System

Ubuntu 22.04

Browser (if applicable)

Chrome

Confirmation

I have read and followed all instructions in README.md.
I am using the latest version of both Open WebUI and Ollama.
I have included the browser console logs.
I have included the Docker container logs.
I have provided every relevant configuration, setting, and environment variable used in my setup.
I have clearly listed every relevant configuration, custom setting, environment variable, and command-line option that influences my setup (such as Docker Compose overrides, .env values, browser settings, authentication configurations, etc).
I have documented step-by-step reproduction instructions that are precise, sequential, and leave nothing to interpretation. My steps:
Start with the initial platform/version/OS and dependencies used,
Specify exact install/launch/configure commands,
List URLs visited, user input (incl. example values/emails/passwords if needed),
Describe all options and toggles enabled or changed,
Include any files or environmental changes,
Identify the expected and actual result at each stage,
Ensure any reasonably skilled user can follow and hit the same issue.

Expected Behavior

View the "thinking" preview for reasoning models like Gemini 2.5 Pro. This worked normally for a while but upon the recent update no longer works. Local models on Ollama show reasoning, but external models do not. The expected behavior is the little spinner that says "thinking" where you can expand to see the reasoning tokens before the output completes.

Actual Behavior

Current behavior is that the model just has a loading state until reasoning completes and then continues with a standard response. For Deepseek models on Groq, it works, but for any Openrouter model (Gemini 2.5 pro or Deepseek R1 0528), it does not work. It also does not work for models from the Openai provider like O3 and O4-mini.

Steps to Reproduce

Clean Openwebui install, newest version. Ollama version irrelevant since this is an external provider issue. Go to "connections" tab in settings and add Openrouter and Openai providers respectively. Try models like Gemini 2.5 Pro

which has the tag Openrouter.google/gemini-2.5-pro-preview

Submit a prompt in a chat that requires reasoning like "give me an interesting detailed proof for the square root of pi being irrational"

Notice that the thinking doesn't actually show. Sorry guys, I know the issue template wants me to put lots of info, but it's really as simple as that. On some models, reasoning doesn't show up. Not sure what else to explain!

Logs & Screenshots

Ok this is weird. When I went to take a screenshot, it literally worked the ONE time I tried to show it not working. Absolutely baffled.

However that was just Deepsek. I switched models to Gemini 2.5 Pro and yep, as expected, just a loading state:

In the browser console logs, the "delta" in the choice completion streaming is just an empty string over and over during the supposed reasoning stage:

Additional Information

Maybe it is a Gemini issue and the models changed or something but I could swear I remember the thinking working before, maybe I'm crazy. Let me know if you'd like more detailed info, I hope this is alright!

Originally created by @amanat361 on GitHub (May 31, 2025). Original GitHub issue: https://github.com/open-webui/open-webui/issues/14561 ### Check Existing Issues - [x] I have searched the existing issues and discussions. - [x] I am using the latest version of Open WebUI. ### Installation Method Git Clone ### Open WebUI Version 0.6.13 ### Ollama Version (if applicable) _No response_ ### Operating System Ubuntu 22.04 ### Browser (if applicable) Chrome ### Confirmation - [x] I have read and followed all instructions in `README.md`. - [x] I am using the latest version of **both** Open WebUI and Ollama. - [x] I have included the browser console logs. - [x] I have included the Docker container logs. - [x] I have **provided every relevant configuration, setting, and environment variable used in my setup.** - [x] I have clearly **listed every relevant configuration, custom setting, environment variable, and command-line option that influences my setup** (such as Docker Compose overrides, .env values, browser settings, authentication configurations, etc). - [x] I have documented **step-by-step reproduction instructions that are precise, sequential, and leave nothing to interpretation**. My steps: - Start with the initial platform/version/OS and dependencies used, - Specify exact install/launch/configure commands, - List URLs visited, user input (incl. example values/emails/passwords if needed), - Describe all options and toggles enabled or changed, - Include any files or environmental changes, - Identify the expected and actual result at each stage, - Ensure any reasonably skilled user can follow and hit the same issue. ### Expected Behavior View the "thinking" preview for reasoning models like Gemini 2.5 Pro. This worked normally for a while but upon the recent update no longer works. Local models on Ollama show reasoning, but external models do not. The expected behavior is the little spinner that says "thinking" where you can expand to see the reasoning tokens before the output completes. ### Actual Behavior Current behavior is that the model just has a loading state until reasoning completes and then continues with a standard response. For Deepseek models on Groq, it works, but for any Openrouter model (Gemini 2.5 pro or Deepseek R1 0528), it does not work. It also does not work for models from the Openai provider like O3 and O4-mini. ### Steps to Reproduce Clean Openwebui install, newest version. Ollama version irrelevant since this is an external provider issue. Go to "connections" tab in settings and add Openrouter and Openai providers respectively. Try models like Gemini 2.5 Pro ![Image](https://github.com/user-attachments/assets/3b4150af-c19e-4fe8-95f1-8d0e3a869b41) which has the tag `Openrouter.google/gemini-2.5-pro-preview` Submit a prompt in a chat that requires reasoning like "give me an interesting detailed proof for the square root of pi being irrational" Notice that the thinking doesn't actually show. Sorry guys, I know the issue template wants me to put lots of info, but it's really as simple as that. On some models, reasoning doesn't show up. Not sure what else to explain! ### Logs & Screenshots Ok this is weird. When I went to take a screenshot, it literally worked the ONE time I tried to show it not working. Absolutely baffled. ![Image](https://github.com/user-attachments/assets/e3743bbd-a2b9-42e9-9846-95a5a632aec0) However that was just Deepsek. I switched models to Gemini 2.5 Pro and yep, as expected, just a loading state: ![Image](https://github.com/user-attachments/assets/80615f3c-be4f-4b6f-b9be-180a334f72a9) In the browser console logs, the "delta" in the choice completion streaming is just an empty string over and over during the supposed reasoning stage: ![Image](https://github.com/user-attachments/assets/e721fd4a-5b47-4600-adec-34e4d2b5b6fe) ![Image](https://github.com/user-attachments/assets/0df4d2d2-df7b-4139-860c-23f181d2e560) ### Additional Information Maybe it is a Gemini issue and the models changed or something but I could swear I remember the thinking working before, maybe I'm crazy. Let me know if you'd like more detailed info, I hope this is alright!

GiteaMirror added the bug label 2026-05-18 01:57:21 -05:00

GiteaMirror closed this issue

2026-05-18 01:57:22 -05:00

GiteaMirror commented

2026-05-18 01:57:23 -05:00

@tjbck commented on GitHub (Jun 2, 2025):

This might have more to do with model's stochastic behaviour than Open WebUI, we haven't changed our logic to handle reasoning content. Keep us updated!

@tjbck commented on GitHub (Jun 2, 2025): This might have more to do with model's stochastic behaviour than Open WebUI, we haven't changed our logic to handle reasoning content. Keep us updated!

Sign in to join this conversation.

Branches Tags

1 Participants

Notifications

Due Date

No due date set.

Dependencies

No dependencies set.

Reference: github-starred/open-webui#104295