[GH-ISSUE #19744] issue: message token cutoff not leaving enough space for completion #18981

Closed
opened 2026-04-20 01:16:19 -05:00 by GiteaMirror · 1 comment
Owner

Originally created by @NedasValantic on GitHub (Dec 4, 2025).
Original GitHub issue: https://github.com/open-webui/open-webui/issues/19744

Check Existing Issues

  • I have searched for any existing and/or related issues.
  • I have searched for any existing and/or related discussions.
  • I have also searched in the CLOSED issues AND CLOSED discussions and found no related items (your issue might already be addressed on the development branch!).
  • I am using the latest version of Open WebUI.

Installation Method

Git Clone

Open WebUI Version

v0.6.40

Ollama Version (if applicable)

No response

Operating System

windows 11

Browser (if applicable)

No response

Confirmation

  • I have read and followed all instructions in README.md.
  • I am using the latest version of both Open WebUI and Ollama.
  • I have included the browser console logs.
  • I have included the Docker container logs.
  • I have provided every relevant configuration, setting, and environment variable used in my setup.
  • I have clearly listed every relevant configuration, custom setting, environment variable, and command-line option that influences my setup (such as Docker Compose overrides, .env values, browser settings, authentication configurations, etc).
  • I have documented step-by-step reproduction instructions that are precise, sequential, and leave nothing to interpretation. My steps:
  • Start with the initial platform/version/OS and dependencies used,
  • Specify exact install/launch/configure commands,
  • List URLs visited, user input (incl. example values/emails/passwords if needed),
  • Describe all options and toggles enabled or changed,
  • Include any files or environmental changes,
  • Identify the expected and actual result at each stage,
  • Ensure any reasonably skilled user can follow and hit the same issue.

Expected Behavior

I should not face any token limitations when using the chat functionality

Actual Behavior

When chat gets longer or gets close to the token limit, automatic cut-offs should kick in. Seemingly they do that, but not enough is being cut off, leaving not enough space for completion, resulting in an error for the end user.

Steps to Reproduce

  1. Start up a chat
  2. Upload some information from knowledge, some utility files
  3. Use the chat for a bit more
  4. In due time you will get the error similar to what is shown in the provided image.

Logs & Screenshots

Image

Additional Information

No response

Originally created by @NedasValantic on GitHub (Dec 4, 2025). Original GitHub issue: https://github.com/open-webui/open-webui/issues/19744 ### Check Existing Issues - [x] I have searched for any existing and/or related issues. - [x] I have searched for any existing and/or related discussions. - [x] I have also searched in the CLOSED issues AND CLOSED discussions and found no related items (your issue might already be addressed on the development branch!). - [x] I am using the latest version of Open WebUI. ### Installation Method Git Clone ### Open WebUI Version v0.6.40 ### Ollama Version (if applicable) _No response_ ### Operating System windows 11 ### Browser (if applicable) _No response_ ### Confirmation - [x] I have read and followed all instructions in `README.md`. - [x] I am using the latest version of **both** Open WebUI and Ollama. - [x] I have included the browser console logs. - [x] I have included the Docker container logs. - [x] I have **provided every relevant configuration, setting, and environment variable used in my setup.** - [x] I have clearly **listed every relevant configuration, custom setting, environment variable, and command-line option that influences my setup** (such as Docker Compose overrides, .env values, browser settings, authentication configurations, etc). - [x] I have documented **step-by-step reproduction instructions that are precise, sequential, and leave nothing to interpretation**. My steps: - Start with the initial platform/version/OS and dependencies used, - Specify exact install/launch/configure commands, - List URLs visited, user input (incl. example values/emails/passwords if needed), - Describe all options and toggles enabled or changed, - Include any files or environmental changes, - Identify the expected and actual result at each stage, - Ensure any reasonably skilled user can follow and hit the same issue. ### Expected Behavior I should not face any token limitations when using the chat functionality ### Actual Behavior When chat gets longer or gets close to the token limit, automatic cut-offs should kick in. Seemingly they do that, but not enough is being cut off, leaving not enough space for completion, resulting in an error for the end user. ### Steps to Reproduce 1. Start up a chat 2. Upload some information from knowledge, some utility files 3. Use the chat for a bit more 4. In due time you will get the error similar to what is shown in the provided image. ### Logs & Screenshots <img width="2299" height="209" alt="Image" src="https://github.com/user-attachments/assets/ea239ba9-989a-4b8a-8fa5-4208b4daa851" /> ### Additional Information _No response_
GiteaMirror added the bug label 2026-04-20 01:16:19 -05:00
Author
Owner

@owui-terminator[bot] commented on GitHub (Dec 4, 2025):

🔍 Similar Issues Found

I found some existing issues that might be related to this one. Please check if any of these are duplicates or contain helpful solutions:

  1. #19656 issue: Tool call response tokens are duplicated, causing 2x token consumption
    by qq3829596922 • Dec 01, 2025 • bug

  2. #19563 issue:
    by naruto7g • Nov 28, 2025 • bug

  3. #19211 issue:
    by Byrnes9 • Nov 16, 2025 • bug

  4. #18974 issue: hanging on front matter requirements after update
    by DuckyBlender • Nov 06, 2025 • bug

  5. #19047 issue: followup questions sometimes fail to generate
    by avidwriter • Nov 08, 2025 • bug

Show 5 more related issues
  1. #19170 issue: Settings not honored or understood
    by itguy327 • Nov 13, 2025 • bug

  2. #19429 issue: user list wrong count and less than 30 items per page
    by destination-one • Nov 24, 2025 • bug

  3. #19230 issue: Open WebUI 0.6.36 Unexpected token 'd', "data: {"id"... is not valid JSON returned.
    by abhijit4law • Nov 17, 2025 • bug

  4. #19169 issue: System Prompt Duplication During Agentic Tool Calls Leading to Token Waste and Write-Cache Overprice
    by alexis-dioxycle • Nov 13, 2025 • bug

  5. #19390 issue: tools will double the token cost
    by qq3829596922 • Nov 23, 2025 • bug


💡 Tips:

  • If this is a duplicate, please consider closing this issue and adding any additional details to the existing one
  • If you found a solution in any of these issues, please share it here to help others

This comment was generated automatically by a bot. Please react with a 👍 if this comment was helpful, or a 👎 if it was not.

<!-- gh-comment-id:3611376620 --> @owui-terminator[bot] commented on GitHub (Dec 4, 2025): 🔍 **Similar Issues Found** I found some existing issues that might be related to this one. Please check if any of these are duplicates or contain helpful solutions: 1. [#19656](https://github.com/open-webui/open-webui/issues/19656) **issue: Tool call response tokens are duplicated, causing 2x token consumption** *by qq3829596922 • Dec 01, 2025 • `bug`* 2. [#19563](https://github.com/open-webui/open-webui/issues/19563) **issue:** *by naruto7g • Nov 28, 2025 • `bug`* 3. [#19211](https://github.com/open-webui/open-webui/issues/19211) **issue:** *by Byrnes9 • Nov 16, 2025 • `bug`* 4. [#18974](https://github.com/open-webui/open-webui/issues/18974) **issue: hanging on front matter requirements after update** *by DuckyBlender • Nov 06, 2025 • `bug`* 5. [#19047](https://github.com/open-webui/open-webui/issues/19047) **issue: followup questions sometimes fail to generate** *by avidwriter • Nov 08, 2025 • `bug`* <details> <summary>Show 5 more related issues</summary> 6. [#19170](https://github.com/open-webui/open-webui/issues/19170) **issue: Settings not honored or understood** *by itguy327 • Nov 13, 2025 • `bug`* 7. [#19429](https://github.com/open-webui/open-webui/issues/19429) **issue: user list wrong count and less than 30 items per page** *by destination-one • Nov 24, 2025 • `bug`* 8. [#19230](https://github.com/open-webui/open-webui/issues/19230) **issue: Open WebUI 0.6.36 Unexpected token 'd', "data: {"id"... is not valid JSON returned.** *by abhijit4law • Nov 17, 2025 • `bug`* 9. [#19169](https://github.com/open-webui/open-webui/issues/19169) **issue: System Prompt Duplication During Agentic Tool Calls Leading to Token Waste and Write-Cache Overprice** *by alexis-dioxycle • Nov 13, 2025 • `bug`* 10. [#19390](https://github.com/open-webui/open-webui/issues/19390) **issue: tools will double the token cost** *by qq3829596922 • Nov 23, 2025 • `bug`* </details> --- 💡 **Tips:** - If this is a duplicate, please consider closing this issue and adding any additional details to the existing one - If you found a solution in any of these issues, please share it here to help others *This comment was generated automatically by a bot.* Please react with a 👍 if this comment was helpful, or a 👎 if it was not.
Sign in to join this conversation.
1 Participants
Notifications
Due Date
No due date set.
Dependencies

No dependencies set.

Reference: github-starred/open-webui#18981