mirror of
https://github.com/open-webui/open-webui.git
synced 2026-03-22 14:13:08 -05:00
In the streaming process, a crash may occur when the output tokens reach thousands or tens of thousands #2863
Originally created by @Nikoyyy on GitHub (Nov 28, 2024).
Installation Method
[docker run -d -p 3001:8080 --security-opt=seccomp=unconfined --privileged --mount=type=bind,source=/sys/fs/cgroup,target=/sys/fs/cgroup,readonly=false --mount=type=bind,source=/proc,target=/proc2,readonly=false,bind-recursive=disabled -v open-webui:/app/backend/data --name open-webui --restart always ghcr.io/open-webui/open-webui:main]
Environment
Open WebUI Version: [v0.4.6]
Ollama (if applicable): []
Operating System: [Windows 10]
Browser (if applicable): [Version 131.0.6778.86 (Official Build) (64-bit)]
Confirmation:
Expected Behavior:
Streaming continues without crashing, even when the output reaches thousands or tens of thousands of tokens.
Actual Behavior:
When streaming reaches thousands or tens of thousands of tokens, crashes may occur.
Description
Bug Summary:
When streaming reaches thousands or tens of thousands of tokens, crashes may occur.
Reproduction Details
Steps to Reproduce:
Using the QwQ-32B-Preview model, input: There exist real numbers x and y, both greater than 1, such that logx (y²) = log₁ (x¼) = 10. Find xy.
The output is very likely to reach thousands or tens of thousands of tokens, at which point it is highly likely to crash.
As the output increases, it becomes noticeably slower until it eventually crashes.
Logs and Screenshots
Screenshots/Screen Recordings (if applicable):
