[GH-ISSUE #18397] issue: 0.6.33 problem with long contest 128k #57250

Closed
opened 2026-05-05 20:46:55 -05:00 by GiteaMirror · 5 comments
Owner

Originally created by @batot1 on GitHub (Oct 17, 2025).
Original GitHub issue: https://github.com/open-webui/open-webui/issues/18397

Check Existing Issues

  • I have searched for any existing and/or related issues.
  • I have searched for any existing and/or related discussions.
  • I am using the latest version of Open WebUI.

Installation Method

Docker

Open WebUI Version

0.6.33

Ollama Version (if applicable)

0.12.5

Operating System

Debian 12 (all updates)

Browser (if applicable)

FF 143.0.4 (64-bit)

Confirmation

  • I have read and followed all instructions in README.md.
  • I am using the latest version of both Open WebUI and Ollama.
  • I have included the browser console logs.
  • I have included the Docker container logs.
  • I have provided every relevant configuration, setting, and environment variable used in my setup.
  • I have clearly listed every relevant configuration, custom setting, environment variable, and command-line option that influences my setup (such as Docker Compose overrides, .env values, browser settings, authentication configurations, etc).
  • I have documented step-by-step reproduction instructions that are precise, sequential, and leave nothing to interpretation. My steps:
  • Start with the initial platform/version/OS and dependencies used,
  • Specify exact install/launch/configure commands,
  • List URLs visited, user input (incl. example values/emails/passwords if needed),
  • Describe all options and toggles enabled or changed,
  • Include any files or environmental changes,
  • Identify the expected and actual result at each stage,
  • Ensure any reasonably skilled user can follow and hit the same issue.

Expected Behavior

Bug: long 128k context.
After roughly the second question, the model never stops answering and repeats the same thing over and over in an infinite loop.

How to reproduce:
/ollama pull qwen3-coder:30b
/set parameter num_ctx 131072
/save qwen3-coder-128k

Now when I ask it 2-3 queries in the chat window, it gets stuck in a loop and never stops answering.
When I do the same directly in the Ollama CLI window, everything works properly.

Actual Behavior

Bug: long 128k context.
After roughly the second question, the model never stops answering and repeats the same thing over and over in an infinite loop.

How to reproduce:
/ollama pull qwen3-coder:30b
/set parameter num_ctx 131072
/save qwen3-coder-128k

Now when I ask it 2-3 queries in the chat window, it gets stuck in a loop and never stops answering.
When I do the same directly in the Ollama CLI window, everything works properly.

Steps to Reproduce

How to reproduce:
/ollama pull qwen3-coder:30b
/set parameter num_ctx 131072
/save qwen3-coder-128k

Now when I ask it 2-3 queries in a chat window in Open WebUI, it gets stuck in a loop and never stops answering.
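For reference, the interactive REPL steps above can also be done non-interactively with a Modelfile, which is the standard Ollama workflow for deriving a model with a larger context window (a sketch; the model name `qwen3-coder-128k` matches the `/save` step above):

```shell
# Write a Modelfile that bases the new model on qwen3-coder:30b
# and raises the context window to 131072 tokens (128k).
cat > Modelfile <<'EOF'
FROM qwen3-coder:30b
PARAMETER num_ctx 131072
EOF

# Build the derived model (equivalent to /set parameter num_ctx + /save in the REPL).
ollama create qwen3-coder-128k -f Modelfile
```

The resulting model should then appear in Open WebUI's model selector like any other locally available Ollama model.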

Logs & Screenshots

In the Open WebUI chat window:
Response payload is not completed: <TransferEncodingError: 400, message='Not enough data to satisfy transfer length header.'>
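One way to narrow this down (a suggestion, not part of the original report) is to stream the same request against Ollama's HTTP API directly, bypassing Open WebUI. If the stream also never terminates there, the loop is in the model/Ollama; if it finishes normally, the problem is in Open WebUI's streaming layer:

```shell
# Stream a chat completion straight from Ollama (default port 11434).
# A healthy stream ends with a final NDJSON chunk containing "done":true.
# -N disables curl's output buffering so chunks appear as they arrive.
curl -N http://localhost:11434/api/chat -d '{
  "model": "qwen3-coder-128k",
  "messages": [{"role": "user", "content": "Write 10 questions."}],
  "stream": true
}'
```

The `TransferEncodingError` above is raised by the HTTP client when a chunked response is cut off before the terminating chunk, which is consistent with the stream being aborted or never completing.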

Additional Information

In the Docker logs I don't see any warnings or errors, only INFO entries. Nothing special.

GiteaMirror added the bug label 2026-05-05 20:46:55 -05:00
Author
Owner

@Classic298 commented on GitHub (Oct 17, 2025):

Do you have the hardware to support a 128k context window?

The reproduction steps are not clear; how do I reproduce this, and what should I do?

Author
Owner

@batot1 commented on GitHub (Oct 17, 2025):

I gave you instructions on how to reproduce:

/ollama pull qwen3-coder:30b
/set parameter num_ctx 131072
/save qwen3-coder-128k

When you save this you get a new model with a long context. It works with Ollama, but it does not work properly with Open WebUI.

Yes, I have the hardware to run a 128k context.

$ ollama ps
NAME                          ID              SIZE    PROCESSOR          CONTEXT    UNTIL
qwen3-coder-30b-128k:latest   571f59fefc54    44 GB   48%/52% CPU/GPU    131072     4 minutes from now

I'm guessing that Open WebUI probably won't work with any long-context model, as it seems it isn't handling long contexts correctly.
Why are you closing the ticket without resolving it, or even verifying that the problem exists?
There are 4 simple steps to reproduce.

Author
Owner

@Classic298 commented on GitHub (Oct 17, 2025):

You did not provide any sensible steps to reproduce. How do I reproduce these steps inside of Open WebUI?

And yes, Open WebUI CAN handle long context models just fine.

Author
Owner

@batot1 commented on GitHub (Oct 17, 2025):

Open WebUI side:
new chat ---> search for the model (select qwen3-coder-30b-128k:latest)

  1. Ask anything, for example:
    "10 rows with questions"
    --->>> PRESS ENTER
  2. Wait for the answer.
  3. "10 rows with questions"
    --->>> PRESS ENTER
  4. Wait for the answer.
    "10 rows with questions" (no matter what you write)
    --->>> PRESS ENTER
  5. Wait for the answer. It probably never ends, but if it does end, repeat once more.
  6. "10 rows with questions" (no matter what you write)
    --->>> PRESS ENTER
Author
Owner

@silentoplayz commented on GitHub (Oct 17, 2025):

https://www.lenovo.com/us/en/glossary/pebkac/

Reference: github-starred/open-webui#57250