[GH-ISSUE #18237] feat: have TTS Engine calls occure while response text is still appearing #18541

Closed
opened 2026-04-20 00:46:08 -05:00 by GiteaMirror · 1 comment
Owner

Originally created by @RetroZelda on GitHub (Oct 11, 2025).
Original GitHub issue: https://github.com/open-webui/open-webui/issues/18237

Check Existing Issues

  • I have searched the existing issues and discussions.

Problem Description

im using chatterbox for TTS and the delay from the text response appearing to when the TTS plays back is very long. It seems like it waits for the entire promt to finish before calling the api to generate the audio.

Desired Solution you'd like

I think it could speed things up greatly if the "response splitting" setting in admin/settings/audio could start splitting the responses and sending them off as the response is generated.

An example is right now I have it set to "Punctuation" and it will wait for the entire promt to finish, but instead it could see that a period, question mark, etc was output and to then send off that sentence to the TTS engine to begin generating the response.

Alternatives Considered

No response

Additional Context

No response

Originally created by @RetroZelda on GitHub (Oct 11, 2025). Original GitHub issue: https://github.com/open-webui/open-webui/issues/18237 ### Check Existing Issues - [x] I have searched the existing issues and discussions. ### Problem Description im using chatterbox for TTS and the delay from the text response appearing to when the TTS plays back is very long. It seems like it waits for the entire promt to finish before calling the api to generate the audio. ### Desired Solution you'd like I think it could speed things up greatly if the "response splitting" setting in `admin/settings/audio` could start splitting the responses and sending them off as the response is generated. An example is right now I have it set to "Punctuation" and it will wait for the entire promt to finish, but instead it could see that a period, question mark, etc was output and to then send off that sentence to the TTS engine to begin generating the response. ### Alternatives Considered _No response_ ### Additional Context _No response_
Author
Owner

@tjbck commented on GitHub (Oct 11, 2025):

Implemented with voice mode.

<!-- gh-comment-id:3393578015 --> @tjbck commented on GitHub (Oct 11, 2025): Implemented with voice mode.
Sign in to join this conversation.
1 Participants
Notifications
Due Date
No due date set.
Dependencies

No dependencies set.

Reference: github-starred/open-webui#18541