enh: Add TTS Response Splitting option for splitting on markdown (bold/italics) #3479

Closed
opened 2025-11-11 15:32:33 -06:00 by GiteaMirror · 0 comments
Owner

Originally created by @TheRealPSV on GitHub (Jan 29, 2025).

Feature Request

Is your feature request related to a problem? Please describe.
The Response Splitting option for TTS is helpful, but the "Sentences" option doesn't split off bold/italicized sentences that some models will output, creating run-on sentences when sent to the TTS engine.

Describe the solution you'd like
Create a Response Splitting option similar to the Sentences option, but that also splits on bold or italics markers (asterisks) for models that insert "roleplaying" sentences, such as characters' actions, to help prevent run-on sentences in TTS.

Describe alternatives you've considered
Another potential option could be to have the markdown cleanup that is performed during sending to TTS split the sentence as it removes markdown, but a user-configurable option would likely be more robust, as certain models are more likely to require this, while it may be detrimental to others.

Additional context
Example of a paragraph that becomes a run-on sentence when passed to TTS:

Hello! My name is Henry. Holds out hand Nice to meet you.

In the above paragraph, the Sentence option's split will result in the TTS engine outputting the run-on sentence "Holds out hand nice to meet you."

Originally created by @TheRealPSV on GitHub (Jan 29, 2025). # Feature Request **Is your feature request related to a problem? Please describe.** The Response Splitting option for TTS is helpful, but the "Sentences" option doesn't split off bold/italicized sentences that some models will output, creating run-on sentences when sent to the TTS engine. **Describe the solution you'd like** Create a Response Splitting option similar to the Sentences option, but that also splits on bold or italics markers (asterisks) for models that insert "roleplaying" sentences, such as characters' actions, to help prevent run-on sentences in TTS. **Describe alternatives you've considered** Another potential option could be to have the markdown cleanup that is performed during sending to TTS split the sentence as it removes markdown, but a user-configurable option would likely be more robust, as certain models are more likely to require this, while it may be detrimental to others. **Additional context** Example of a paragraph that becomes a run-on sentence when passed to TTS: Hello! My name is Henry. *Holds out hand* Nice to meet you. In the above paragraph, the Sentence option's split will result in the TTS engine outputting the run-on sentence "Holds out hand nice to meet you."
Sign in to join this conversation.
1 Participants
Notifications
Due Date
No due date set.
Dependencies

No dependencies set.

Reference: github-starred/open-webui#3479