mirror of
https://github.com/open-webui/open-webui.git
synced 2026-05-08 04:16:03 -05:00
[GH-ISSUE #5560] enh: option to disable TTS removeFormattings #14032
Reference in New Issue
Block a user
Delete Branch "%!s()"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
Originally created by @bergutman on GitHub (Sep 20, 2024).
Original GitHub issue: https://github.com/open-webui/open-webui/issues/5560
Installation Method
I forked the latest official helm chart to support v0.3.22 and deployed Open-WebUI to my Kubernetes cluster.
Environment
Confirmation:
Expected Behavior:
Azure TTS should read content that is italicized.
Actual Behavior:
Azure TTS skips over content that is italicized.
Description
Bug Summary:
The latest update to Open-WebUI added native Azure TTS support, which mostly works great—thank you! However, italicized content is not included in the array sent to Azure. I suspect it might be related to the asterisk (*) character causing a regex issue or something. Other formatting, like bold or quoted text, works fine.
Reproduction Details
Install v0.3.22 and generate audio for text containing italicized content.
Specific Azure TTS settings:
Voice: en-US-AvaNeural
Server: eastus
Output Format: audio-24khz-160kbitrate-mono-mp3
Response Splitting: Punctuation / Paragraph / None (doesn't matter)
Steps to Reproduce:
Ask any model to generate text containing italicized content (IE: Test 1 "Test 2" Test 3 Test 4 ) and click the read aloud button.
Logs and Screenshots
Given the following text:
Test 1 "Test 2" *Test 3* **Test 4**Browser Console Logs:
Prepared message content for TTS
Array [ 'Test 1 "Test 2" test 4' ]
ResponseMessage.svelte:176:11
Docker Container Logs:
INFO: REDACTED:0 - "GET /api/v1/chats/?page=1 HTTP/1.1" 200 OK
INFO: REDACTED:0 - "POST /audio/api/v1/speech HTTP/1.1" 200 OK
Screenshots/Screen Recordings (if applicable):
https://github.com/user-attachments/assets/829da1c8-9a60-4887-9119-ec5d421d64ee
Additional Information
I can verify that this bug is not occurring on Azure's end. From the Azure voice gallery you can test out sample text and it is able to read aloud content wrapped in asterisks.
@iamtheezac commented on GitHub (Sep 26, 2024):
GitHub Comment Draft
Hi, I'm experiencing the same issue as described above with italicized text being skipped during TTS playback. Here are my specific details:
Installation Method:
ollamaimage.Environment:
Steps to Reproduce:
Example Text:
When played back, the engine skips over
*test 3*, while the rest of the text is read correctly.Logs:
Here's a condensed version of my Docker logs:
I've also recorded a
.movfile where I tested various formatting, which is attached.https://github.com/user-attachments/assets/fcf59f23-378f-451b-86c7-3d710b7c2da1
@denispol commented on GitHub (Dec 18, 2024):
Fixed by #7919