mirror of
https://github.com/open-webui/open-webui.git
synced 2026-05-07 19:38:46 -05:00
[GH-ISSUE #6911] Markdown text is being skipped or mistakenly read during TTS "Read Aloud" #53196
Originally created by @danielj23 on GitHub (Nov 13, 2024).
Original GitHub issue: https://github.com/open-webui/open-webui/issues/6911
Bug Report
Installation Method
Docker / windows
Environment
v0.3.35
TTS via matatonic/openedai-speech
Windows 10 / Firefox (latest)
Confirmation:
Expected Behavior:
All visible alphanumeric "text" that makes up the visible markdown should be sent to the TTS service during TTS "Read Aloud" playback.
Actual Behavior:
Some text, such as italicized text, is not played back during TTS "Read Aloud" playback.
Description
Bug Summary:
During Read Aloud playback I noticed that some words were skipped. The skipped text is almost always italicized markdown, though other markdown text can be affected too. Sometimes the TTS also reads hashtag symbols that are not visible in the rendered text.
Reproduction Details
Steps to Reproduce:
(see attached sample of json and wav output)
speech.zip
3c319a9970cb8c555490978176a74f47eb7becc04f1fde10ad8064c8be0b4154.json
7eccaee9ca14bc649740f954181476df390fb368fcf88cab73972d7abb9f8a41.json
13b3be85f1ff2dc7eb8364c872d4bb92d2b3ca5f3402b45184fc753412d6c01e.json
a8a0ec24b1676c26f84e86a5ad17eb550c9fb7d795babc197b84930d010e5c64.json
Example markdown to attempt playback.
@danielj23 commented on GitHub (Nov 13, 2024):
Adding here that I don't know what the correct behavior is for some of the text being sent to TTS.
For example, the TTS service is reading the # hashtag symbol that is presented to it in the JSON.
However, because it's not shown to the reader (it's just a header markdown symbol), it likely shouldn't be sent to the TTS service.
But, if I write "Hey Carl, this is a hashtag #", open-webui needs to send that hashtag to be read by TTS.
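To make that distinction concrete, here is a rough sketch of the kind of preprocessing I have in mind. This is not open-webui's actual code: `markdown_to_speech_text` is a hypothetical helper name, and the regexes only cover `#` headers and `*`/`_` emphasis. A real fix would probably rely on a proper Markdown parser instead.

```python
import re


def markdown_to_speech_text(markdown: str) -> str:
    """Hypothetical helper: reduce Markdown to the text a reader would see."""
    spoken_lines = []
    for line in markdown.splitlines():
        # Drop header markers: 1-6 '#' at the start of a line followed by
        # whitespace. A '#' anywhere else in the line is left alone.
        line = re.sub(r"^\s{0,3}#{1,6}\s+", "", line)
        # Unwrap bold (**text** or __text__), keeping the inner words.
        line = re.sub(r"(\*\*|__)(.+?)\1", r"\2", line)
        # Unwrap italics (*text* or _text_), keeping the inner words.
        # The lookarounds avoid touching names like snake_case_name.
        line = re.sub(r"(?<!\w)([*_])(.+?)\1(?!\w)", r"\2", line)
        spoken_lines.append(line)
    return "\n".join(spoken_lines)


if __name__ == "__main__":
    sample = "## Summary\nSome *skipped* words\nHey Carl, this is a hashtag #"
    print(markdown_to_speech_text(sample))
```

With something like this, the header symbol is dropped, the italicized words are kept, and the hashtag written inside a sentence still reaches the TTS service.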