[GH-ISSUE #5921] TTS not working for Call #52838

New Issue

GiteaMirror · 2026-05-05T13:59:41-05:00

GiteaMirror commented

2026-05-05 13:59:41 -05:00

Originally created by @beddows on GitHub (Oct 5, 2024).
Original GitHub issue: https://github.com/open-webui/open-webui/issues/5921

Bug Report

Installation Method

Docker, for open-webui and pipelines

Environment

Open WebUI Version: v0.3.30
Operating System: macOS Sonoma 14.5
Browser (if applicable): Chrome 129.0

Confirmation:

I have read and followed all the instructions provided in the README.md.
I am on the latest version of both Open WebUI and Ollama.
I have included the browser console logs.
I have included the Docker container logs.
I have provided the exact steps to reproduce the bug in the "Steps to Reproduce" section below.

Expected Behavior:

When using Call, the TTS should work.

Actual Behavior:

TTS only works with Record Voice. Using Call hangs during the response.

Description

Bug Summary:

using Claude via Pipelines, no issues
STT works well for both Record voice and Call, no issues
TTS works well when using Record voice, no errors
TTS using Call generates an error "400 Bad Request. Invalid voice id"

Reproduction Details

Steps to Reproduce:

go to admin panel, settings, audio
under TTS settings, select Eleven Labs
enter API key
select TTS voice and model via drop-downs
click Save
go to new chat, click Call

Logs and Screenshots

Browser Console Logs:
POST http://localhost:3000/audio/api/v1/speech 400 (Bad Request)
CallOverlay.svelte:457 Invalid voice id

Docker Container Logs:
2024-10-05 12:50:57 INFO: 192.168.65.1:54184 - "GET /api/v1/auths/admin/config HTTP/1.1" 200 OK
2024-10-05 12:51:00 INFO: 192.168.65.1:54184 - "GET /audio/api/v1/config HTTP/1.1" 200 OK
2024-10-05 12:51:00 INFO: 192.168.65.1:54184 - "GET /audio/api/v1/voices HTTP/1.1" 200 OK
2024-10-05 12:51:00 INFO: 192.168.65.1:54184 - "GET /audio/api/v1/models HTTP/1.1" 200 OK
2024-10-05 12:53:38 INFO [open_webui.env] Saving 'AUDIO_TTS_OPENAI_API_BASE_URL' to the database
2024-10-05 12:53:38 INFO [open_webui.env] Saving 'AUDIO_TTS_OPENAI_API_KEY' to the database
2024-10-05 12:53:38 INFO [open_webui.env] Saving 'AUDIO_TTS_API_KEY' to the database
2024-10-05 12:53:38 INFO [open_webui.env] Saving 'AUDIO_TTS_ENGINE' to the database
2024-10-05 12:53:38 INFO [open_webui.env] Saving 'AUDIO_TTS_MODEL' to the database
2024-10-05 12:53:38 INFO [open_webui.env] Saving 'AUDIO_TTS_VOICE' to the database
2024-10-05 12:53:38 INFO [open_webui.env] Saving 'AUDIO_TTS_SPLIT_ON' to the database
2024-10-05 12:53:38 INFO [open_webui.env] Saving 'AUDIO_TTS_AZURE_SPEECH_REGION' to the database
2024-10-05 12:53:38 INFO [open_webui.env] Saving 'AUDIO_TTS_AZURE_SPEECH_OUTPUT_FORMAT' to the database
2024-10-05 12:53:38 INFO [open_webui.env] Saving 'AUDIO_STT_OPENAI_API_BASE_URL' to the database
2024-10-05 12:53:38 INFO [open_webui.env] Saving 'AUDIO_STT_OPENAI_API_KEY' to the database
2024-10-05 12:53:38 INFO [open_webui.env] Saving 'AUDIO_STT_ENGINE' to the database
2024-10-05 12:53:38 INFO [open_webui.env] Saving 'AUDIO_STT_MODEL' to the database
2024-10-05 12:53:38 INFO: 192.168.65.1:60604 - "POST /audio/api/v1/config/update HTTP/1.1" 200 OK
2024-10-05 12:53:38 INFO: 192.168.65.1:60604 - "GET /api/config HTTP/1.1" 200 OK
2024-10-05 12:53:43 INFO: 192.168.65.1:60604 - "GET /api/v1/chats/1340572e-aced-4f80-a47f-c81da09b9934 HTTP/1.1" 200 OK
2024-10-05 12:53:43 INFO: 192.168.65.1:60604 - "GET /api/v1/chats/1340572e-aced-4f80-a47f-c81da09b9934/tags HTTP/1.1" 200 OK
2024-10-05 12:53:43 INFO: 192.168.65.1:60604 - "GET /api/v1/users/user/settings HTTP/1.1" 200 OK
2024-10-05 12:53:43 INFO: 192.168.65.1:60604 - "GET /ollama/api/version HTTP/1.1" 200 OK
2024-10-05 12:53:43 INFO: 192.168.65.1:46131 - "GET /static/favicon.png HTTP/1.1" 304 Not Modified
2024-10-05 12:54:01 INFO [open_webui.apps.audio.main] file.content_type: audio/wav
2024-10-05 12:54:02 INFO [open_webui.apps.audio.main] Detected language 'en' with probability 0.994098
2024-10-05 12:54:04 68d59def-c699-481a-a7f4-597b5280dc88.wav
2024-10-05 12:54:04 {'text': 'Yes, the speech to text is working, but not the text to speech.'}
2024-10-05 12:54:04 INFO: 192.168.65.1:35608 - "POST /audio/api/v1/transcriptions HTTP/1.1" 200 OK
2024-10-05 12:54:04 INFO: 192.168.65.1:35608 - "POST /api/chat/completions HTTP/1.1" 200 OK
2024-10-05 12:54:05 INFO: 192.168.65.1:29340 - "POST /audio/api/v1/speech HTTP/1.1" 400 Bad Request
2024-10-05 12:54:05 INFO: 192.168.65.1:29340 - "POST /audio/api/v1/speech HTTP/1.1" 400 Bad Request
2024-10-05 12:54:06 INFO: 192.168.65.1:29340 - "POST /audio/api/v1/speech HTTP/1.1" 400 Bad Request
2024-10-05 12:54:06 INFO: 192.168.65.1:29340 - "POST /audio/api/v1/speech HTTP/1.1" 400 Bad Request
2024-10-05 12:54:06 INFO: 192.168.65.1:29340 - "POST /audio/api/v1/speech HTTP/1.1" 400 Bad Request
2024-10-05 12:54:07 INFO: 192.168.65.1:29340 - "POST /audio/api/v1/speech HTTP/1.1" 400 Bad Request
2024-10-05 12:54:07 INFO: 192.168.65.1:29340 - "POST /audio/api/v1/speech HTTP/1.1" 400 Bad Request
2024-10-05 12:54:08 INFO: 192.168.65.1:35608 - "POST /api/v1/chats/1340572e-aced-4f80-a47f-c81da09b9934 HTTP/1.1" 200 OK
2024-10-05 12:54:08 INFO: 192.168.65.1:35608 - "GET /api/v1/chats/?page=1 HTTP/1.1" 200 OK
2024-10-05 12:54:08 INFO: 192.168.65.1:35608 - "POST /api/chat/completed HTTP/1.1" 200 OK
2024-10-05 12:54:08 INFO: 192.168.65.1:35608 - "POST /api/v1/chats/1340572e-aced-4f80-a47f-c81da09b9934 HTTP/1.1" 200 OK
2024-10-05 12:54:08 INFO: 192.168.65.1:35608 - "GET /api/v1/chats/?page=1 HTTP/1.1" 200 OK
2024-10-05 12:54:08 INFO: 192.168.65.1:29340 - "GET /api/v1/chats/?page=1 HTTP/1.1" 200 OK
2024-10-05 12:54:08 INFO: 192.168.65.1:35608 - "POST /audio/api/v1/speech HTTP/1.1" 400 Bad Request

Originally created by @beddows on GitHub (Oct 5, 2024). Original GitHub issue: https://github.com/open-webui/open-webui/issues/5921 # Bug Report ## Installation Method Docker, for open-webui and pipelines ## Environment - **Open WebUI Version:** v0.3.30 - **Operating System:** macOS Sonoma 14.5 - **Browser (if applicable):** Chrome 129.0 **Confirmation:** - [x] I have read and followed all the instructions provided in the README.md. - [x] I am on the latest version of both Open WebUI and Ollama. - [x] I have included the browser console logs. - [x] I have included the Docker container logs. - [x] I have provided the exact steps to reproduce the bug in the "Steps to Reproduce" section below. ## Expected Behavior: When using Call, the TTS should work. ## Actual Behavior: TTS only works with Record Voice. Using Call hangs during the response. ## Description **Bug Summary:** - using Claude via Pipelines, no issues - STT works well for both Record voice and Call, no issues - TTS works well when using Record voice, no errors - TTS using Call generates an error "400 Bad Request. Invalid voice id" ## Reproduction Details **Steps to Reproduce:** - go to admin panel, settings, audio - under TTS settings, select Eleven Labs - enter API key - select TTS voice and model via drop-downs - click Save - go to new chat, click Call ## Logs and Screenshots **Browser Console Logs:** POST http://localhost:3000/audio/api/v1/speech 400 (Bad Request) CallOverlay.svelte:457 Invalid voice id **Docker Container Logs:** 2024-10-05 12:50:57 INFO: 192.168.65.1:54184 - "GET /api/v1/auths/admin/config HTTP/1.1" 200 OK 2024-10-05 12:51:00 INFO: 192.168.65.1:54184 - "GET /audio/api/v1/config HTTP/1.1" 200 OK 2024-10-05 12:51:00 INFO: 192.168.65.1:54184 - "GET /audio/api/v1/voices HTTP/1.1" 200 OK 2024-10-05 12:51:00 INFO: 192.168.65.1:54184 - "GET /audio/api/v1/models HTTP/1.1" 200 OK 2024-10-05 12:53:38 INFO [open_webui.env] Saving 'AUDIO_TTS_OPENAI_API_BASE_URL' to the database 2024-10-05 12:53:38 INFO [open_webui.env] Saving 'AUDIO_TTS_OPENAI_API_KEY' to the database 2024-10-05 12:53:38 INFO [open_webui.env] Saving 'AUDIO_TTS_API_KEY' to the database 2024-10-05 12:53:38 INFO [open_webui.env] Saving 'AUDIO_TTS_ENGINE' to the database 2024-10-05 12:53:38 INFO [open_webui.env] Saving 'AUDIO_TTS_MODEL' to the database 2024-10-05 12:53:38 INFO [open_webui.env] Saving 'AUDIO_TTS_VOICE' to the database 2024-10-05 12:53:38 INFO [open_webui.env] Saving 'AUDIO_TTS_SPLIT_ON' to the database 2024-10-05 12:53:38 INFO [open_webui.env] Saving 'AUDIO_TTS_AZURE_SPEECH_REGION' to the database 2024-10-05 12:53:38 INFO [open_webui.env] Saving 'AUDIO_TTS_AZURE_SPEECH_OUTPUT_FORMAT' to the database 2024-10-05 12:53:38 INFO [open_webui.env] Saving 'AUDIO_STT_OPENAI_API_BASE_URL' to the database 2024-10-05 12:53:38 INFO [open_webui.env] Saving 'AUDIO_STT_OPENAI_API_KEY' to the database 2024-10-05 12:53:38 INFO [open_webui.env] Saving 'AUDIO_STT_ENGINE' to the database 2024-10-05 12:53:38 INFO [open_webui.env] Saving 'AUDIO_STT_MODEL' to the database 2024-10-05 12:53:38 INFO: 192.168.65.1:60604 - "POST /audio/api/v1/config/update HTTP/1.1" 200 OK 2024-10-05 12:53:38 INFO: 192.168.65.1:60604 - "GET /api/config HTTP/1.1" 200 OK 2024-10-05 12:53:43 INFO: 192.168.65.1:60604 - "GET /api/v1/chats/1340572e-aced-4f80-a47f-c81da09b9934 HTTP/1.1" 200 OK 2024-10-05 12:53:43 INFO: 192.168.65.1:60604 - "GET /api/v1/chats/1340572e-aced-4f80-a47f-c81da09b9934/tags HTTP/1.1" 200 OK 2024-10-05 12:53:43 INFO: 192.168.65.1:60604 - "GET /api/v1/users/user/settings HTTP/1.1" 200 OK 2024-10-05 12:53:43 INFO: 192.168.65.1:60604 - "GET /ollama/api/version HTTP/1.1" 200 OK 2024-10-05 12:53:43 INFO: 192.168.65.1:46131 - "GET /static/favicon.png HTTP/1.1" 304 Not Modified 2024-10-05 12:54:01 INFO [open_webui.apps.audio.main] file.content_type: audio/wav 2024-10-05 12:54:02 INFO [open_webui.apps.audio.main] Detected language 'en' with probability 0.994098 2024-10-05 12:54:04 68d59def-c699-481a-a7f4-597b5280dc88.wav 2024-10-05 12:54:04 {'text': 'Yes, the speech to text is working, but not the text to speech.'} 2024-10-05 12:54:04 INFO: 192.168.65.1:35608 - "POST /audio/api/v1/transcriptions HTTP/1.1" 200 OK 2024-10-05 12:54:04 INFO: 192.168.65.1:35608 - "POST /api/chat/completions HTTP/1.1" 200 OK 2024-10-05 12:54:05 INFO: 192.168.65.1:29340 - "POST /audio/api/v1/speech HTTP/1.1" 400 Bad Request 2024-10-05 12:54:05 INFO: 192.168.65.1:29340 - "POST /audio/api/v1/speech HTTP/1.1" 400 Bad Request 2024-10-05 12:54:06 INFO: 192.168.65.1:29340 - "POST /audio/api/v1/speech HTTP/1.1" 400 Bad Request 2024-10-05 12:54:06 INFO: 192.168.65.1:29340 - "POST /audio/api/v1/speech HTTP/1.1" 400 Bad Request 2024-10-05 12:54:06 INFO: 192.168.65.1:29340 - "POST /audio/api/v1/speech HTTP/1.1" 400 Bad Request 2024-10-05 12:54:07 INFO: 192.168.65.1:29340 - "POST /audio/api/v1/speech HTTP/1.1" 400 Bad Request 2024-10-05 12:54:07 INFO: 192.168.65.1:29340 - "POST /audio/api/v1/speech HTTP/1.1" 400 Bad Request 2024-10-05 12:54:08 INFO: 192.168.65.1:35608 - "POST /api/v1/chats/1340572e-aced-4f80-a47f-c81da09b9934 HTTP/1.1" 200 OK 2024-10-05 12:54:08 INFO: 192.168.65.1:35608 - "GET /api/v1/chats/?page=1 HTTP/1.1" 200 OK 2024-10-05 12:54:08 INFO: 192.168.65.1:35608 - "POST /api/chat/completed HTTP/1.1" 200 OK 2024-10-05 12:54:08 INFO: 192.168.65.1:35608 - "POST /api/v1/chats/1340572e-aced-4f80-a47f-c81da09b9934 HTTP/1.1" 200 OK 2024-10-05 12:54:08 INFO: 192.168.65.1:35608 - "GET /api/v1/chats/?page=1 HTTP/1.1" 200 OK 2024-10-05 12:54:08 INFO: 192.168.65.1:29340 - "GET /api/v1/chats/?page=1 HTTP/1.1" 200 OK 2024-10-05 12:54:08 INFO: 192.168.65.1:35608 - "POST /audio/api/v1/speech HTTP/1.1" 400 Bad Request

GiteaMirror closed this issue

2026-05-05 13:59:42 -05:00

Sign in to join this conversation.

Branches Tags

1 Participants

Notifications

Due Date

No due date set.

Dependencies

No dependencies set.

Reference: github-starred/open-webui#52838