[GH-ISSUE #5921] TTS not working for Call #52838

Closed
opened 2026-05-05 13:59:41 -05:00 by GiteaMirror · 0 comments

Originally created by @beddows on GitHub (Oct 5, 2024).
Original GitHub issue: https://github.com/open-webui/open-webui/issues/5921

Bug Report

Installation Method

Docker, for open-webui and pipelines

Environment

  • Open WebUI Version: v0.3.30

  • Operating System: macOS Sonoma 14.5

  • Browser (if applicable): Chrome 129.0

Confirmation:

  • I have read and followed all the instructions provided in the README.md.
  • I am on the latest version of both Open WebUI and Ollama.
  • I have included the browser console logs.
  • I have included the Docker container logs.
  • I have provided the exact steps to reproduce the bug in the "Steps to Reproduce" section below.

Expected Behavior:

When using Call, the TTS should work.

Actual Behavior:

TTS works only with Record Voice. With Call, it hangs during the response.

Description

Bug Summary:

  • using Claude via Pipelines, no issues
  • STT works well for both Record Voice and Call, no issues
  • TTS works well when using Record Voice, no errors
  • TTS using Call fails with the error "400 Bad Request. Invalid voice id"

Reproduction Details

Steps to Reproduce:

  • go to admin panel, settings, audio
  • under TTS settings, select Eleven Labs
  • enter API key
  • select TTS voice and model via drop-downs
  • click Save
  • go to new chat, click Call

Logs and Screenshots

Browser Console Logs:
POST http://localhost:3000/audio/api/v1/speech 400 (Bad Request)
CallOverlay.svelte:457 Invalid voice id
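
The failing request from the console log above could be re-issued outside the browser to check whether the 400 depends on the voice id being sent. Below is a minimal sketch, not the project's own client code. The JSON body shape (`input`, `voice`, `model`), the bearer-token header, and the helper names `build_speech_request` and `send` are all assumptions made for illustration; only the URL path comes from the log.

```python
import json
import urllib.error
import urllib.request


def build_speech_request(base_url, token, text, voice, model):
    """Build (but do not send) a POST to the speech endpoint seen in the console log."""
    # Assumption: the endpoint accepts an OpenAI-style JSON body with these keys.
    payload = {"input": text, "voice": voice, "model": model}
    return urllib.request.Request(
        url=f"{base_url.rstrip('/')}/audio/api/v1/speech",
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            # Assumption: the API is authenticated with a bearer token.
            "Authorization": f"Bearer {token}",
        },
        method="POST",
    )


def send(request):
    """Send the request and return (status_code, body_bytes), including for HTTP errors."""
    try:
        with urllib.request.urlopen(request) as response:
            return response.status, response.read()
    except urllib.error.HTTPError as err:
        # A 4xx/5xx reply lands here; the body may carry the error detail.
        return err.code, err.read()
```

Calling `send(build_speech_request("http://localhost:3000", token, "Hello", voice_id, model_id))` once with the voice id chosen in the admin panel and once with whatever voice id the Call overlay sends would show whether the two differ.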

Docker Container Logs:
2024-10-05 12:50:57 INFO: 192.168.65.1:54184 - "GET /api/v1/auths/admin/config HTTP/1.1" 200 OK
2024-10-05 12:51:00 INFO: 192.168.65.1:54184 - "GET /audio/api/v1/config HTTP/1.1" 200 OK
2024-10-05 12:51:00 INFO: 192.168.65.1:54184 - "GET /audio/api/v1/voices HTTP/1.1" 200 OK
2024-10-05 12:51:00 INFO: 192.168.65.1:54184 - "GET /audio/api/v1/models HTTP/1.1" 200 OK
2024-10-05 12:53:38 INFO [open_webui.env] Saving 'AUDIO_TTS_OPENAI_API_BASE_URL' to the database
2024-10-05 12:53:38 INFO [open_webui.env] Saving 'AUDIO_TTS_OPENAI_API_KEY' to the database
2024-10-05 12:53:38 INFO [open_webui.env] Saving 'AUDIO_TTS_API_KEY' to the database
2024-10-05 12:53:38 INFO [open_webui.env] Saving 'AUDIO_TTS_ENGINE' to the database
2024-10-05 12:53:38 INFO [open_webui.env] Saving 'AUDIO_TTS_MODEL' to the database
2024-10-05 12:53:38 INFO [open_webui.env] Saving 'AUDIO_TTS_VOICE' to the database
2024-10-05 12:53:38 INFO [open_webui.env] Saving 'AUDIO_TTS_SPLIT_ON' to the database
2024-10-05 12:53:38 INFO [open_webui.env] Saving 'AUDIO_TTS_AZURE_SPEECH_REGION' to the database
2024-10-05 12:53:38 INFO [open_webui.env] Saving 'AUDIO_TTS_AZURE_SPEECH_OUTPUT_FORMAT' to the database
2024-10-05 12:53:38 INFO [open_webui.env] Saving 'AUDIO_STT_OPENAI_API_BASE_URL' to the database
2024-10-05 12:53:38 INFO [open_webui.env] Saving 'AUDIO_STT_OPENAI_API_KEY' to the database
2024-10-05 12:53:38 INFO [open_webui.env] Saving 'AUDIO_STT_ENGINE' to the database
2024-10-05 12:53:38 INFO [open_webui.env] Saving 'AUDIO_STT_MODEL' to the database
2024-10-05 12:53:38 INFO: 192.168.65.1:60604 - "POST /audio/api/v1/config/update HTTP/1.1" 200 OK
2024-10-05 12:53:38 INFO: 192.168.65.1:60604 - "GET /api/config HTTP/1.1" 200 OK
2024-10-05 12:53:43 INFO: 192.168.65.1:60604 - "GET /api/v1/chats/1340572e-aced-4f80-a47f-c81da09b9934 HTTP/1.1" 200 OK
2024-10-05 12:53:43 INFO: 192.168.65.1:60604 - "GET /api/v1/chats/1340572e-aced-4f80-a47f-c81da09b9934/tags HTTP/1.1" 200 OK
2024-10-05 12:53:43 INFO: 192.168.65.1:60604 - "GET /api/v1/users/user/settings HTTP/1.1" 200 OK
2024-10-05 12:53:43 INFO: 192.168.65.1:60604 - "GET /ollama/api/version HTTP/1.1" 200 OK
2024-10-05 12:53:43 INFO: 192.168.65.1:46131 - "GET /static/favicon.png HTTP/1.1" 304 Not Modified
2024-10-05 12:54:01 INFO [open_webui.apps.audio.main] file.content_type: audio/wav
2024-10-05 12:54:02 INFO [open_webui.apps.audio.main] Detected language 'en' with probability 0.994098
2024-10-05 12:54:04 68d59def-c699-481a-a7f4-597b5280dc88.wav
2024-10-05 12:54:04 {'text': 'Yes, the speech to text is working, but not the text to speech.'}
2024-10-05 12:54:04 INFO: 192.168.65.1:35608 - "POST /audio/api/v1/transcriptions HTTP/1.1" 200 OK
2024-10-05 12:54:04 INFO: 192.168.65.1:35608 - "POST /api/chat/completions HTTP/1.1" 200 OK
2024-10-05 12:54:05 INFO: 192.168.65.1:29340 - "POST /audio/api/v1/speech HTTP/1.1" 400 Bad Request
2024-10-05 12:54:05 INFO: 192.168.65.1:29340 - "POST /audio/api/v1/speech HTTP/1.1" 400 Bad Request
2024-10-05 12:54:06 INFO: 192.168.65.1:29340 - "POST /audio/api/v1/speech HTTP/1.1" 400 Bad Request
2024-10-05 12:54:06 INFO: 192.168.65.1:29340 - "POST /audio/api/v1/speech HTTP/1.1" 400 Bad Request
2024-10-05 12:54:06 INFO: 192.168.65.1:29340 - "POST /audio/api/v1/speech HTTP/1.1" 400 Bad Request
2024-10-05 12:54:07 INFO: 192.168.65.1:29340 - "POST /audio/api/v1/speech HTTP/1.1" 400 Bad Request
2024-10-05 12:54:07 INFO: 192.168.65.1:29340 - "POST /audio/api/v1/speech HTTP/1.1" 400 Bad Request
2024-10-05 12:54:08 INFO: 192.168.65.1:35608 - "POST /api/v1/chats/1340572e-aced-4f80-a47f-c81da09b9934 HTTP/1.1" 200 OK
2024-10-05 12:54:08 INFO: 192.168.65.1:35608 - "GET /api/v1/chats/?page=1 HTTP/1.1" 200 OK
2024-10-05 12:54:08 INFO: 192.168.65.1:35608 - "POST /api/chat/completed HTTP/1.1" 200 OK
2024-10-05 12:54:08 INFO: 192.168.65.1:35608 - "POST /api/v1/chats/1340572e-aced-4f80-a47f-c81da09b9934 HTTP/1.1" 200 OK
2024-10-05 12:54:08 INFO: 192.168.65.1:35608 - "GET /api/v1/chats/?page=1 HTTP/1.1" 200 OK
2024-10-05 12:54:08 INFO: 192.168.65.1:29340 - "GET /api/v1/chats/?page=1 HTTP/1.1" 200 OK
2024-10-05 12:54:08 INFO: 192.168.65.1:35608 - "POST /audio/api/v1/speech HTTP/1.1" 400 Bad Request
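
To make the failure pattern in the container logs above easier to see, the access-log lines can be tallied by method, path, and status. This is a rough sketch that assumes the line layout shown in the logs; the regular expression and the `tally` helper are illustrative, not part of Open WebUI.

```python
import re
from collections import Counter

# Matches the quoted request and the status code in an access-log line, e.g.
#   "POST /audio/api/v1/speech HTTP/1.1" 400 Bad Request
ACCESS_RE = re.compile(
    r'"(?P<method>[A-Z]+) (?P<path>\S+) HTTP/[\d.]+" (?P<status>\d{3})'
)


def tally(lines):
    """Count access-log entries per (method, path, status); other lines are skipped."""
    counts = Counter()
    for line in lines:
        match = ACCESS_RE.search(line)
        if match:
            key = (match["method"], match["path"], int(match["status"]))
            counts[key] += 1
    return counts


# Three lines copied from the logs above, used as sample input.
sample = [
    '2024-10-05 12:54:04 INFO: 192.168.65.1:35608 - "POST /audio/api/v1/transcriptions HTTP/1.1" 200 OK',
    '2024-10-05 12:54:05 INFO: 192.168.65.1:29340 - "POST /audio/api/v1/speech HTTP/1.1" 400 Bad Request',
    '2024-10-05 12:54:05 INFO: 192.168.65.1:29340 - "POST /audio/api/v1/speech HTTP/1.1" 400 Bad Request',
]

for (method, path, status), count in sorted(tally(sample).items()):
    print(f"{count:>3}  {status}  {method} {path}")
```

Run against the full log above, the same tally would show every `POST /audio/api/v1/speech` entry returning 400, while the transcription and chat completion requests return 200.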

Reference: github-starred/open-webui#52838