mirror of
https://github.com/open-webui/open-webui.git
synced 2026-05-06 02:48:13 -05:00
[GH-ISSUE #5921] TTS not working for Call #14172
Reference in New Issue
Block a user
Delete Branch "%!s()"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
Originally created by @beddows on GitHub (Oct 5, 2024).
Original GitHub issue: https://github.com/open-webui/open-webui/issues/5921
Bug Report
Installation Method
Docker, for open-webui and pipelines
Environment
Open WebUI Version: v0.3.30
Operating System: macOS Sonoma 14.5
Browser (if applicable): Chrome 129.0
Confirmation:
Expected Behavior:
When using Call, the TTS should work.
Actual Behavior:
TTS only works with Record Voice. Using Call hangs during the response.
Description
Bug Summary:
Reproduction Details
Steps to Reproduce:
Logs and Screenshots
Browser Console Logs:
POST http://localhost:3000/audio/api/v1/speech 400 (Bad Request)
CallOverlay.svelte:457 Invalid voice id
Docker Container Logs:
2024-10-05 12:50:57 INFO: 192.168.65.1:54184 - "GET /api/v1/auths/admin/config HTTP/1.1" 200 OK
2024-10-05 12:51:00 INFO: 192.168.65.1:54184 - "GET /audio/api/v1/config HTTP/1.1" 200 OK
2024-10-05 12:51:00 INFO: 192.168.65.1:54184 - "GET /audio/api/v1/voices HTTP/1.1" 200 OK
2024-10-05 12:51:00 INFO: 192.168.65.1:54184 - "GET /audio/api/v1/models HTTP/1.1" 200 OK
2024-10-05 12:53:38 INFO [open_webui.env] Saving 'AUDIO_TTS_OPENAI_API_BASE_URL' to the database
2024-10-05 12:53:38 INFO [open_webui.env] Saving 'AUDIO_TTS_OPENAI_API_KEY' to the database
2024-10-05 12:53:38 INFO [open_webui.env] Saving 'AUDIO_TTS_API_KEY' to the database
2024-10-05 12:53:38 INFO [open_webui.env] Saving 'AUDIO_TTS_ENGINE' to the database
2024-10-05 12:53:38 INFO [open_webui.env] Saving 'AUDIO_TTS_MODEL' to the database
2024-10-05 12:53:38 INFO [open_webui.env] Saving 'AUDIO_TTS_VOICE' to the database
2024-10-05 12:53:38 INFO [open_webui.env] Saving 'AUDIO_TTS_SPLIT_ON' to the database
2024-10-05 12:53:38 INFO [open_webui.env] Saving 'AUDIO_TTS_AZURE_SPEECH_REGION' to the database
2024-10-05 12:53:38 INFO [open_webui.env] Saving 'AUDIO_TTS_AZURE_SPEECH_OUTPUT_FORMAT' to the database
2024-10-05 12:53:38 INFO [open_webui.env] Saving 'AUDIO_STT_OPENAI_API_BASE_URL' to the database
2024-10-05 12:53:38 INFO [open_webui.env] Saving 'AUDIO_STT_OPENAI_API_KEY' to the database
2024-10-05 12:53:38 INFO [open_webui.env] Saving 'AUDIO_STT_ENGINE' to the database
2024-10-05 12:53:38 INFO [open_webui.env] Saving 'AUDIO_STT_MODEL' to the database
2024-10-05 12:53:38 INFO: 192.168.65.1:60604 - "POST /audio/api/v1/config/update HTTP/1.1" 200 OK
2024-10-05 12:53:38 INFO: 192.168.65.1:60604 - "GET /api/config HTTP/1.1" 200 OK
2024-10-05 12:53:43 INFO: 192.168.65.1:60604 - "GET /api/v1/chats/1340572e-aced-4f80-a47f-c81da09b9934 HTTP/1.1" 200 OK
2024-10-05 12:53:43 INFO: 192.168.65.1:60604 - "GET /api/v1/chats/1340572e-aced-4f80-a47f-c81da09b9934/tags HTTP/1.1" 200 OK
2024-10-05 12:53:43 INFO: 192.168.65.1:60604 - "GET /api/v1/users/user/settings HTTP/1.1" 200 OK
2024-10-05 12:53:43 INFO: 192.168.65.1:60604 - "GET /ollama/api/version HTTP/1.1" 200 OK
2024-10-05 12:53:43 INFO: 192.168.65.1:46131 - "GET /static/favicon.png HTTP/1.1" 304 Not Modified
2024-10-05 12:54:01 INFO [open_webui.apps.audio.main] file.content_type: audio/wav
2024-10-05 12:54:02 INFO [open_webui.apps.audio.main] Detected language 'en' with probability 0.994098
2024-10-05 12:54:04 68d59def-c699-481a-a7f4-597b5280dc88.wav
2024-10-05 12:54:04 {'text': 'Yes, the speech to text is working, but not the text to speech.'}
2024-10-05 12:54:04 INFO: 192.168.65.1:35608 - "POST /audio/api/v1/transcriptions HTTP/1.1" 200 OK
2024-10-05 12:54:04 INFO: 192.168.65.1:35608 - "POST /api/chat/completions HTTP/1.1" 200 OK
2024-10-05 12:54:05 INFO: 192.168.65.1:29340 - "POST /audio/api/v1/speech HTTP/1.1" 400 Bad Request
2024-10-05 12:54:05 INFO: 192.168.65.1:29340 - "POST /audio/api/v1/speech HTTP/1.1" 400 Bad Request
2024-10-05 12:54:06 INFO: 192.168.65.1:29340 - "POST /audio/api/v1/speech HTTP/1.1" 400 Bad Request
2024-10-05 12:54:06 INFO: 192.168.65.1:29340 - "POST /audio/api/v1/speech HTTP/1.1" 400 Bad Request
2024-10-05 12:54:06 INFO: 192.168.65.1:29340 - "POST /audio/api/v1/speech HTTP/1.1" 400 Bad Request
2024-10-05 12:54:07 INFO: 192.168.65.1:29340 - "POST /audio/api/v1/speech HTTP/1.1" 400 Bad Request
2024-10-05 12:54:07 INFO: 192.168.65.1:29340 - "POST /audio/api/v1/speech HTTP/1.1" 400 Bad Request
2024-10-05 12:54:08 INFO: 192.168.65.1:35608 - "POST /api/v1/chats/1340572e-aced-4f80-a47f-c81da09b9934 HTTP/1.1" 200 OK
2024-10-05 12:54:08 INFO: 192.168.65.1:35608 - "GET /api/v1/chats/?page=1 HTTP/1.1" 200 OK
2024-10-05 12:54:08 INFO: 192.168.65.1:35608 - "POST /api/chat/completed HTTP/1.1" 200 OK
2024-10-05 12:54:08 INFO: 192.168.65.1:35608 - "POST /api/v1/chats/1340572e-aced-4f80-a47f-c81da09b9934 HTTP/1.1" 200 OK
2024-10-05 12:54:08 INFO: 192.168.65.1:35608 - "GET /api/v1/chats/?page=1 HTTP/1.1" 200 OK
2024-10-05 12:54:08 INFO: 192.168.65.1:29340 - "GET /api/v1/chats/?page=1 HTTP/1.1" 200 OK
2024-10-05 12:54:08 INFO: 192.168.65.1:35608 - "POST /audio/api/v1/speech HTTP/1.1" 400 Bad Request