[GH-ISSUE #9227] STT API calls do not recognize or accept audio files. #15427

Closed
opened 2026-04-19 21:37:48 -05:00 by GiteaMirror · 0 comments
Owner

Originally created by @ckraushaar on GitHub (Feb 2, 2025).
Original GitHub issue: https://github.com/open-webui/open-webui/issues/9227

Bug Report

Important Notes

  • Before submitting a bug report: Please check the Issues or Discussions section to see if a similar issue or feature request has already been posted. It's likely we're already tracking it! If you’re unsure, start a discussion post first. This will help us efficiently focus on improving the project.

  • Collaborate respectfully: We value a constructive attitude, so please be mindful of your communication. If negativity is part of your approach, our capacity to engage may be limited. We’re here to help if you’re open to learning and communicating positively. Remember, Open WebUI is a volunteer-driven project managed by a single maintainer and supported by contributors who also have full-time jobs. We appreciate your time and ask that you respect ours.

  • Contributing: If you encounter an issue, we highly encourage you to submit a pull request or fork the project. We actively work to prevent contributor burnout to maintain the quality and continuity of Open WebUI.

  • Bug reproducibility: If a bug cannot be reproduced with a :main or :dev Docker setup, or a pip install with Python 3.11, it may require additional help from the community. In such cases, we will move it to the "issues" Discussions section due to our limited resources. We encourage the community to assist with these issues. Remember, it’s not that the issue doesn’t exist; we need your help!

Note: Please remove the notes above when submitting your post. Thank you for your understanding and support!


Installation Method

Docker Installation

Environment

  • Open WebUI Version: v0.5.7

  • LocalAI: v2.25.0

  • Operating System: Rocky Linux 9.4

  • Browser (if applicable): [e.g., Chrome 100.0, Firefox 98.0]

Confirmation:

  • I have read and followed all the instructions provided in the README.md.
  • I am on the latest version of both Open WebUI and LocalAI.
  • I have included the browser console logs.
  • I have included the Docker container logs.
  • I have provided the exact steps to reproduce the bug in the "Steps to Reproduce" section below.

Expected Behavior:

Upload a webm, wav, mp3 file from curl cli command or php curl function and receive a JSON string of the transcribed text

Actual Behavior:

Returned JSON string when passing in an audio file:
{"detail":"Oops! It seems like the file format you're trying to upload is not supported. Please upload a file with a supported format (e.g., JPG, PNG, PDF, TXT) and try again."}

Description

Bug Summary:
When uploading an audio file using curl to the http://openwebui:port/api/v1/audio/transcriptions url it returns that the file is unrecognizable. I have tried .mp3, .wav and .webm, all with the same results.

Open WebUI website works for both TTS and STT calls using OpenAI settings directing to the local LocalAI Docker launched from the same docker-compose.yaml

I can pass the same file directly to LocalAI http://localai:port/v1/audio/transcriptions and receive the expected JSON string.

Appears to be an error on the file check when you receive a file over the api call. It seems it may only check for images or documents based on the error.

Reproduction Details

Steps to Reproduce:

https://github.com/user-attachments/assets/3db60492-062d-4c2f-bde1-4057de42a8e6

Logs and Screenshots

Browser Console Logs:
[Include relevant browser console logs, if applicable]

Docker Container Logs:
[Include relevant Docker container logs, if applicable]

Screenshots/Screen Recordings (if applicable):
[Attach any relevant screenshots to help illustrate the issue]

Additional Information

[Include any additional details that may help in understanding and reproducing the issue. This could include specific configurations, error messages, or anything else relevant to the bug.]

Note

If the bug report is incomplete or does not follow the provided instructions, it may not be addressed. Please ensure that you have followed the steps outlined in the README.md and troubleshooting.md documents, and provide all necessary information for us to reproduce and address the issue. Thank you!

Originally created by @ckraushaar on GitHub (Feb 2, 2025). Original GitHub issue: https://github.com/open-webui/open-webui/issues/9227 # Bug Report ## Important Notes - **Before submitting a bug report**: Please check the Issues or Discussions section to see if a similar issue or feature request has already been posted. It's likely we're already tracking it! If you’re unsure, start a discussion post first. This will help us efficiently focus on improving the project. - **Collaborate respectfully**: We value a constructive attitude, so please be mindful of your communication. If negativity is part of your approach, our capacity to engage may be limited. We’re here to help if you’re open to learning and communicating positively. Remember, Open WebUI is a volunteer-driven project managed by a single maintainer and supported by contributors who also have full-time jobs. We appreciate your time and ask that you respect ours. - **Contributing**: If you encounter an issue, we highly encourage you to submit a pull request or fork the project. We actively work to prevent contributor burnout to maintain the quality and continuity of Open WebUI. - **Bug reproducibility**: If a bug cannot be reproduced with a `:main` or `:dev` Docker setup, or a pip install with Python 3.11, it may require additional help from the community. In such cases, we will move it to the "issues" Discussions section due to our limited resources. We encourage the community to assist with these issues. Remember, it’s not that the issue doesn’t exist; we need your help! Note: Please remove the notes above when submitting your post. Thank you for your understanding and support! --- ## Installation Method Docker Installation ## Environment - **Open WebUI Version**: v0.5.7 - **LocalAI**: v2.25.0 - **Operating System:** Rocky Linux 9.4 - **Browser (if applicable):** [e.g., Chrome 100.0, Firefox 98.0] **Confirmation:** - [x] I have read and followed all the instructions provided in the README.md. - [x] I am on the latest version of both Open WebUI and LocalAI. - [ ] I have included the browser console logs. - [ ] I have included the Docker container logs. - [x] I have provided the exact steps to reproduce the bug in the "Steps to Reproduce" section below. ## Expected Behavior: Upload a webm, wav, mp3 file from curl cli command or php curl function and receive a JSON string of the transcribed text ## Actual Behavior: Returned JSON string when passing in an audio file: {"detail":"Oops! It seems like the file format you're trying to upload is not supported. Please upload a file with a supported format (e.g., JPG, PNG, PDF, TXT) and try again."} ## Description **Bug Summary:** When uploading an audio file using curl to the http://openwebui:port/api/v1/audio/transcriptions url it returns that the file is unrecognizable. I have tried .mp3, .wav and .webm, all with the same results. Open WebUI website works for both TTS and STT calls using OpenAI settings directing to the local LocalAI Docker launched from the same docker-compose.yaml I can pass the same file directly to LocalAI http://localai:port/v1/audio/transcriptions and receive the expected JSON string. Appears to be an error on the file check when you receive a file over the api call. It seems it may only check for images or documents based on the error. ## Reproduction Details **Steps to Reproduce:** - Create an audio file (wav, mp3, webm) - Send to Open WebUI with a curl command ex: curl http://openwebui:port/api/v1/audio/transcriptions -H "Authorization: Bearer key" -H "Content-Type: multipart/form-data" -F file="@test.webm" https://github.com/user-attachments/assets/3db60492-062d-4c2f-bde1-4057de42a8e6 ## Logs and Screenshots **Browser Console Logs:** [Include relevant browser console logs, if applicable] **Docker Container Logs:** [Include relevant Docker container logs, if applicable] **Screenshots/Screen Recordings (if applicable):** [Attach any relevant screenshots to help illustrate the issue] ## Additional Information [Include any additional details that may help in understanding and reproducing the issue. This could include specific configurations, error messages, or anything else relevant to the bug.] ## Note If the bug report is incomplete or does not follow the provided instructions, it may not be addressed. Please ensure that you have followed the steps outlined in the README.md and troubleshooting.md documents, and provide all necessary information for us to reproduce and address the issue. Thank you!
Sign in to join this conversation.
1 Participants
Notifications
Due Date
No due date set.
Dependencies

No dependencies set.

Reference: github-starred/open-webui#15427