[GH-ISSUE #7890] Pass audio files directly to LLM instead of passing a transcript #30450

Closed
opened 2026-04-25 04:39:30 -05:00 by GiteaMirror · 0 comments
Owner

Originally created by @Simon-Stone on GitHub (Dec 16, 2024).
Original GitHub issue: https://github.com/open-webui/open-webui/issues/7890

Is your feature request related to a problem? Please describe.
With multimodal LLMs that can process audio now widely available, it would be very helpful to pass an audio file directly to the model, similar to how images can now be sent as part of a message.

Describe the solution you'd like
When uploading an audio file, a dialog popup should ask whether the file should be transcribed to text or passed as context directly.

Originally created by @Simon-Stone on GitHub (Dec 16, 2024). Original GitHub issue: https://github.com/open-webui/open-webui/issues/7890 **Is your feature request related to a problem? Please describe.** With multimodal LLMs that can process audio now widely available, it would be very helpful to pass an audio file directly to the model, similar to how images can now be sent as part of a message. **Describe the solution you'd like** When uploading an audio file, a dialog popup should ask whether the file should be transcribed to text or passed as context directly.
Sign in to join this conversation.
1 Participants
Notifications
Due Date
No due date set.
Dependencies

No dependencies set.

Reference: github-starred/open-webui#30450