[GH-ISSUE #14261] issue: Video Files Not Processed For Transcription #55861

Closed
opened 2026-05-05 18:10:49 -05:00 by GiteaMirror · 6 comments

Originally created by @NoMoreFood on GitHub (May 23, 2025).
Original GitHub issue: https://github.com/open-webui/open-webui/issues/14261

Originally assigned to: @jackthgu on GitHub.

Check Existing Issues

  • I have searched the existing issues and discussions.
  • I am using the latest version of Open WebUI.

Installation Method

Docker

Open WebUI Version

v0.6.10

Ollama Version (if applicable)

No response

Operating System

Windows 11

Browser (if applicable)

No response

Confirmation

  • I have read and followed all instructions in README.md.
  • I am using the latest version of both Open WebUI and Ollama.
  • I have included the browser console logs. (N/A)
  • I have included the Docker container logs. (N/A)
  • I have listed steps to reproduce the bug in detail.

Expected Behavior

MP4 files should be sent to the transcription services (STT) like WAV files are.

Actual Behavior

Nothing happens. The console logs report the MIME type but show no transcription activity.

Steps to Reproduce

  1. Start a chat
  2. Upload an MP4 file
  3. Observe logs

Logs & Screenshots

The console reports the following:

open-webui | 2025-05-23 23:33:00.852 | INFO | open_webui.routers.files:upload_file:91 - file.content_type: video/mp4 - {}
open-webui | 2025-05-23 23:33:00.947 | INFO | uvicorn.protocols.http.httptools_impl:send:476 - 172.18.0.1:47372 - "POST /api/v1/files/ HTTP/1.1" 200 - {}

This is what the UI looks like after the file is transmitted:

Image: https://github.com/user-attachments/assets/2bd7b63d-a244-4e23-b574-ebd2431cec83

Additional Information

I believe I see where this issue is occurring: https://github.com/open-webui/open-webui/blob/82716f3789147585862f56f9f18fb2a77d92ed39/backend/open_webui/routers/files.py#L143. Should we make this list customizable, with a setting that holds a delimited list of MIME types for which transcription is forced on?
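Editorial sketch of the proposal above, not code from Open WebUI: one way a delimited MIME type setting could be parsed and checked at upload time. The function names and the example setting value are hypothetical, and the wildcard matching via `fnmatchcase` is an assumption about how such a setting might behave.

```python
from fnmatch import fnmatchcase
from typing import List, Optional


def parse_mime_patterns(raw: str) -> List[str]:
    """Split a comma-delimited setting into lowercase MIME patterns."""
    return [part.strip().lower() for part in raw.split(",") if part.strip()]


def should_transcribe(content_type: Optional[str], patterns: List[str]) -> bool:
    """Return True if the upload's content type matches any configured pattern."""
    if not content_type:
        return False
    # Drop parameters such as "; codecs=opus" before matching.
    base = content_type.split(";", 1)[0].strip().lower()
    return any(fnmatchcase(base, pattern) for pattern in patterns)


# Hypothetical setting value: all audio types plus MP4 video.
patterns = parse_mime_patterns("audio/*, video/mp4")
print(should_transcribe("video/mp4", patterns))
print(should_transcribe("image/png", patterns))
```

With a setting like this, an operator could opt MP4 uploads into transcription without a code change, while the default could stay limited to audio types.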
GiteaMirror added the bug label 2026-05-05 18:10:49 -05:00

@arunbugkiller commented on GitHub (May 30, 2025):

@NoMoreFood Any update on this? Were you able to make any progress? We are also stuck here.


@NoMoreFood commented on GitHub (Jun 3, 2025):

@arunbugkiller Nothing yet. I was thinking that maybe the code should call something like a supports_transcription() function that would make the decision based on the STT provider (providers could support different MIME types). I was hoping for input from others, though.
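Editorial sketch of the idea in this comment, not code from Open WebUI: a `supports_transcription()` helper that looks up MIME patterns per STT provider. The provider names and their pattern lists are purely illustrative assumptions; real providers and their supported formats would have to be checked against each provider's documentation.

```python
from fnmatch import fnmatchcase

# Illustrative provider names and MIME patterns only (assumptions).
PROVIDER_MIME_PATTERNS = {
    "local": ("audio/*",),
    "remote": ("audio/*", "video/mp4", "video/webm"),
}


def supports_transcription(provider: str, content_type: str) -> bool:
    """Return True if the given provider is configured to accept this MIME type."""
    patterns = PROVIDER_MIME_PATTERNS.get(provider, ())
    # Ignore parameters and case differences in the content type.
    base = content_type.split(";", 1)[0].strip().lower()
    return any(fnmatchcase(base, pattern) for pattern in patterns)


print(supports_transcription("remote", "video/mp4"))
print(supports_transcription("local", "video/mp4"))
```

An unknown provider falls back to an empty pattern list, so nothing is sent for transcription unless a provider explicitly opts in.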


@arunbugkiller commented on GitHub (Jun 4, 2025):

@tjbck It would be really great if you could shed some light here.


@tjbck commented on GitHub (Jun 8, 2025):

Agreed on customizable MIME types for STT processing, PR welcome!


@NoMoreFood commented on GitHub (Jun 9, 2025):

Alright, I'll put in something this week.


@tjbck commented on GitHub (Jun 16, 2025):

Thanks!


Reference: github-starred/open-webui#55861