[GH-ISSUE #24162] issue: STT (Whisper Local): MediaRecorder never starts in regular browser windows, works in private/incognito #58882

Open
opened 2026-05-06 00:20:22 -05:00 by GiteaMirror · 1 comment
Owner

Originally created by @tropicflite on GitHub (Apr 26, 2026).
Original GitHub issue: https://github.com/open-webui/open-webui/issues/24162

Check Existing Issues

  • I have searched for any existing and/or related issues.
  • I have searched for any existing and/or related discussions.
  • I have also searched in the CLOSED issues AND CLOSED discussions and found no related items (your issue might already be addressed on the development branch!).
  • I am using the latest version of Open WebUI.

Installation Method

Docker

Open WebUI Version

v0.9.2

Ollama Version (if applicable)

Ollama Version: N/A (using external Ollama)

Operating System

Operating System: Windows 11

Browser (if applicable)

Browser: Firefox 150, Microsoft Edge

Confirmation

  • I have read and followed all instructions in README.md.
  • I am using the latest version of both Open WebUI and Ollama.
  • I have included the browser console logs.
  • I have included the Docker container logs.
  • I have provided every relevant configuration, setting, and environment variable used in my setup.
  • I have clearly listed every relevant configuration, custom setting, environment variable, and command-line option that influences my setup (such as Docker Compose overrides, .env values, browser settings, authentication configurations, etc).
  • I have documented step-by-step reproduction instructions that are precise, sequential, and leave nothing to interpretation. My steps:
  • Start with the initial platform/version/OS and dependencies used,
  • Specify exact install/launch/configure commands,
  • List URLs visited, user input (incl. example values/emails/passwords if needed),
  • Describe all options and toggles enabled or changed,
  • Include any files or environmental changes,
  • Identify the expected and actual result at each stage,
  • Ensure any reasonably skilled user can follow and hit the same issue.

Expected Behavior

Expected Behavior: STT records audio and sends to Whisper backend

Actual Behavior

Actual Behavior: Mic dialog opens, shows "Listening..." but waveform is static, no POST ever reaches the server. Works in Firefox private window and Android.

Steps to Reproduce

Steps to Reproduce: Open in regular Firefox/Edge → click mic → nothing. Open in Firefox private → works.

Logs & Screenshots

Logs: No errors in container logs when attempted from desktop. Console shows "Invalid URI. Load of media resource failed" on page load (TTS audio element with empty src).

Additional Information

Additional Info: getUserMedia succeeds (MIC OK). MediaRecorder.isTypeSupported('audio/webm;codecs=opus') = true. No audio element with srcObject is ever created when mic is clicked. Tried clearing all browser storage, permissions.sqlite, disabling extensions — no change.

Originally created by @tropicflite on GitHub (Apr 26, 2026). Original GitHub issue: https://github.com/open-webui/open-webui/issues/24162 ### Check Existing Issues - [x] I have searched for any existing and/or related issues. - [x] I have searched for any existing and/or related discussions. - [x] I have also searched in the CLOSED issues AND CLOSED discussions and found no related items (your issue might already be addressed on the development branch!). - [x] I am using the latest version of Open WebUI. ### Installation Method Docker ### Open WebUI Version v0.9.2 ### Ollama Version (if applicable) Ollama Version: N/A (using external Ollama) ### Operating System Operating System: Windows 11 ### Browser (if applicable) Browser: Firefox 150, Microsoft Edge ### Confirmation - [x] I have read and followed all instructions in `README.md`. - [x] I am using the latest version of **both** Open WebUI and Ollama. - [x] I have included the browser console logs. - [x] I have included the Docker container logs. - [x] I have **provided every relevant configuration, setting, and environment variable used in my setup.** - [x] I have clearly **listed every relevant configuration, custom setting, environment variable, and command-line option that influences my setup** (such as Docker Compose overrides, .env values, browser settings, authentication configurations, etc). - [x] I have documented **step-by-step reproduction instructions that are precise, sequential, and leave nothing to interpretation**. My steps: - Start with the initial platform/version/OS and dependencies used, - Specify exact install/launch/configure commands, - List URLs visited, user input (incl. example values/emails/passwords if needed), - Describe all options and toggles enabled or changed, - Include any files or environmental changes, - Identify the expected and actual result at each stage, - Ensure any reasonably skilled user can follow and hit the same issue. ### Expected Behavior Expected Behavior: STT records audio and sends to Whisper backend ### Actual Behavior Actual Behavior: Mic dialog opens, shows "Listening..." but waveform is static, no POST ever reaches the server. Works in Firefox private window and Android. ### Steps to Reproduce Steps to Reproduce: Open in regular Firefox/Edge → click mic → nothing. Open in Firefox private → works. ### Logs & Screenshots Logs: No errors in container logs when attempted from desktop. Console shows "Invalid URI. Load of media resource failed" on page load (TTS audio element with empty src). ### Additional Information Additional Info: getUserMedia succeeds (MIC OK). MediaRecorder.isTypeSupported('audio/webm;codecs=opus') = true. No audio element with srcObject is ever created when mic is clicked. Tried clearing all browser storage, permissions.sqlite, disabling extensions — no change.
GiteaMirror added the bug label 2026-05-06 00:20:22 -05:00
Author
Owner

@tropicflite commented on GitHub (Apr 28, 2026):

Additional findings from further testing:

  • STT broken in regular Firefox (non-private) — mic permission is granted, "Listening..." UI appears and waveform activates, but no audio is captured and nothing is ever POSTed to the server (confirmed via server logs). Works correctly in Firefox private window.

  • Console shows "Invalid URI. Load of media resource failed." before recording is even attempted — appears to be a TTS playback issue (notification sound?) unrelated to STT but present in regular mode only.

  • Workaround found: adding audio/wav to supported MIME types in Admin Panel → Audio fixes STT on Android (phone was sending audio/wav, not in the default MIME list).

<!-- gh-comment-id:4332247756 --> @tropicflite commented on GitHub (Apr 28, 2026): **Additional findings from further testing:** - STT broken in regular Firefox (non-private) — mic permission is granted, "Listening..." UI appears and waveform activates, but no audio is captured and nothing is ever POSTed to the server (confirmed via server logs). Works correctly in Firefox private window. - Console shows "Invalid URI. Load of media resource failed." before recording is even attempted — appears to be a TTS playback issue (notification sound?) unrelated to STT but present in regular mode only. - Workaround found: adding `audio/wav` to supported MIME types in Admin Panel → Audio fixes STT on Android (phone was sending `audio/wav`, not in the default MIME list).
Sign in to join this conversation.
1 Participants
Notifications
Due Date
No due date set.
Dependencies

No dependencies set.

Reference: github-starred/open-webui#58882