[GH-ISSUE #2241] enh: better voice interactions #12808

GiteaMirror · 2026-04-19T19:40:40-05:00

GiteaMirror commented

2026-04-19 19:40:40 -05:00

Originally created by @tjbck on GitHub (May 13, 2024).
Original GitHub issue: https://github.com/open-webui/open-webui/issues/2241

Originally assigned to: @tjbck on GitHub.

- [x] voice message recording like iMessage
- [x] Siri-esque real-time voice interaction

GiteaMirror added the enhancement and core labels 2026-04-19 19:40:40 -05:00

GiteaMirror closed this issue

2026-04-19 19:40:41 -05:00

GiteaMirror commented

2026-04-19 19:40:43 -05:00

@IIPedro commented on GitHub (May 31, 2024):

Hello! I'm not sure whether any work has been done on this yet, but it would be interesting to see better voice interactions as part of pipelines. Of course, it should be the front-end's job to provide an easy-to-use, comfortable interface, but I believe both Siri-esque interaction and voice calls can be tackled in one go with a new pipeline function. Just as there's a pipe def in pipelines, it would be very useful to have a voice def that takes an audio buffer at a predetermined rate as input and returns another audio buffer, which would be the assistant's voice in the call. That would make voice interactions as versatile as the current chat pipelines and would allow various libraries and APIs to be supported in a standardized manner. Thanks!
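
For illustration, the buffer-in/buffer-out shape of the proposed voice def might look roughly like the sketch below. Everything in it is an assumption made for the example and not part of the Pipelines project: the `voice` method name and signature, the 16 kHz 16-bit mono PCM format, and the `gain` parameter are all hypothetical, and the body only echoes the input at reduced volume where a real pipeline would run speech-to-text, a model, and text-to-speech.

```python
from array import array

SAMPLE_RATE = 16_000  # Hz; assumed fixed rate agreed on in advance (hypothetical)


class Pipeline:
    """Hypothetical pipeline exposing a voice hook alongside the usual pipe hook."""

    def __init__(self, gain: float = 0.5):
        # gain is an illustrative parameter, not part of any real API
        self.gain = gain

    def voice(self, audio_in: bytes, sample_rate: int = SAMPLE_RATE) -> bytes:
        """Take 16-bit mono PCM in, return 16-bit mono PCM out.

        Placeholder logic: echo the caller's audio at reduced volume so the
        input and output buffer formats are easy to see.
        """
        if sample_rate != SAMPLE_RATE:
            raise ValueError(f"expected {SAMPLE_RATE} Hz audio, got {sample_rate} Hz")

        samples = array("h")          # signed 16-bit samples
        samples.frombytes(audio_in)   # raises ValueError on a partial sample

        # Scale each sample and clamp it to the signed 16-bit range.
        scaled = (max(-32768, min(32767, int(s * self.gain))) for s in samples)
        return array("h", scaled).tobytes()
```

A caller would pass each captured chunk to `Pipeline().voice(chunk)` and play back the returned bytes. Keeping the hook to plain `bytes` at a fixed rate is one way to stay independent of any particular speech library.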

GiteaMirror commented

2026-04-19 19:40:43 -05:00

@tjbck commented on GitHub (Jun 8, 2024):

Implemented on dev.

@IIPedro Great suggestions, I'll see what can be done!

GiteaMirror commented

2026-04-19 19:40:44 -05:00

@darkvertex commented on GitHub (Jun 18, 2024):

@tjbck Is it normal that the microphone stays in listening mode until the tab is closed, even after exiting Call mode? (Should I file a bug report for this?)

GiteaMirror commented

2026-04-19 19:40:45 -05:00

@justinh-rahb commented on GitHub (Jun 18, 2024):

I've noted this as well on Chrome/Mac and Chrome/Android.

GiteaMirror referenced this issue

2026-04-20 04:34:43 -05:00

[PR #12808] [CLOSED] build(deps): bump the npm_and_yarn group across 1 directory with 6 updates #23021

GiteaMirror referenced this issue

2026-04-25 11:34:21 -05:00

[PR #12808] [CLOSED] build(deps): bump the npm_and_yarn group across 1 directory with 6 updates #38651

GiteaMirror referenced this issue