mirror of
https://github.com/open-webui/open-webui.git
synced 2026-05-07 03:18:23 -05:00
[GH-ISSUE #1527] Whitelist Connections, or Whitelist Models on specific Connection #28066
Reference in New Issue
Block a user
Delete Branch "%!s()"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
Originally created by @lewismacnow on GitHub (Apr 12, 2024).
Original GitHub issue: https://github.com/open-webui/open-webui/issues/1527
Is your feature request related to a problem? Please describe.
Background:
I have multiple connections pointed at a single host.

Connection 1 is a load balancer rotating through 5 Ollama instance.
Connection 2 is pointed directly at 1 Ollama instance.
Problem:
I need to force users to utilise the load balancer on Connection 1 AND restrict model usage.
Because Connection 1 and 2 both use the same Model library (all instances are on the same host), white listing models does not restrict connections so the result we may end up with is user roating through Ollama instances, unloading the model in memory from the previous interaction so they can use the one they have selected.
Load balancing across multiple connections 'can' be useful, but can also cause unintended and irritating delays in the current way it is implemented.
The loading/unloading process adds seconds to the response so if users are activiely using multiple models which are shared bewteen connections, we will potentially add time to responses instead of benefiting from load balancing over multiple instances.
The solution here I believe is to prevent unloading the model uncessesarily so not to impact the user experience, by allowing restrictions on models per connection.
Describe the solution you'd like
One of:
The aim here is to prevent unloading the model uncessesarily so not to impact the user experience.
Another possibility, perhaps a more flexible approach, a model is restricted to one connection:
In this example, although Mistral is available on Connection 2, the user will never use it and therefore not unload Command-R from Ollama 1. This is beneficial for Users of Command-R, as they will
Another possible use case:
3 Connections to 3 different Ollama Instanced, but all which own the same models.
1 Connection is used exclusively for Model A
The other 2 Connections Load balance between the other Ollama instanced for Model B
Describe alternatives you've considered
Whitelisting models as it currently is while effective is limited in this example.
Additional context
Nothing to add.
@tjbck commented on GitHub (Apr 12, 2024):
Related #1081, Let's continue our discussion there.