mirror of
https://github.com/open-webui/open-webui.git
synced 2026-05-07 03:18:23 -05:00
[GH-ISSUE #8784] Reranking not using GPU #30782
Originally created by @fireblade2534 on GitHub (Jan 23, 2025).
Original GitHub issue: https://github.com/open-webui/open-webui/issues/8784
Installation Method
Docker
Environment
Open WebUI Version: V0.5.6
Operating System: Ubuntu
Confirmation:
Expected Behavior:
Fast reranking performance
Actual Behavior:
Reranking does not appear to run on the GPU, even when one is available.
Description
Bug Summary:
The reranking model does not seem to use the GPU, which results in slow reranking.
Reproduction Details
Steps to Reproduce:
Add a reranking model and use nvidia-smi to check GPU memory usage. Running RAG with and without the reranker does not change the amount of GPU memory in use.
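
For anyone trying to reproduce this, the before/after comparison can be scripted instead of read off the nvidia-smi table by eye. The following is only a rough sketch, not part of Open WebUI: the helper names (`parse_memory_used`, `gpu_memory_used_mib`, `memory_delta`) are made up for illustration, and it assumes `nvidia-smi` is on the PATH wherever the script runs (for a Docker install, that likely means the host or a container started with GPU access).

```python
import shutil
import subprocess


def parse_memory_used(output: str) -> list[int]:
    """Parse one integer (MiB) per GPU from nvidia-smi CSV output.

    Lines that are not plain integers (for example "[N/A]") are skipped.
    """
    values = []
    for line in output.splitlines():
        line = line.strip()
        if line.isdigit():
            values.append(int(line))
    return values


def gpu_memory_used_mib() -> list[int]:
    """Return used memory in MiB for each visible GPU, or [] if nvidia-smi is missing."""
    if shutil.which("nvidia-smi") is None:
        return []
    result = subprocess.run(
        ["nvidia-smi", "--query-gpu=memory.used", "--format=csv,noheader,nounits"],
        capture_output=True,
        text=True,
        check=True,
    )
    return parse_memory_used(result.stdout)


def memory_delta(before: list[int], after: list[int]) -> list[int]:
    """Per-GPU change in used memory (after minus before), in MiB."""
    return [a - b for b, a in zip(before, after)]


# Hypothetical usage:
#   before = gpu_memory_used_mib()
#   ... run a RAG query with the reranker enabled ...
#   after = gpu_memory_used_mib()
#   print(memory_delta(before, after))
```

If the delta stays near zero with the reranker enabled, that matches the behavior described above. Other processes on the same GPU can also change the numbers, so treat a single reading as a hint rather than proof.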