mirror of
https://github.com/open-webui/open-webui.git
synced 2026-05-31 14:00:34 -05:00
Quantized Embeddings #1220
Reference in New Issue
Block a user
Delete Branch "%!s()"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
Originally created by @snadeem1362 on GitHub (Jun 12, 2024).
Is your feature request related to a problem? Please describe.
Hello... we are using open-webui RAG for our company documents. we have almost 500GBs of data. Our system is generating embeddings and it is resulting in huge size of chroma db file. it is too much time consuming and require HW as well.
we are also incorporating wikipedia articles. Is there any builtin solution available so that we dont have to regenerate embeddings?
At the moment all-minilm-L6-v2 is generating 12GB of charoma DB against 1GB of wikipedia text.
Describe the solution you'd like
Can we add a feature to enable Quantize Embeddings. Binary quantization reduces size and speedup the retreival process.