[GH-ISSUE #4806] Error with some embedding models with RAG (files not found) #13738

Closed
opened 2026-04-19 20:22:20 -05:00 by GiteaMirror · 0 comments
Owner

Originally created by @mikael1234 on GitHub (Aug 22, 2024).
Original GitHub issue: https://github.com/open-webui/open-webui/issues/4806

Discussed in https://github.com/open-webui/open-webui/discussions/4805

Originally posted by mikael1234 August 22, 2024
Some models, for example sentence-transformers/all-MiniLM-L6 work fine, but others, for example Alibaba-NLP/gte-multilingual-base result an error below. This model requires RAG_EMBEDDING_MODEL_TRUST_REMOTE_CODE='True', not sure if that is related. Im usingthe default engine (transformers).

2024-08-22T08:25:11.372218964Z INFO:apps.rag.main:Updating embedding model: Alibaba-NLP/gte-multilingual-base to Alibaba-NLP/gte-multilingual-base
2024-08-22T08:25:11.372239594Z INFO:config:Saving 'RAG_EMBEDDING_MODEL' to config.json
2024-08-22T08:25:11.372751339Z INFO:sentence_transformers.SentenceTransformer:Load pretrained SentenceTransformer: /app/backend/data/cache/embedding/models/models--Alibaba-NLP--gte-multilingual-base/snapshots/f7d567e1f2493bb0df9413965d144de9f15e7bab
2024-08-22T08:25:11.639432081Z ERROR:apps.rag.main:Problem updating embedding model: Error no file named pytorch_model.bin, model.safetensors, tf_model.h5, model.ckpt.index or flax_model.msgpack found in directory /app/backend/data/cache/embedding/models/models--Alibaba-NLP--gte-multilingual-base/snapshots/f7d567e1f2493bb0df9413965d144de9f15e7bab.
2024-08-22T08:25:11.639460621Z Traceback (most recent call last):
2024-08-22T08:25:11.639463201Z File "/app/backend/apps/rag/main.py", line 320, in update_embedding_config
2024-08-22T08:25:11.639465421Z update_embedding_model(app.state.config.RAG_EMBEDDING_MODEL)
2024-08-22T08:25:11.639467311Z File "/app/backend/apps/rag/main.py", line 184, in update_embedding_model
2024-08-22T08:25:11.639469241Z app.state.sentence_transformer_ef = sentence_transformers.SentenceTransformer(
2024-08-22T08:25:11.639471051Z ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
2024-08-22T08:25:11.639472981Z File "/usr/local/lib/python3.11/site-packages/sentence_transformers/SentenceTransformer.py", line 197, in init
2024-08-22T08:25:11.639487340Z modules = self._load_sbert_model(
2024-08-22T08:25:11.639489370Z ^^^^^^^^^^^^^^^^^^^^^^^
2024-08-22T08:25:11.639491190Z File "/usr/local/lib/python3.11/site-packages/sentence_transformers/SentenceTransformer.py", line 1296, in _load_sbert_model
2024-08-22T08:25:11.639493100Z module = Transformer(model_name_or_path, cache_dir=cache_folder, **kwargs)
2024-08-22T08:25:11.639495040Z ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
2024-08-22T08:25:11.639496920Z File "/usr/local/lib/python3.11/site-packages/sentence_transformers/models/Transformer.py", line 36, in init
2024-08-22T08:25:11.639498770Z self._load_model(model_name_or_path, config, cache_dir, **model_args)
2024-08-22T08:25:11.639500630Z File "/usr/local/lib/python3.11/site-packages/sentence_transformers/models/Transformer.py", line 65, in _load_model
2024-08-22T08:25:11.639502530Z self.auto_model = AutoModel.from_pretrained(
2024-08-22T08:25:11.639504270Z ^^^^^^^^^^^^^^^^^^^^^^^^^^
2024-08-22T08:25:11.639505980Z File "/usr/local/lib/python3.11/site-packages/transformers/models/auto/auto_factory.py", line 558, in from_pretrained
2024-08-22T08:25:11.639507850Z return model_class.from_pretrained(
2024-08-22T08:25:11.639509600Z ^^^^^^^^^^^^^^^^^^^^^^^^^^^^
2024-08-22T08:25:11.639511320Z File "/usr/local/lib/python3.11/site-packages/transformers/modeling_utils.py", line 3305, in from_pretrained
2024-08-22T08:25:11.639513190Z raise EnvironmentError(
2024-08-22T08:25:11.639515620Z OSError: Error no file named pytorch_model.bin, model.safetensors, tf_model.h5, model.ckpt.index or flax_model.msgpack found in directory /app/backend/data/cache/embedding/models/models--Alibaba-NLP--gte-multilingual-base/snapshots/f7d567e1f2493bb0df9413965d144de9f15e7bab.
2024-08-22T08:25:11.639870477Z INFO: 192.168.113.24:53340 - "POST /rag/api/v1/embedding/update HTTP/1.1" 500 Internal Server Error

Originally created by @mikael1234 on GitHub (Aug 22, 2024). Original GitHub issue: https://github.com/open-webui/open-webui/issues/4806 ### Discussed in https://github.com/open-webui/open-webui/discussions/4805 <div type='discussions-op-text'> <sup>Originally posted by **mikael1234** August 22, 2024</sup> Some models, for example sentence-transformers/all-MiniLM-L6 work fine, but others, for example Alibaba-NLP/gte-multilingual-base result an error below. This model requires RAG_EMBEDDING_MODEL_TRUST_REMOTE_CODE='True', not sure if that is related. Im usingthe default engine (transformers). 2024-08-22T08:25:11.372218964Z INFO:apps.rag.main:Updating embedding model: Alibaba-NLP/gte-multilingual-base to Alibaba-NLP/gte-multilingual-base 2024-08-22T08:25:11.372239594Z INFO:config:Saving 'RAG_EMBEDDING_MODEL' to config.json 2024-08-22T08:25:11.372751339Z INFO:sentence_transformers.SentenceTransformer:Load pretrained SentenceTransformer: /app/backend/data/cache/embedding/models/models--Alibaba-NLP--gte-multilingual-base/snapshots/f7d567e1f2493bb0df9413965d144de9f15e7bab 2024-08-22T08:25:11.639432081Z ERROR:apps.rag.main:Problem updating embedding model: Error no file named pytorch_model.bin, model.safetensors, tf_model.h5, model.ckpt.index or flax_model.msgpack found in directory /app/backend/data/cache/embedding/models/models--Alibaba-NLP--gte-multilingual-base/snapshots/f7d567e1f2493bb0df9413965d144de9f15e7bab. 2024-08-22T08:25:11.639460621Z Traceback (most recent call last): 2024-08-22T08:25:11.639463201Z File "/app/backend/apps/rag/main.py", line 320, in update_embedding_config 2024-08-22T08:25:11.639465421Z update_embedding_model(app.state.config.RAG_EMBEDDING_MODEL) 2024-08-22T08:25:11.639467311Z File "/app/backend/apps/rag/main.py", line 184, in update_embedding_model 2024-08-22T08:25:11.639469241Z app.state.sentence_transformer_ef = sentence_transformers.SentenceTransformer( 2024-08-22T08:25:11.639471051Z ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ 2024-08-22T08:25:11.639472981Z File "/usr/local/lib/python3.11/site-packages/sentence_transformers/SentenceTransformer.py", line 197, in __init__ 2024-08-22T08:25:11.639487340Z modules = self._load_sbert_model( 2024-08-22T08:25:11.639489370Z ^^^^^^^^^^^^^^^^^^^^^^^ 2024-08-22T08:25:11.639491190Z File "/usr/local/lib/python3.11/site-packages/sentence_transformers/SentenceTransformer.py", line 1296, in _load_sbert_model 2024-08-22T08:25:11.639493100Z module = Transformer(model_name_or_path, cache_dir=cache_folder, **kwargs) 2024-08-22T08:25:11.639495040Z ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ 2024-08-22T08:25:11.639496920Z File "/usr/local/lib/python3.11/site-packages/sentence_transformers/models/Transformer.py", line 36, in __init__ 2024-08-22T08:25:11.639498770Z self._load_model(model_name_or_path, config, cache_dir, **model_args) 2024-08-22T08:25:11.639500630Z File "/usr/local/lib/python3.11/site-packages/sentence_transformers/models/Transformer.py", line 65, in _load_model 2024-08-22T08:25:11.639502530Z self.auto_model = AutoModel.from_pretrained( 2024-08-22T08:25:11.639504270Z ^^^^^^^^^^^^^^^^^^^^^^^^^^ 2024-08-22T08:25:11.639505980Z File "/usr/local/lib/python3.11/site-packages/transformers/models/auto/auto_factory.py", line 558, in from_pretrained 2024-08-22T08:25:11.639507850Z return model_class.from_pretrained( 2024-08-22T08:25:11.639509600Z ^^^^^^^^^^^^^^^^^^^^^^^^^^^^ 2024-08-22T08:25:11.639511320Z File "/usr/local/lib/python3.11/site-packages/transformers/modeling_utils.py", line 3305, in from_pretrained 2024-08-22T08:25:11.639513190Z raise EnvironmentError( 2024-08-22T08:25:11.639515620Z OSError: Error no file named pytorch_model.bin, model.safetensors, tf_model.h5, model.ckpt.index or flax_model.msgpack found in directory /app/backend/data/cache/embedding/models/models--Alibaba-NLP--gte-multilingual-base/snapshots/f7d567e1f2493bb0df9413965d144de9f15e7bab. 2024-08-22T08:25:11.639870477Z INFO: 192.168.113.24:53340 - "POST /rag/api/v1/embedding/update HTTP/1.1" 500 Internal Server Error </div>
Sign in to join this conversation.
1 Participants
Notifications
Due Date
No due date set.
Dependencies

No dependencies set.

Reference: github-starred/open-webui#13738