Add configurable reranker batch size (env var RAG_RERANKING_BATCH_SIZE,
default 32) following the same pattern as RAG_EMBEDDING_BATCH_SIZE.
- config.py: PersistentConfig for RAG_RERANKING_BATCH_SIZE
- main.py: import, state init, pass to get_reranking_function
- colbert.py: accept batch_size param in predict() (was hardcoded 32)
- utils.py: get_reranking_function passes batch_size at call time
- retrieval.py: expose in config GET/POST endpoints and ConfigForm
- Documents.svelte: add Reranking Batch Size input in admin settings
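A minimal sketch of the config.py side, assuming the same
three-argument `PersistentConfig` shape used by RAG_EMBEDDING_BATCH_SIZE
(the persisted config path shown is an assumption):

```python
import os

# PersistentConfig is the helper class already defined in config.py.
RAG_RERANKING_BATCH_SIZE = PersistentConfig(
    "RAG_RERANKING_BATCH_SIZE",             # environment variable
    "rag.reranking_batch_size",             # persisted config path (assumed)
    int(os.environ.get("RAG_RERANKING_BATCH_SIZE", "32")),  # default 32
)

# main.py then exposes the value on app.state.config and hands it to
# get_reranking_function(), which forwards it as batch_size to the
# colbert predict() call.
```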
Closes #23730
Loader.load() dispatches to the underlying langchain document loaders
(PyMuPDF, Unstructured, python-docx, Tika, …) which are all
synchronous and CPU/IO-bound. process_file() awaited it directly on
the event loop, so parsing a non-trivial PDF/DOCX would freeze the
entire FastAPI app for the duration of the parse — which is what users
experience as "the server hangs whenever I upload a file."
Add an `aload()` async wrapper on Loader that runs the sync load on a
worker thread via asyncio.to_thread, and update process_file() to
await it. The sync API is preserved so existing callers that already
run inside run_in_threadpool (e.g. save_docs_to_vector_db) are
unaffected.
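A sketch of the wrapper (the load() parameter names are assumptions,
not the exact signature):

```python
import asyncio


class Loader:
    def load(self, filename, file_content_type, file_path):
        # Existing synchronous dispatch to PyMuPDF / Unstructured /
        # python-docx / Tika / ...
        ...

    async def aload(self, filename, file_content_type, file_path):
        # Same parse, pushed onto a worker thread so the event loop keeps
        # serving other requests while the document is parsed.
        return await asyncio.to_thread(
            self.load, filename, file_content_type, file_path
        )
```

process_file() then awaits `loader.aload(...)` where it previously
called `loader.load(...)` directly.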
https://claude.ai/code/session_01JSr4NZSskEUQvoJnavVXh8
Co-authored-by: Claude <noreply@anthropic.com>
* fix(retrieval): offload sync VECTOR_DB_CLIENT calls in async paths via AsyncVectorDBClient
The vector DB backends (Chroma, pgvector, Qdrant, Milvus, Pinecone,
Weaviate, …) are uniformly synchronous and their methods perform
blocking network or disk I/O. Multiple async route handlers and helpers
were calling them directly on the event loop — file processing,
memories, knowledge bases, hybrid search bookkeeping — so a single
upsert/delete/search would freeze every other in-flight request for the
duration of the call.
Introduce `AsyncVectorDBClient`, a thin async facade that wraps the
existing sync client and dispatches each method through
`asyncio.to_thread`. It mirrors `VectorDBBase` exactly and forwards
*args/**kwargs so backend-specific extra parameters keep working.
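Roughly, this revision's facade looks like the sketch below (method
list abridged, wiring line illustrative; a later commit replaces the
*args/**kwargs forwarding with explicit signatures):

```python
import asyncio


class AsyncVectorDBClient:
    """Async facade: every call runs the sync method on a worker thread."""

    def __init__(self, client):
        self.sync = client  # the existing synchronous VECTOR_DB_CLIENT

    async def search(self, *args, **kwargs):
        return await asyncio.to_thread(self.sync.search, *args, **kwargs)

    async def upsert(self, *args, **kwargs):
        return await asyncio.to_thread(self.sync.upsert, *args, **kwargs)

    async def delete(self, *args, **kwargs):
        return await asyncio.to_thread(self.sync.delete, *args, **kwargs)

    # ...one wrapper per VectorDBBase method (get, insert, query,
    # has_collection, reset, ...)


# ASYNC_VECTOR_DB_CLIENT = AsyncVectorDBClient(VECTOR_DB_CLIENT)
```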
Update every async-context call site (routers/retrieval, routers/files,
routers/memories, routers/knowledge, retrieval/utils,
tools/builtin) to await `ASYNC_VECTOR_DB_CLIENT` instead of calling the
sync client directly. Helpers that were sync-only
(`remove_knowledge_base_metadata_embedding`,
`get_all_items_from_collections`, `query_doc`) either acquire async
siblings or are awaited via `asyncio.to_thread` at their async call
sites.
The original sync `VECTOR_DB_CLIENT` is unchanged, so callers that
already run inside `run_in_threadpool` (e.g. `save_docs_to_vector_db`
and the sync `query_doc`/`get_doc` helpers) are unaffected.
https://claude.ai/code/session_01JSr4NZSskEUQvoJnavVXh8
* fix(retrieval): restore explicit AsyncVectorDBClient signatures matching VectorDBBase
Per PR review: the original *args/**kwargs forwarding lost type
safety and IDE/static-analysis support. Restore explicit signatures
that mirror VectorDBBase exactly, so:
* Bad kwargs fail at the facade boundary instead of inside the
worker thread (where the resulting TypeError tends to be
swallowed by surrounding `try/except`).
* IDE autocomplete and static analysis work as expected.
* The stated intent ("mirror VectorDBBase exactly") now holds at
the API contract level, not just behaviourally.
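The explicit shape, illustrated here with `delete` (the exact
VectorDBBase parameter list is an assumption based on the description
below):

```python
import asyncio
from typing import Optional


class AsyncVectorDBClient:
    def __init__(self, client):
        self.sync = client

    async def delete(
        self,
        collection_name: str,
        ids: Optional[list[str]] = None,
        filter: Optional[dict] = None,
    ):
        # A bad keyword such as metadata=... now fails right here, on the
        # event loop, instead of as a TypeError inside the worker thread.
        return await asyncio.to_thread(
            self.sync.delete,
            collection_name=collection_name,
            ids=ids,
            filter=filter,
        )
```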
While doing this, surface a pre-existing bug in
`delete_entries_from_collection` that the stricter typing flagged:
the call passed `metadata={'hash': hash}`, which is not a parameter
of `VectorDBBase.delete` or of any backend. The resulting TypeError
inside the sync delete was silently swallowed by `except Exception`,
so the endpoint reported `{'status': False}` for every request
instead of actually deleting matching vectors. Replace it with
`filter=...` to do what the endpoint name promises.
The thorough review's other note (no concurrency/backpressure control
on the shared default threadpool) is intentionally not addressed here:
asyncio.to_thread on the shared executor is the right primitive for
this use case, per-domain bounded executors would add lifecycle
complexity disproportionate to the problem, and the event loop is no
longer blocked, which was the actual bug.
https://claude.ai/code/session_01JSr4NZSskEUQvoJnavVXh8
* fix(retrieval): parallelize hybrid-search collection prefetch; document async facade contracts
Address PR review findings:
1. Hybrid-search prefetch was sequential
`query_collection_with_hybrid_search` previously awaited
`ASYNC_VECTOR_DB_CLIENT.get(name)` once per collection in a for
loop. Each call already off-loaded to a worker thread, but
awaiting them serially meant total prefetch latency scaled
linearly with the number of collections. Run them concurrently
with `asyncio.gather` so multi-collection queries actually
benefit from the threadpool. Per-collection exception handling
is preserved by wrapping each fetch in a small helper that
logs and returns `(name, None)` on failure, so a single bad
collection cannot poison the whole gather (see the sketch after
this list).
2. Document the thread-safety expectation explicitly
The facade now formally states what was always implicit: the
sync `VECTOR_DB_CLIENT` is shared across worker threads, so the
underlying backend driver must be thread-safe. This is not a
new exposure — `save_docs_to_vector_db` already called the sync
client from `run_in_threadpool`. Adding a global lock here
would defeat the responsiveness the facade exists to provide;
backends that cannot tolerate concurrent access should grow
their own internal serialization.
3. Document the API-surface choice and `.sync` escape hatch
The strict `VectorDBBase` mirror was a deliberate choice (the
previous `*args/**kwargs` revision let a `metadata=` typo
silently break an endpoint). Document it, and call out the
`.sync` escape hatch with an example for callers that genuinely
need a backend-specific parameter not on `VectorDBBase`.
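A sketch of the concurrent prefetch described in item 1 (helper,
client, and logger names are assumptions; the rest of the function is
omitted):

```python
import asyncio


async def query_collection_with_hybrid_search(collection_names: list[str], **kwargs):
    async def _prefetch(name: str):
        # Per-collection error handling: log and return (name, None) on
        # failure so one bad collection cannot poison the whole gather.
        try:
            return name, await ASYNC_VECTOR_DB_CLIENT.get(collection_name=name)
        except Exception as e:
            log.exception(e)
            return name, None

    # All prefetches start at once; each call is already off-loaded to a
    # worker thread, so prefetch latency no longer grows linearly with the
    # number of collections.
    collection_results = dict(
        await asyncio.gather(*(_prefetch(name) for name in collection_names))
    )
    ...
```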
https://claude.ai/code/session_01JSr4NZSskEUQvoJnavVXh8
* fix(retrieval): guard /delete against null file.hash and let HTTPException reach the client
Address PR review finding on the `metadata=` → `filter=` change in
`delete_entries_from_collection`.
The new `filter={'hash': hash}` query was correct for files that
have a hash, but did not handle `file.hash is None` (unprocessed,
failed, or legacy records). The match semantics of a null filter
value are backend-dependent — some ignore the key entirely, some
treat it as "metadata field absent" and match every such row — so
issuing the query risked deleting unrelated entries.
* Reject `hash is None` up front with a 400 explaining the file
has no hash to target.
* Narrow the surrounding `except Exception` so it no longer
swallows `HTTPException`. Without this fix the new 400 (and the
pre-existing 404 for missing files) would be silently re-shaped
into `{'status': False}` and the caller could not distinguish a
bad-request input from a backend error.
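Putting the two changes together, the relevant part of the handler now
looks roughly like this (the signature, form fields, and variable names
are assumptions):

```python
from fastapi import HTTPException


async def delete_entries_from_collection(form_data, file, user=None):
    # (router decorator, auth, and the 404 lookup of `file` omitted)
    if file.hash is None:
        # Unprocessed, failed, or legacy records have no hash to target;
        # refuse explicitly instead of issuing a filter whose null-match
        # semantics vary per backend.
        raise HTTPException(status_code=400, detail="File has no hash to delete by.")

    try:
        await ASYNC_VECTOR_DB_CLIENT.delete(
            collection_name=form_data.collection_name,
            filter={"hash": file.hash},
        )
        return {"status": True}
    except HTTPException:
        raise  # let the 400 (and the pre-existing 404) reach the client
    except Exception as e:
        log.exception(e)
        return {"status": False}
```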
https://claude.ai/code/session_01JSr4NZSskEUQvoJnavVXh8
---------
Co-authored-by: Claude <noreply@anthropic.com>
The validators.ipv6(ip, private=True) call always returns a falsy ValidationError because validators==0.35.0 does not support the private kwarg for IPv6. This means any hostname resolving to a private IPv6 address (::1, fd00::*, ::ffff:169.254.169.254) bypasses SSRF protection entirely, circumventing the fix for CVE-2025-65958.
Replace both the IPv4 and IPv6 validators-based private checks with Python's stdlib ipaddress module using an allowlist approach (not addr.is_global). This blocks all non-globally-routable addresses — private, loopback, link-local, reserved, multicast, and unspecified — for both IPv4 and IPv6, including IPv4-mapped IPv6 addresses.
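A sketch of the kind of check described, using only stdlib ipaddress
attributes (the function name and exact policy details are illustrative):

```python
import ipaddress


def is_blocked_address(ip: str) -> bool:
    try:
        addr = ipaddress.ip_address(ip)
    except ValueError:
        return True  # refuse anything that does not parse as an IP address

    # Unwrap IPv4-mapped IPv6 (e.g. ::ffff:169.254.169.254) so the IPv4
    # rules below apply to it as well.
    if isinstance(addr, ipaddress.IPv6Address) and addr.ipv4_mapped is not None:
        addr = addr.ipv4_mapped

    # Reject every non-globally-routable class explicitly.
    return (
        addr.is_private
        or addr.is_loopback
        or addr.is_link_local
        or addr.is_reserved
        or addr.is_multicast
        or addr.is_unspecified
    )
```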
Replace bare except clauses with except Exception to follow Python best practices and avoid catching unexpected system exceptions like KeyboardInterrupt and SystemExit.
`raise "string"` in Python raises TypeError instead of the intended
error, making error messages confusing and debugging difficult.
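In miniature, the two cleanups (illustrative code, not the actual call
sites):

```python
import socket

# A bare `except:` would also swallow KeyboardInterrupt and SystemExit;
# catch Exception so only ordinary errors are handled.
try:
    addr_info = socket.getaddrinfo(hostname, None)  # `hostname` is illustrative
except Exception:
    addr_info = None

# `raise "Invalid URL"` is itself a TypeError ("exceptions must derive
# from BaseException"); raise a real exception class instead.
if addr_info is None:
    raise ValueError("Invalid URL")
```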
Co-authored-by: gambletan <ethanchang32@gmail.com>
Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>
In opensearch-py >= 3.0.0, IndicesClient.refresh() no longer accepts the
index name as a positional argument. This causes a TypeError when
uploading documents to knowledge bases with OpenSearch backend.
Changes positional arguments to keyword arguments (index=...) in all
three refresh() calls in the OpenSearch vector DB client.
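The shape of the change (connection and index name are illustrative):

```python
from opensearchpy import OpenSearch

client = OpenSearch(hosts=["https://localhost:9200"])

# Before: TypeError on opensearch-py >= 3.0.0, which no longer accepts
# the index name positionally.
# client.indices.refresh("open-webui-knowledge")

# After: pass the index name as a keyword argument.
client.indices.refresh(index="open-webui-knowledge")
```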
Fixes #20649