issue: [v0.6.15] Open WebUI container uses VRAM at Idle #5650

Closed
opened 2025-11-11 16:27:20 -06:00 by GiteaMirror · 1 comment
Owner

Originally created by @slavkosky on GitHub (Jun 26, 2025).

Check Existing Issues

  • I have searched the existing issues and discussions.
  • I am using the latest version of Open WebUI.

Installation Method

Docker

Open WebUI Version

v0.6.15-cuda (ghcr.io/open-webui/open-webui:cuda)

Ollama Version (if applicable)

v0.9.3

Operating System

Windows 11 Pro (10.0.22631 Build 22631)

Browser (if applicable)

Brave 1.80.113 x64 (Chromium: 138.0.7204.49)

Confirmation

  • I have read and followed all instructions in README.md.
  • I am using the latest version of both Open WebUI and Ollama.
  • I have included the browser console logs.
  • I have included the Docker container logs.
  • I have provided every relevant configuration, setting, and environment variable used in my setup.
  • I have clearly listed every relevant configuration, custom setting, environment variable, and command-line option that influences my setup (such as Docker Compose overrides, .env values, browser settings, authentication configurations, etc).
  • I have documented step-by-step reproduction instructions that are precise, sequential, and leave nothing to interpretation. My steps:
  • Start with the initial platform/version/OS and dependencies used,
  • Specify exact install/launch/configure commands,
  • List URLs visited, user input (incl. example values/emails/passwords if needed),
  • Describe all options and toggles enabled or changed,
  • Include any files or environmental changes,
  • Identify the expected and actual result at each stage,
  • Ensure any reasonably skilled user can follow and hit the same issue.

Expected Behavior

Open WebUI uses next to no VRAM when started, and releases any used by models afterwards (OLLAMA_KEEP_ALIVE=0)

Actual Behavior

Starting the Open WebUI Docker container eats VRAM and never releases it. Clicking Save in the Admin Panel > Settings > Documents tab consumes VRAM indefinitely each time the button is clicked. That VRAM is also never released.

Steps to Reproduce

  1. Start docker container with image ghcr.io/open-webui/open-webui:cuda using the following run command:
docker run -d -p 3000:8080 --gpus all -v D:\Docker\open-webui\data:/app/backend/data -e OLLAMA_BASE_URL=http://127.0.0.1:11434 --name open-webui --restart unless-stopped ghcr.io/open-webui/open-webui:cuda
  1. Observe dedicated VRAM usage increasing while Open WebUI starts. It levels off after start.
  2. Go to Admin Panel > Settings > Documents window, click "Save"
  3. Observe VRAM usage increase and Open WebUI logs showing database activity

The only way to release the VRAM is to stop the Open WebUI Docker container. Closing the Ollama server does not free the VRAM, which means it is being consumed directly by OWUI.

Ollama server is running separately and natively on Windows, it is not part of the OWUI container.

Logs & Screenshots

OWUI:

2025-06-26 16:23:24.376 | Loading WEBUI_SECRET_KEY from file, not provided as an environment variable.
2025-06-26 16:23:24.376 | Loading WEBUI_SECRET_KEY from .webui_secret_key
2025-06-26 16:23:24.379 | CUDA is enabled, appending LD_LIBRARY_PATH to include torch/cudnn & cublas libraries.
2025-06-26 16:23:27.722 | /app/backend/open_webui
2025-06-26 16:23:27.722 | /app/backend
2025-06-26 16:23:27.722 | /app
2025-06-26 16:23:27.725 | INFO  [alembic.runtime.migration] Context impl SQLiteImpl.
2025-06-26 16:23:27.725 | INFO  [alembic.runtime.migration] Will assume non-transactional DDL.
2025-06-26 16:23:27.749 | INFO  [open_webui.env] 'ENABLE_API_KEY' loaded from the latest database entry
2025-06-26 16:23:27.749 | INFO  [open_webui.env] 'ENABLE_API_KEY_ENDPOINT_RESTRICTIONS' loaded from the latest database entry
2025-06-26 16:23:27.749 | INFO  [open_webui.env] 'API_KEY_ALLOWED_ENDPOINTS' loaded from the latest database entry
2025-06-26 16:23:27.749 | INFO  [open_webui.env] 'JWT_EXPIRES_IN' loaded from the latest database entry
2025-06-26 16:23:27.758 | INFO  [open_webui.env] 'ENABLE_OLLAMA_API' loaded from the latest database entry
2025-06-26 16:23:27.758 | INFO  [open_webui.env] 'OLLAMA_BASE_URLS' loaded from the latest database entry
2025-06-26 16:23:27.758 | INFO  [open_webui.env] 'OLLAMA_API_CONFIGS' loaded from the latest database entry
2025-06-26 16:23:27.758 | INFO  [open_webui.env] 'ENABLE_OPENAI_API' loaded from the latest database entry
2025-06-26 16:23:27.758 | INFO  [open_webui.env] 'OPENAI_API_KEYS' loaded from the latest database entry
2025-06-26 16:23:27.758 | INFO  [open_webui.env] 'OPENAI_API_BASE_URLS' loaded from the latest database entry
2025-06-26 16:23:27.758 | INFO  [open_webui.env] 'OPENAI_API_CONFIGS' loaded from the latest database entry
2025-06-26 16:23:27.758 | INFO  [open_webui.env] 'WEBUI_URL' loaded from the latest database entry
2025-06-26 16:23:27.758 | INFO  [open_webui.env] 'ENABLE_SIGNUP' loaded from the latest database entry
2025-06-26 16:23:27.758 | INFO  [open_webui.env] 'DEFAULT_LOCALE' loaded from the latest database entry
2025-06-26 16:23:27.758 | INFO  [open_webui.env] 'DEFAULT_MODELS' loaded from the latest database entry
2025-06-26 16:23:27.758 | INFO  [open_webui.env] 'DEFAULT_PROMPT_SUGGESTIONS' loaded from the latest database entry
2025-06-26 16:23:27.758 | INFO  [open_webui.env] 'MODEL_ORDER_LIST' loaded from the latest database entry
2025-06-26 16:23:27.758 | INFO  [open_webui.env] 'DEFAULT_USER_ROLE' loaded from the latest database entry
2025-06-26 16:23:27.758 | INFO  [open_webui.env] 'ENABLE_CHANNELS' loaded from the latest database entry
2025-06-26 16:23:27.758 | INFO  [open_webui.env] 'ENABLE_EVALUATION_ARENA_MODELS' loaded from the latest database entry
2025-06-26 16:23:27.758 | INFO  [open_webui.env] 'EVALUATION_ARENA_MODELS' loaded from the latest database entry
2025-06-26 16:23:27.758 | INFO  [open_webui.env] 'WEBHOOK_URL' loaded from the latest database entry
2025-06-26 16:23:27.758 | INFO  [open_webui.env] 'ENABLE_COMMUNITY_SHARING' loaded from the latest database entry
2025-06-26 16:23:27.758 | INFO  [open_webui.env] 'ENABLE_MESSAGE_RATING' loaded from the latest database entry
2025-06-26 16:23:27.758 | WARNI [open_webui.env] 
2025-06-26 16:23:27.758 | 
2025-06-26 16:23:27.758 | WARNING: CORS_ALLOW_ORIGIN IS SET TO '*' - NOT RECOMMENDED FOR PRODUCTION DEPLOYMENTS.
2025-06-26 16:23:27.758 | 
2025-06-26 16:23:27.759 | INFO  [open_webui.env] 'WEBUI_BANNERS' loaded from the latest database entry
2025-06-26 16:23:27.759 | INFO  [open_webui.env] 'SHOW_ADMIN_DETAILS' loaded from the latest database entry
2025-06-26 16:23:27.759 | INFO  [open_webui.env] 'TASK_MODEL' loaded from the latest database entry
2025-06-26 16:23:27.759 | INFO  [open_webui.env] 'TASK_MODEL_EXTERNAL' loaded from the latest database entry
2025-06-26 16:23:27.759 | INFO  [open_webui.env] 'TITLE_GENERATION_PROMPT_TEMPLATE' loaded from the latest database entry
2025-06-26 16:23:27.759 | INFO  [open_webui.env] 'TAGS_GENERATION_PROMPT_TEMPLATE' loaded from the latest database entry
2025-06-26 16:23:27.759 | INFO  [open_webui.env] 'IMAGE_PROMPT_GENERATION_PROMPT_TEMPLATE' loaded from the latest database entry
2025-06-26 16:23:27.759 | INFO  [open_webui.env] 'ENABLE_TAGS_GENERATION' loaded from the latest database entry
2025-06-26 16:23:27.759 | INFO  [open_webui.env] 'ENABLE_TITLE_GENERATION' loaded from the latest database entry
2025-06-26 16:23:27.759 | INFO  [open_webui.env] 'ENABLE_SEARCH_QUERY_GENERATION' loaded from the latest database entry
2025-06-26 16:23:27.759 | INFO  [open_webui.env] 'ENABLE_RETRIEVAL_QUERY_GENERATION' loaded from the latest database entry
2025-06-26 16:23:27.759 | INFO  [open_webui.env] 'QUERY_GENERATION_PROMPT_TEMPLATE' loaded from the latest database entry
2025-06-26 16:23:27.759 | INFO  [open_webui.env] 'ENABLE_AUTOCOMPLETE_GENERATION' loaded from the latest database entry
2025-06-26 16:23:27.759 | INFO  [open_webui.env] 'AUTOCOMPLETE_GENERATION_INPUT_MAX_LENGTH' loaded from the latest database entry
2025-06-26 16:23:27.759 | INFO  [open_webui.env] 'TOOLS_FUNCTION_CALLING_PROMPT_TEMPLATE' loaded from the latest database entry
2025-06-26 16:23:28.103 | INFO  [open_webui.env] 'ENABLE_GOOGLE_DRIVE_INTEGRATION' loaded from the latest database entry
2025-06-26 16:23:28.103 | INFO  [open_webui.env] 'ENABLE_ONEDRIVE_INTEGRATION' loaded from the latest database entry
2025-06-26 16:23:28.103 | INFO  [open_webui.env] 'CONTENT_EXTRACTION_ENGINE' loaded from the latest database entry
2025-06-26 16:23:28.103 | INFO  [open_webui.env] 'DATALAB_MARKER_API_KEY' loaded from the latest database entry
2025-06-26 16:23:28.103 | INFO  [open_webui.env] 'DATALAB_MARKER_LANGS' loaded from the latest database entry
2025-06-26 16:23:28.103 | INFO  [open_webui.env] 'DATALAB_MARKER_USE_LLM' loaded from the latest database entry
2025-06-26 16:23:28.103 | INFO  [open_webui.env] 'DATALAB_MARKER_SKIP_CACHE' loaded from the latest database entry
2025-06-26 16:23:28.103 | INFO  [open_webui.env] 'DATALAB_MARKER_FORCE_OCR' loaded from the latest database entry
2025-06-26 16:23:28.103 | INFO  [open_webui.env] 'DATALAB_MARKER_PAGINATE' loaded from the latest database entry
2025-06-26 16:23:28.103 | INFO  [open_webui.env] 'DATALAB_MARKER_STRIP_EXISTING_OCR' loaded from the latest database entry
2025-06-26 16:23:28.103 | INFO  [open_webui.env] 'DATALAB_MARKER_DISABLE_IMAGE_EXTRACTION' loaded from the latest database entry
2025-06-26 16:23:28.103 | INFO  [open_webui.env] 'DATALAB_MARKER_OUTPUT_FORMAT' loaded from the latest database entry
2025-06-26 16:23:28.103 | INFO  [open_webui.env] 'EXTERNAL_DOCUMENT_LOADER_URL' loaded from the latest database entry
2025-06-26 16:23:28.103 | INFO  [open_webui.env] 'EXTERNAL_DOCUMENT_LOADER_API_KEY' loaded from the latest database entry
2025-06-26 16:23:28.103 | INFO  [open_webui.env] 'TIKA_SERVER_URL' loaded from the latest database entry
2025-06-26 16:23:28.103 | INFO  [open_webui.env] 'DOCLING_SERVER_URL' loaded from the latest database entry
2025-06-26 16:23:28.103 | INFO  [open_webui.env] 'DOCLING_OCR_ENGINE' loaded from the latest database entry
2025-06-26 16:23:28.103 | INFO  [open_webui.env] 'DOCLING_OCR_LANG' loaded from the latest database entry
2025-06-26 16:23:28.103 | INFO  [open_webui.env] 'DOCLING_DO_PICTURE_DESCRIPTION' loaded from the latest database entry
2025-06-26 16:23:28.103 | INFO  [open_webui.env] 'DOCLING_PICTURE_DESCRIPTION_MODE' loaded from the latest database entry
2025-06-26 16:23:28.103 | INFO  [open_webui.env] 'DOCLING_PICTURE_DESCRIPTION_LOCAL' loaded from the latest database entry
2025-06-26 16:23:28.103 | INFO  [open_webui.env] 'DOCLING_PICTURE_DESCRIPTION_API' loaded from the latest database entry
2025-06-26 16:23:28.103 | INFO  [open_webui.env] 'DOCUMENT_INTELLIGENCE_ENDPOINT' loaded from the latest database entry
2025-06-26 16:23:28.103 | INFO  [open_webui.env] 'DOCUMENT_INTELLIGENCE_KEY' loaded from the latest database entry
2025-06-26 16:23:28.103 | INFO  [open_webui.env] 'MISTRAL_OCR_API_KEY' loaded from the latest database entry
2025-06-26 16:23:28.103 | INFO  [open_webui.env] 'BYPASS_EMBEDDING_AND_RETRIEVAL' loaded from the latest database entry
2025-06-26 16:23:28.103 | INFO  [open_webui.env] 'RAG_TOP_K' loaded from the latest database entry
2025-06-26 16:23:28.103 | INFO  [open_webui.env] 'RAG_TOP_K_RERANKER' loaded from the latest database entry
2025-06-26 16:23:28.103 | INFO  [open_webui.env] 'RAG_RELEVANCE_THRESHOLD' loaded from the latest database entry
2025-06-26 16:23:28.103 | INFO  [open_webui.env] 'RAG_HYBRID_BM25_WEIGHT' loaded from the latest database entry
2025-06-26 16:23:28.104 | INFO  [open_webui.env] 'ENABLE_RAG_HYBRID_SEARCH' loaded from the latest database entry
2025-06-26 16:23:28.104 | INFO  [open_webui.env] 'RAG_FULL_CONTEXT' loaded from the latest database entry
2025-06-26 16:23:28.104 | INFO  [open_webui.env] 'RAG_ALLOWED_FILE_EXTENSIONS' loaded from the latest database entry
2025-06-26 16:23:28.104 | INFO  [open_webui.env] 'RAG_EMBEDDING_ENGINE' loaded from the latest database entry
2025-06-26 16:23:28.104 | INFO  [open_webui.env] 'PDF_EXTRACT_IMAGES' loaded from the latest database entry
2025-06-26 16:23:28.104 | INFO  [open_webui.env] 'RAG_EMBEDDING_MODEL' loaded from the latest database entry
2025-06-26 16:23:28.104 | INFO  [open_webui.env] Embedding model set: Q78KG/gte-Qwen2-7B-instruct:latest
2025-06-26 16:23:28.104 | INFO  [open_webui.env] 'RAG_EMBEDDING_BATCH_SIZE' loaded from the latest database entry
2025-06-26 16:23:28.104 | INFO  [open_webui.env] 'RAG_RERANKING_ENGINE' loaded from the latest database entry
2025-06-26 16:23:28.104 | INFO  [open_webui.env] 'RAG_RERANKING_MODEL' loaded from the latest database entry
2025-06-26 16:23:28.104 | INFO  [open_webui.env] Reranking model set: BAAI/bge-reranker-v2-m3
2025-06-26 16:23:28.104 | INFO  [open_webui.env] 'RAG_EXTERNAL_RERANKER_URL' loaded from the latest database entry
2025-06-26 16:23:28.104 | INFO  [open_webui.env] 'RAG_EXTERNAL_RERANKER_API_KEY' loaded from the latest database entry
2025-06-26 16:23:28.104 | INFO  [open_webui.env] 'RAG_TEXT_SPLITTER' loaded from the latest database entry
2025-06-26 16:23:28.104 | INFO  [open_webui.env] 'CHUNK_SIZE' loaded from the latest database entry
2025-06-26 16:23:28.104 | INFO  [open_webui.env] 'CHUNK_OVERLAP' loaded from the latest database entry
2025-06-26 16:23:28.104 | INFO  [open_webui.env] 'RAG_TEMPLATE' loaded from the latest database entry
2025-06-26 16:23:28.104 | INFO  [open_webui.env] 'RAG_OPENAI_API_BASE_URL' loaded from the latest database entry
2025-06-26 16:23:28.104 | INFO  [open_webui.env] 'RAG_OPENAI_API_KEY' loaded from the latest database entry
2025-06-26 16:23:28.104 | INFO  [open_webui.env] 'RAG_AZURE_OPENAI_BASE_URL' loaded from the latest database entry
2025-06-26 16:23:28.104 | INFO  [open_webui.env] 'RAG_AZURE_OPENAI_API_KEY' loaded from the latest database entry
2025-06-26 16:23:28.104 | INFO  [open_webui.env] 'RAG_AZURE_OPENAI_API_VERSION' loaded from the latest database entry
2025-06-26 16:23:28.104 | INFO  [open_webui.env] 'RAG_OLLAMA_BASE_URL' loaded from the latest database entry
2025-06-26 16:23:28.104 | INFO  [open_webui.env] 'RAG_OLLAMA_API_KEY' loaded from the latest database entry
2025-06-26 16:23:28.104 | INFO  [open_webui.env] 'YOUTUBE_LOADER_LANGUAGE' loaded from the latest database entry
2025-06-26 16:23:28.104 | INFO  [open_webui.env] 'YOUTUBE_LOADER_PROXY_URL' loaded from the latest database entry
2025-06-26 16:23:28.104 | INFO  [open_webui.env] 'ENABLE_WEB_SEARCH' loaded from the latest database entry
2025-06-26 16:23:28.104 | INFO  [open_webui.env] 'WEB_SEARCH_ENGINE' loaded from the latest database entry
2025-06-26 16:23:28.104 | INFO  [open_webui.env] 'BYPASS_WEB_SEARCH_EMBEDDING_AND_RETRIEVAL' loaded from the latest database entry
2025-06-26 16:23:28.104 | INFO  [open_webui.env] 'BYPASS_WEB_SEARCH_WEB_LOADER' loaded from the latest database entry
2025-06-26 16:23:28.104 | INFO  [open_webui.env] 'WEB_SEARCH_RESULT_COUNT' loaded from the latest database entry
2025-06-26 16:23:28.104 | INFO  [open_webui.env] 'WEB_SEARCH_DOMAIN_FILTER_LIST' loaded from the latest database entry
2025-06-26 16:23:28.104 | INFO  [open_webui.env] 'WEB_SEARCH_CONCURRENT_REQUESTS' loaded from the latest database entry
2025-06-26 16:23:28.104 | INFO  [open_webui.env] 'WEB_LOADER_ENGINE' loaded from the latest database entry
2025-06-26 16:23:28.104 | INFO  [open_webui.env] 'ENABLE_WEB_LOADER_SSL_VERIFICATION' loaded from the latest database entry
2025-06-26 16:23:28.104 | INFO  [open_webui.env] 'WEB_SEARCH_TRUST_ENV' loaded from the latest database entry
2025-06-26 16:23:28.104 | INFO  [open_webui.env] 'SEARXNG_QUERY_URL' loaded from the latest database entry
2025-06-26 16:23:28.104 | INFO  [open_webui.env] 'YACY_QUERY_URL' loaded from the latest database entry
2025-06-26 16:23:28.104 | INFO  [open_webui.env] 'YACY_USERNAME' loaded from the latest database entry
2025-06-26 16:23:28.104 | INFO  [open_webui.env] 'YACY_PASSWORD' loaded from the latest database entry
2025-06-26 16:23:28.105 | INFO  [open_webui.env] 'GOOGLE_PSE_API_KEY' loaded from the latest database entry
2025-06-26 16:23:28.105 | INFO  [open_webui.env] 'GOOGLE_PSE_ENGINE_ID' loaded from the latest database entry
2025-06-26 16:23:28.105 | INFO  [open_webui.env] 'BRAVE_SEARCH_API_KEY' loaded from the latest database entry
2025-06-26 16:23:28.105 | INFO  [open_webui.env] 'KAGI_SEARCH_API_KEY' loaded from the latest database entry
2025-06-26 16:23:28.105 | INFO  [open_webui.env] 'MOJEEK_SEARCH_API_KEY' loaded from the latest database entry
2025-06-26 16:23:28.105 | INFO  [open_webui.env] 'BOCHA_SEARCH_API_KEY' loaded from the latest database entry
2025-06-26 16:23:28.105 | INFO  [open_webui.env] 'SERPSTACK_API_KEY' loaded from the latest database entry
2025-06-26 16:23:28.105 | INFO  [open_webui.env] 'SERPSTACK_HTTPS' loaded from the latest database entry
2025-06-26 16:23:28.105 | INFO  [open_webui.env] 'SERPER_API_KEY' loaded from the latest database entry
2025-06-26 16:23:28.105 | INFO  [open_webui.env] 'SERPLY_API_KEY' loaded from the latest database entry
2025-06-26 16:23:28.105 | INFO  [open_webui.env] 'JINA_API_KEY' loaded from the latest database entry
2025-06-26 16:23:28.105 | INFO  [open_webui.env] 'SEARCHAPI_API_KEY' loaded from the latest database entry
2025-06-26 16:23:28.105 | INFO  [open_webui.env] 'SEARCHAPI_ENGINE' loaded from the latest database entry
2025-06-26 16:23:28.105 | INFO  [open_webui.env] 'SERPAPI_API_KEY' loaded from the latest database entry
2025-06-26 16:23:28.105 | INFO  [open_webui.env] 'SERPAPI_ENGINE' loaded from the latest database entry
2025-06-26 16:23:28.105 | INFO  [open_webui.env] 'BING_SEARCH_V7_ENDPOINT' loaded from the latest database entry
2025-06-26 16:23:28.105 | INFO  [open_webui.env] 'BING_SEARCH_V7_SUBSCRIPTION_KEY' loaded from the latest database entry
2025-06-26 16:23:28.105 | INFO  [open_webui.env] 'EXA_API_KEY' loaded from the latest database entry
2025-06-26 16:23:28.105 | INFO  [open_webui.env] 'PERPLEXITY_API_KEY' loaded from the latest database entry
2025-06-26 16:23:28.105 | INFO  [open_webui.env] 'PERPLEXITY_MODEL' loaded from the latest database entry
2025-06-26 16:23:28.105 | INFO  [open_webui.env] 'PERPLEXITY_SEARCH_CONTEXT_USAGE' loaded from the latest database entry
2025-06-26 16:23:28.105 | INFO  [open_webui.env] 'SOUGOU_API_SID' loaded from the latest database entry
2025-06-26 16:23:28.105 | INFO  [open_webui.env] 'SOUGOU_API_SK' loaded from the latest database entry
2025-06-26 16:23:28.105 | INFO  [open_webui.env] 'TAVILY_API_KEY' loaded from the latest database entry
2025-06-26 16:23:28.105 | INFO  [open_webui.env] 'TAVILY_EXTRACT_DEPTH' loaded from the latest database entry
2025-06-26 16:23:28.105 | INFO  [open_webui.env] 'PLAYWRIGHT_WS_URL' loaded from the latest database entry
2025-06-26 16:23:28.105 | INFO  [open_webui.env] 'PLAYWRIGHT_TIMEOUT' loaded from the latest database entry
2025-06-26 16:23:28.105 | INFO  [open_webui.env] 'FIRECRAWL_API_KEY' loaded from the latest database entry
2025-06-26 16:23:28.105 | INFO  [open_webui.env] 'FIRECRAWL_API_BASE_URL' loaded from the latest database entry
2025-06-26 16:23:28.105 | INFO  [open_webui.env] 'EXTERNAL_WEB_SEARCH_URL' loaded from the latest database entry
2025-06-26 16:23:28.105 | INFO  [open_webui.env] 'EXTERNAL_WEB_SEARCH_API_KEY' loaded from the latest database entry
2025-06-26 16:23:28.105 | INFO  [open_webui.env] 'EXTERNAL_WEB_LOADER_URL' loaded from the latest database entry
2025-06-26 16:23:28.105 | INFO  [open_webui.env] 'EXTERNAL_WEB_LOADER_API_KEY' loaded from the latest database entry
2025-06-26 16:23:28.105 | INFO  [open_webui.env] 'IMAGE_GENERATION_ENGINE' loaded from the latest database entry
2025-06-26 16:23:28.106 | INFO  [open_webui.env] 'ENABLE_IMAGE_GENERATION' loaded from the latest database entry
2025-06-26 16:23:28.106 | INFO  [open_webui.env] 'ENABLE_IMAGE_PROMPT_GENERATION' loaded from the latest database entry
2025-06-26 16:23:28.106 | INFO  [open_webui.env] 'AUTOMATIC1111_BASE_URL' loaded from the latest database entry
2025-06-26 16:23:28.106 | INFO  [open_webui.env] 'AUTOMATIC1111_API_AUTH' loaded from the latest database entry
2025-06-26 16:23:28.106 | INFO  [open_webui.env] 'COMFYUI_BASE_URL' loaded from the latest database entry
2025-06-26 16:23:28.106 | INFO  [open_webui.env] 'COMFYUI_WORKFLOW' loaded from the latest database entry
2025-06-26 16:23:28.106 | INFO  [open_webui.env] 'COMFYUI_WORKFLOW' loaded from the latest database entry
2025-06-26 16:23:28.106 | INFO  [open_webui.env] 'IMAGES_OPENAI_API_BASE_URL' loaded from the latest database entry
2025-06-26 16:23:28.106 | INFO  [open_webui.env] 'IMAGES_OPENAI_API_KEY' loaded from the latest database entry
2025-06-26 16:23:28.106 | INFO  [open_webui.env] 'IMAGES_GEMINI_API_BASE_URL' loaded from the latest database entry
2025-06-26 16:23:28.106 | INFO  [open_webui.env] 'IMAGES_GEMINI_API_KEY' loaded from the latest database entry
2025-06-26 16:23:28.106 | INFO  [open_webui.env] 'WHISPER_MODEL' loaded from the latest database entry
2025-06-26 16:23:28.106 | INFO  [open_webui.env] 'AUDIO_STT_OPENAI_API_BASE_URL' loaded from the latest database entry
2025-06-26 16:23:28.106 | INFO  [open_webui.env] 'AUDIO_STT_OPENAI_API_KEY' loaded from the latest database entry
2025-06-26 16:23:28.106 | INFO  [open_webui.env] 'AUDIO_STT_ENGINE' loaded from the latest database entry
2025-06-26 16:23:28.106 | INFO  [open_webui.env] 'AUDIO_STT_MODEL' loaded from the latest database entry
2025-06-26 16:23:28.106 | INFO  [open_webui.env] 'AUDIO_TTS_OPENAI_API_BASE_URL' loaded from the latest database entry
2025-06-26 16:23:28.106 | INFO  [open_webui.env] 'AUDIO_TTS_OPENAI_API_KEY' loaded from the latest database entry
2025-06-26 16:23:28.106 | INFO  [open_webui.env] 'AUDIO_TTS_API_KEY' loaded from the latest database entry
2025-06-26 16:23:28.106 | INFO  [open_webui.env] 'AUDIO_TTS_ENGINE' loaded from the latest database entry
2025-06-26 16:23:28.106 | INFO  [open_webui.env] 'AUDIO_TTS_MODEL' loaded from the latest database entry
2025-06-26 16:23:28.106 | INFO  [open_webui.env] 'AUDIO_TTS_VOICE' loaded from the latest database entry
2025-06-26 16:23:28.106 | INFO  [open_webui.env] 'AUDIO_TTS_SPLIT_ON' loaded from the latest database entry
2025-06-26 16:23:28.106 | INFO  [open_webui.env] 'AUDIO_TTS_AZURE_SPEECH_REGION' loaded from the latest database entry
2025-06-26 16:23:28.106 | INFO  [open_webui.env] 'AUDIO_TTS_AZURE_SPEECH_OUTPUT_FORMAT' loaded from the latest database entry
2025-06-26 16:23:28.106 | INFO  [open_webui.env] 'ENABLE_LDAP' loaded from the latest database entry
2025-06-26 16:23:28.958 | WARNI [langchain_community.utils.user_agent] USER_AGENT environment variable not set, consider setting it to identify your requests.
2025-06-26 16:23:34.255 | 
2025-06-26 16:23:34.255 |  ██████╗ ██████╗ ███████╗███╗   ██╗    ██╗    ██╗███████╗██████╗ ██╗   ██╗██╗
2025-06-26 16:23:34.255 | ██╔═══██╗██╔══██╗██╔════╝████╗  ██║    ██║    ██║██╔════╝██╔══██╗██║   ██║██║
2025-06-26 16:23:34.255 | ██║   ██║██████╔╝█████╗  ██╔██╗ ██║    ██║ █╗ ██║█████╗  ██████╔╝██║   ██║██║
2025-06-26 16:23:34.255 | ██║   ██║██╔═══╝ ██╔══╝  ██║╚██╗██║    ██║███╗██║██╔══╝  ██╔══██╗██║   ██║██║
2025-06-26 16:23:34.255 | ╚██████╔╝██║     ███████╗██║ ╚████║    ╚███╔███╔╝███████╗██████╔╝╚██████╔╝██║
2025-06-26 16:23:34.255 |  ╚═════╝ ╚═╝     ╚══════╝╚═╝  ╚═══╝     ╚══╝╚══╝ ╚══════╝╚═════╝  ╚═════╝ ╚═╝
2025-06-26 16:23:34.255 | 
2025-06-26 16:23:34.255 | 
2025-06-26 16:23:34.255 | v0.6.15 - building the best AI user interface.
2025-06-26 16:23:34.255 | 
2025-06-26 16:23:34.255 | https://github.com/open-webui/open-webui
2025-06-26 16:23:34.255 | 
2025-06-26 16:23:34.260 | 
2025-06-26 16:23:34.260 | Fetching 13 files:   0%|          | 0/13 [00:00<?, ?it/s]
2025-06-26 16:23:34.260 | Fetching 13 files: 100%|██████████| 13/13 [00:00<00:00, 2500.73it/s]
2025-06-26 16:23:46.582 | INFO:     Started server process [1]
2025-06-26 16:23:46.583 | INFO:     Waiting for application startup.
2025-06-26 16:23:46.588 | 2025-06-26 23:23:46.588 | INFO     | open_webui.utils.logger:start_logger:140 - GLOBAL_LOG_LEVEL: INFO - {}
2025-06-26 16:23:46.588 | 2025-06-26 23:23:46.588 | INFO     | open_webui.main:lifespan:514 - Installing external dependencies of functions and tools... - {}
2025-06-26 16:23:46.605 | 2025-06-26 23:23:46.605 | INFO     | open_webui.utils.plugin:install_frontmatter_requirements:241 - No requirements found in frontmatter. - {}
2025-06-26 16:23:57.988 | 2025-06-26 23:23:57.988 | INFO     | uvicorn.protocols.http.httptools_impl:send:476 - 172.17.0.1:54198 - "GET /_app/version.json HTTP/1.1" 200 - {}

Ollama server.log:

time=2025-06-26T15:59:32.786-07:00 level=INFO source=routes.go:1235 msg="server config" env="map[CUDA_VISIBLE_DEVICES: GPU_DEVICE_ORDINAL: HIP_VISIBLE_DEVICES: HSA_OVERRIDE_GFX_VERSION: HTTPS_PROXY: HTTP_PROXY: NO_PROXY: OLLAMA_CONTEXT_LENGTH:4096 OLLAMA_DEBUG:DEBUG OLLAMA_FLASH_ATTENTION:false OLLAMA_GPU_OVERHEAD:0 OLLAMA_HOST:http://0.0.0.0:11434 OLLAMA_INTEL_GPU:false OLLAMA_KEEP_ALIVE:0s OLLAMA_KV_CACHE_TYPE: OLLAMA_LLM_LIBRARY: OLLAMA_LOAD_TIMEOUT:5m0s OLLAMA_MAX_LOADED_MODELS:1 OLLAMA_MAX_QUEUE:512 OLLAMA_MODELS:K:\\Ollama\\models\\ OLLAMA_MULTIUSER_CACHE:false OLLAMA_NEW_ENGINE:false OLLAMA_NOHISTORY:false OLLAMA_NOPRUNE:false OLLAMA_NUM_PARALLEL:0 OLLAMA_ORIGINS:[http://localhost https://localhost http://localhost:* https://localhost:* http://127.0.0.1 https://127.0.0.1 http://127.0.0.1:* https://127.0.0.1:* http://0.0.0.0 https://0.0.0.0 http://0.0.0.0:* https://0.0.0.0:* app://* file://* tauri://* vscode-webview://* vscode-file://*] OLLAMA_SCHED_SPREAD:false ROCR_VISIBLE_DEVICES:]"
time=2025-06-26T15:59:32.789-07:00 level=INFO source=images.go:476 msg="total blobs: 43"
time=2025-06-26T15:59:32.790-07:00 level=INFO source=images.go:483 msg="total unused blobs removed: 0"
time=2025-06-26T15:59:32.791-07:00 level=INFO source=routes.go:1288 msg="Listening on [::]:11434 (version 0.9.3)"
time=2025-06-26T15:59:32.791-07:00 level=DEBUG source=sched.go:108 msg="starting llm scheduler"
time=2025-06-26T15:59:32.791-07:00 level=INFO source=gpu.go:217 msg="looking for compatible GPUs"
time=2025-06-26T15:59:32.791-07:00 level=INFO source=gpu_windows.go:167 msg=packages count=1
time=2025-06-26T15:59:32.791-07:00 level=INFO source=gpu_windows.go:214 msg="" package=0 cores=16 efficiency=0 threads=32
time=2025-06-26T15:59:32.791-07:00 level=DEBUG source=gpu.go:98 msg="searching for GPU discovery libraries for NVIDIA"
time=2025-06-26T15:59:32.791-07:00 level=DEBUG source=gpu.go:501 msg="Searching for GPU library" name=nvml.dll

CUDA driver version: 12.8
calling cuDeviceGetCount
device count 1
time=2025-06-26T15:59:32.819-07:00 level=DEBUG source=gpu.go:125 msg="detected GPUs" count=1 library=C:\Windows\system32\nvcuda.dll
[GPU-3db3732a-8ee8-c1ec-d5d3-64737acab9b8] CUDA totalMem 24563mb
[GPU-3db3732a-8ee8-c1ec-d5d3-64737acab9b8] CUDA freeMem 22994mb
[GPU-3db3732a-8ee8-c1ec-d5d3-64737acab9b8] Compute Capability 8.9
time=2025-06-26T15:59:32.944-07:00 level=DEBUG source=amd_windows.go:34 msg="unable to load amdhip64_6.dll, please make sure to upgrade to the latest amd driver: The specified module could not be found."
releasing cuda driver library
releasing nvml library
time=2025-06-26T15:59:32.945-07:00 level=INFO source=types.go:130 msg="inference compute" id=GPU-3db3732a-8ee8-c1ec-d5d3-64737acab9b8 library=cuda variant=v12 compute=8.9 driver=12.8 name="NVIDIA GeForce RTX 4090" total="24.0 GiB" available="22.5 GiB"
time=2025-06-26T15:59:32.972-07:00 level=DEBUG source=gpu.go:391 msg="updating system memory data" before.total="127.9 GiB" before.free="94.1 GiB" before.free_swap="102.8 GiB" now.total="127.9 GiB" now.free="94.0 GiB" now.free_swap="102.6 GiB"
time=2025-06-26T15:59:32.990-07:00 level=DEBUG source=gpu.go:441 msg="updating cuda memory data" gpu=GPU-3db3732a-8ee8-c1ec-d5d3-64737acab9b8 name="NVIDIA GeForce RTX 4090" overhead="0 B" before.total="24.0 GiB" before.free="22.5 GiB" now.total="24.0 GiB" now.free="17.2 GiB" now.used="6.8 GiB"
releasing nvml library

VRAM Usage on Docker container startup:
https://github.com/user-attachments/assets/0953d683-185a-4bb2-965c-c89bb867ed26

Additional Information

VRAM Usage also increases each time I click "Save" in Admin Panel > Settings > Documents window: https://drive.google.com/file/d/16yof8N1mN9VSy94uDIHZf-EynO4fnzNh/view?usp=sharing

The VRAM is never released afterwards until the OWUI docker container is stopped.

Originally created by @slavkosky on GitHub (Jun 26, 2025). ### Check Existing Issues - [x] I have searched the existing issues and discussions. - [x] I am using the latest version of Open WebUI. ### Installation Method Docker ### Open WebUI Version v0.6.15-cuda (ghcr.io/open-webui/open-webui:cuda) ### Ollama Version (if applicable) v0.9.3 ### Operating System Windows 11 Pro (10.0.22631 Build 22631) ### Browser (if applicable) Brave 1.80.113 x64 (Chromium: 138.0.7204.49) ### Confirmation - [x] I have read and followed all instructions in `README.md`. - [x] I am using the latest version of **both** Open WebUI and Ollama. - [x] I have included the browser console logs. - [x] I have included the Docker container logs. - [x] I have **provided every relevant configuration, setting, and environment variable used in my setup.** - [x] I have clearly **listed every relevant configuration, custom setting, environment variable, and command-line option that influences my setup** (such as Docker Compose overrides, .env values, browser settings, authentication configurations, etc). - [x] I have documented **step-by-step reproduction instructions that are precise, sequential, and leave nothing to interpretation**. My steps: - Start with the initial platform/version/OS and dependencies used, - Specify exact install/launch/configure commands, - List URLs visited, user input (incl. example values/emails/passwords if needed), - Describe all options and toggles enabled or changed, - Include any files or environmental changes, - Identify the expected and actual result at each stage, - Ensure any reasonably skilled user can follow and hit the same issue. ### Expected Behavior Open WebUI uses next to no VRAM when started, and releases any used by models afterwards (OLLAMA_KEEP_ALIVE=0) ### Actual Behavior Starting the Open WebUI Docker container eats VRAM and never releases it. Clicking Save in the Admin Panel > Settings > Documents tab consumes VRAM indefinitely each time the button is clicked. That VRAM is also never released. ### Steps to Reproduce 1. Start docker container with image `ghcr.io/open-webui/open-webui:cuda` using the following run command: ``` docker run -d -p 3000:8080 --gpus all -v D:\Docker\open-webui\data:/app/backend/data -e OLLAMA_BASE_URL=http://127.0.0.1:11434 --name open-webui --restart unless-stopped ghcr.io/open-webui/open-webui:cuda ``` 2. Observe dedicated VRAM usage increasing while Open WebUI starts. It levels off after start. 3. Go to `Admin Panel > Settings > Documents` window, click "Save" 4. Observe VRAM usage increase and Open WebUI logs showing database activity **The only way to release the VRAM is to stop the Open WebUI Docker container**. Closing the Ollama server does not free the VRAM, which means it is being consumed directly by OWUI. Ollama server is running separately and natively on Windows, it is not part of the OWUI container. ### Logs & Screenshots OWUI: ```console 2025-06-26 16:23:24.376 | Loading WEBUI_SECRET_KEY from file, not provided as an environment variable. 2025-06-26 16:23:24.376 | Loading WEBUI_SECRET_KEY from .webui_secret_key 2025-06-26 16:23:24.379 | CUDA is enabled, appending LD_LIBRARY_PATH to include torch/cudnn & cublas libraries. 2025-06-26 16:23:27.722 | /app/backend/open_webui 2025-06-26 16:23:27.722 | /app/backend 2025-06-26 16:23:27.722 | /app 2025-06-26 16:23:27.725 | INFO [alembic.runtime.migration] Context impl SQLiteImpl. 2025-06-26 16:23:27.725 | INFO [alembic.runtime.migration] Will assume non-transactional DDL. 2025-06-26 16:23:27.749 | INFO [open_webui.env] 'ENABLE_API_KEY' loaded from the latest database entry 2025-06-26 16:23:27.749 | INFO [open_webui.env] 'ENABLE_API_KEY_ENDPOINT_RESTRICTIONS' loaded from the latest database entry 2025-06-26 16:23:27.749 | INFO [open_webui.env] 'API_KEY_ALLOWED_ENDPOINTS' loaded from the latest database entry 2025-06-26 16:23:27.749 | INFO [open_webui.env] 'JWT_EXPIRES_IN' loaded from the latest database entry 2025-06-26 16:23:27.758 | INFO [open_webui.env] 'ENABLE_OLLAMA_API' loaded from the latest database entry 2025-06-26 16:23:27.758 | INFO [open_webui.env] 'OLLAMA_BASE_URLS' loaded from the latest database entry 2025-06-26 16:23:27.758 | INFO [open_webui.env] 'OLLAMA_API_CONFIGS' loaded from the latest database entry 2025-06-26 16:23:27.758 | INFO [open_webui.env] 'ENABLE_OPENAI_API' loaded from the latest database entry 2025-06-26 16:23:27.758 | INFO [open_webui.env] 'OPENAI_API_KEYS' loaded from the latest database entry 2025-06-26 16:23:27.758 | INFO [open_webui.env] 'OPENAI_API_BASE_URLS' loaded from the latest database entry 2025-06-26 16:23:27.758 | INFO [open_webui.env] 'OPENAI_API_CONFIGS' loaded from the latest database entry 2025-06-26 16:23:27.758 | INFO [open_webui.env] 'WEBUI_URL' loaded from the latest database entry 2025-06-26 16:23:27.758 | INFO [open_webui.env] 'ENABLE_SIGNUP' loaded from the latest database entry 2025-06-26 16:23:27.758 | INFO [open_webui.env] 'DEFAULT_LOCALE' loaded from the latest database entry 2025-06-26 16:23:27.758 | INFO [open_webui.env] 'DEFAULT_MODELS' loaded from the latest database entry 2025-06-26 16:23:27.758 | INFO [open_webui.env] 'DEFAULT_PROMPT_SUGGESTIONS' loaded from the latest database entry 2025-06-26 16:23:27.758 | INFO [open_webui.env] 'MODEL_ORDER_LIST' loaded from the latest database entry 2025-06-26 16:23:27.758 | INFO [open_webui.env] 'DEFAULT_USER_ROLE' loaded from the latest database entry 2025-06-26 16:23:27.758 | INFO [open_webui.env] 'ENABLE_CHANNELS' loaded from the latest database entry 2025-06-26 16:23:27.758 | INFO [open_webui.env] 'ENABLE_EVALUATION_ARENA_MODELS' loaded from the latest database entry 2025-06-26 16:23:27.758 | INFO [open_webui.env] 'EVALUATION_ARENA_MODELS' loaded from the latest database entry 2025-06-26 16:23:27.758 | INFO [open_webui.env] 'WEBHOOK_URL' loaded from the latest database entry 2025-06-26 16:23:27.758 | INFO [open_webui.env] 'ENABLE_COMMUNITY_SHARING' loaded from the latest database entry 2025-06-26 16:23:27.758 | INFO [open_webui.env] 'ENABLE_MESSAGE_RATING' loaded from the latest database entry 2025-06-26 16:23:27.758 | WARNI [open_webui.env] 2025-06-26 16:23:27.758 | 2025-06-26 16:23:27.758 | WARNING: CORS_ALLOW_ORIGIN IS SET TO '*' - NOT RECOMMENDED FOR PRODUCTION DEPLOYMENTS. 2025-06-26 16:23:27.758 | 2025-06-26 16:23:27.759 | INFO [open_webui.env] 'WEBUI_BANNERS' loaded from the latest database entry 2025-06-26 16:23:27.759 | INFO [open_webui.env] 'SHOW_ADMIN_DETAILS' loaded from the latest database entry 2025-06-26 16:23:27.759 | INFO [open_webui.env] 'TASK_MODEL' loaded from the latest database entry 2025-06-26 16:23:27.759 | INFO [open_webui.env] 'TASK_MODEL_EXTERNAL' loaded from the latest database entry 2025-06-26 16:23:27.759 | INFO [open_webui.env] 'TITLE_GENERATION_PROMPT_TEMPLATE' loaded from the latest database entry 2025-06-26 16:23:27.759 | INFO [open_webui.env] 'TAGS_GENERATION_PROMPT_TEMPLATE' loaded from the latest database entry 2025-06-26 16:23:27.759 | INFO [open_webui.env] 'IMAGE_PROMPT_GENERATION_PROMPT_TEMPLATE' loaded from the latest database entry 2025-06-26 16:23:27.759 | INFO [open_webui.env] 'ENABLE_TAGS_GENERATION' loaded from the latest database entry 2025-06-26 16:23:27.759 | INFO [open_webui.env] 'ENABLE_TITLE_GENERATION' loaded from the latest database entry 2025-06-26 16:23:27.759 | INFO [open_webui.env] 'ENABLE_SEARCH_QUERY_GENERATION' loaded from the latest database entry 2025-06-26 16:23:27.759 | INFO [open_webui.env] 'ENABLE_RETRIEVAL_QUERY_GENERATION' loaded from the latest database entry 2025-06-26 16:23:27.759 | INFO [open_webui.env] 'QUERY_GENERATION_PROMPT_TEMPLATE' loaded from the latest database entry 2025-06-26 16:23:27.759 | INFO [open_webui.env] 'ENABLE_AUTOCOMPLETE_GENERATION' loaded from the latest database entry 2025-06-26 16:23:27.759 | INFO [open_webui.env] 'AUTOCOMPLETE_GENERATION_INPUT_MAX_LENGTH' loaded from the latest database entry 2025-06-26 16:23:27.759 | INFO [open_webui.env] 'TOOLS_FUNCTION_CALLING_PROMPT_TEMPLATE' loaded from the latest database entry 2025-06-26 16:23:28.103 | INFO [open_webui.env] 'ENABLE_GOOGLE_DRIVE_INTEGRATION' loaded from the latest database entry 2025-06-26 16:23:28.103 | INFO [open_webui.env] 'ENABLE_ONEDRIVE_INTEGRATION' loaded from the latest database entry 2025-06-26 16:23:28.103 | INFO [open_webui.env] 'CONTENT_EXTRACTION_ENGINE' loaded from the latest database entry 2025-06-26 16:23:28.103 | INFO [open_webui.env] 'DATALAB_MARKER_API_KEY' loaded from the latest database entry 2025-06-26 16:23:28.103 | INFO [open_webui.env] 'DATALAB_MARKER_LANGS' loaded from the latest database entry 2025-06-26 16:23:28.103 | INFO [open_webui.env] 'DATALAB_MARKER_USE_LLM' loaded from the latest database entry 2025-06-26 16:23:28.103 | INFO [open_webui.env] 'DATALAB_MARKER_SKIP_CACHE' loaded from the latest database entry 2025-06-26 16:23:28.103 | INFO [open_webui.env] 'DATALAB_MARKER_FORCE_OCR' loaded from the latest database entry 2025-06-26 16:23:28.103 | INFO [open_webui.env] 'DATALAB_MARKER_PAGINATE' loaded from the latest database entry 2025-06-26 16:23:28.103 | INFO [open_webui.env] 'DATALAB_MARKER_STRIP_EXISTING_OCR' loaded from the latest database entry 2025-06-26 16:23:28.103 | INFO [open_webui.env] 'DATALAB_MARKER_DISABLE_IMAGE_EXTRACTION' loaded from the latest database entry 2025-06-26 16:23:28.103 | INFO [open_webui.env] 'DATALAB_MARKER_OUTPUT_FORMAT' loaded from the latest database entry 2025-06-26 16:23:28.103 | INFO [open_webui.env] 'EXTERNAL_DOCUMENT_LOADER_URL' loaded from the latest database entry 2025-06-26 16:23:28.103 | INFO [open_webui.env] 'EXTERNAL_DOCUMENT_LOADER_API_KEY' loaded from the latest database entry 2025-06-26 16:23:28.103 | INFO [open_webui.env] 'TIKA_SERVER_URL' loaded from the latest database entry 2025-06-26 16:23:28.103 | INFO [open_webui.env] 'DOCLING_SERVER_URL' loaded from the latest database entry 2025-06-26 16:23:28.103 | INFO [open_webui.env] 'DOCLING_OCR_ENGINE' loaded from the latest database entry 2025-06-26 16:23:28.103 | INFO [open_webui.env] 'DOCLING_OCR_LANG' loaded from the latest database entry 2025-06-26 16:23:28.103 | INFO [open_webui.env] 'DOCLING_DO_PICTURE_DESCRIPTION' loaded from the latest database entry 2025-06-26 16:23:28.103 | INFO [open_webui.env] 'DOCLING_PICTURE_DESCRIPTION_MODE' loaded from the latest database entry 2025-06-26 16:23:28.103 | INFO [open_webui.env] 'DOCLING_PICTURE_DESCRIPTION_LOCAL' loaded from the latest database entry 2025-06-26 16:23:28.103 | INFO [open_webui.env] 'DOCLING_PICTURE_DESCRIPTION_API' loaded from the latest database entry 2025-06-26 16:23:28.103 | INFO [open_webui.env] 'DOCUMENT_INTELLIGENCE_ENDPOINT' loaded from the latest database entry 2025-06-26 16:23:28.103 | INFO [open_webui.env] 'DOCUMENT_INTELLIGENCE_KEY' loaded from the latest database entry 2025-06-26 16:23:28.103 | INFO [open_webui.env] 'MISTRAL_OCR_API_KEY' loaded from the latest database entry 2025-06-26 16:23:28.103 | INFO [open_webui.env] 'BYPASS_EMBEDDING_AND_RETRIEVAL' loaded from the latest database entry 2025-06-26 16:23:28.103 | INFO [open_webui.env] 'RAG_TOP_K' loaded from the latest database entry 2025-06-26 16:23:28.103 | INFO [open_webui.env] 'RAG_TOP_K_RERANKER' loaded from the latest database entry 2025-06-26 16:23:28.103 | INFO [open_webui.env] 'RAG_RELEVANCE_THRESHOLD' loaded from the latest database entry 2025-06-26 16:23:28.103 | INFO [open_webui.env] 'RAG_HYBRID_BM25_WEIGHT' loaded from the latest database entry 2025-06-26 16:23:28.104 | INFO [open_webui.env] 'ENABLE_RAG_HYBRID_SEARCH' loaded from the latest database entry 2025-06-26 16:23:28.104 | INFO [open_webui.env] 'RAG_FULL_CONTEXT' loaded from the latest database entry 2025-06-26 16:23:28.104 | INFO [open_webui.env] 'RAG_ALLOWED_FILE_EXTENSIONS' loaded from the latest database entry 2025-06-26 16:23:28.104 | INFO [open_webui.env] 'RAG_EMBEDDING_ENGINE' loaded from the latest database entry 2025-06-26 16:23:28.104 | INFO [open_webui.env] 'PDF_EXTRACT_IMAGES' loaded from the latest database entry 2025-06-26 16:23:28.104 | INFO [open_webui.env] 'RAG_EMBEDDING_MODEL' loaded from the latest database entry 2025-06-26 16:23:28.104 | INFO [open_webui.env] Embedding model set: Q78KG/gte-Qwen2-7B-instruct:latest 2025-06-26 16:23:28.104 | INFO [open_webui.env] 'RAG_EMBEDDING_BATCH_SIZE' loaded from the latest database entry 2025-06-26 16:23:28.104 | INFO [open_webui.env] 'RAG_RERANKING_ENGINE' loaded from the latest database entry 2025-06-26 16:23:28.104 | INFO [open_webui.env] 'RAG_RERANKING_MODEL' loaded from the latest database entry 2025-06-26 16:23:28.104 | INFO [open_webui.env] Reranking model set: BAAI/bge-reranker-v2-m3 2025-06-26 16:23:28.104 | INFO [open_webui.env] 'RAG_EXTERNAL_RERANKER_URL' loaded from the latest database entry 2025-06-26 16:23:28.104 | INFO [open_webui.env] 'RAG_EXTERNAL_RERANKER_API_KEY' loaded from the latest database entry 2025-06-26 16:23:28.104 | INFO [open_webui.env] 'RAG_TEXT_SPLITTER' loaded from the latest database entry 2025-06-26 16:23:28.104 | INFO [open_webui.env] 'CHUNK_SIZE' loaded from the latest database entry 2025-06-26 16:23:28.104 | INFO [open_webui.env] 'CHUNK_OVERLAP' loaded from the latest database entry 2025-06-26 16:23:28.104 | INFO [open_webui.env] 'RAG_TEMPLATE' loaded from the latest database entry 2025-06-26 16:23:28.104 | INFO [open_webui.env] 'RAG_OPENAI_API_BASE_URL' loaded from the latest database entry 2025-06-26 16:23:28.104 | INFO [open_webui.env] 'RAG_OPENAI_API_KEY' loaded from the latest database entry 2025-06-26 16:23:28.104 | INFO [open_webui.env] 'RAG_AZURE_OPENAI_BASE_URL' loaded from the latest database entry 2025-06-26 16:23:28.104 | INFO [open_webui.env] 'RAG_AZURE_OPENAI_API_KEY' loaded from the latest database entry 2025-06-26 16:23:28.104 | INFO [open_webui.env] 'RAG_AZURE_OPENAI_API_VERSION' loaded from the latest database entry 2025-06-26 16:23:28.104 | INFO [open_webui.env] 'RAG_OLLAMA_BASE_URL' loaded from the latest database entry 2025-06-26 16:23:28.104 | INFO [open_webui.env] 'RAG_OLLAMA_API_KEY' loaded from the latest database entry 2025-06-26 16:23:28.104 | INFO [open_webui.env] 'YOUTUBE_LOADER_LANGUAGE' loaded from the latest database entry 2025-06-26 16:23:28.104 | INFO [open_webui.env] 'YOUTUBE_LOADER_PROXY_URL' loaded from the latest database entry 2025-06-26 16:23:28.104 | INFO [open_webui.env] 'ENABLE_WEB_SEARCH' loaded from the latest database entry 2025-06-26 16:23:28.104 | INFO [open_webui.env] 'WEB_SEARCH_ENGINE' loaded from the latest database entry 2025-06-26 16:23:28.104 | INFO [open_webui.env] 'BYPASS_WEB_SEARCH_EMBEDDING_AND_RETRIEVAL' loaded from the latest database entry 2025-06-26 16:23:28.104 | INFO [open_webui.env] 'BYPASS_WEB_SEARCH_WEB_LOADER' loaded from the latest database entry 2025-06-26 16:23:28.104 | INFO [open_webui.env] 'WEB_SEARCH_RESULT_COUNT' loaded from the latest database entry 2025-06-26 16:23:28.104 | INFO [open_webui.env] 'WEB_SEARCH_DOMAIN_FILTER_LIST' loaded from the latest database entry 2025-06-26 16:23:28.104 | INFO [open_webui.env] 'WEB_SEARCH_CONCURRENT_REQUESTS' loaded from the latest database entry 2025-06-26 16:23:28.104 | INFO [open_webui.env] 'WEB_LOADER_ENGINE' loaded from the latest database entry 2025-06-26 16:23:28.104 | INFO [open_webui.env] 'ENABLE_WEB_LOADER_SSL_VERIFICATION' loaded from the latest database entry 2025-06-26 16:23:28.104 | INFO [open_webui.env] 'WEB_SEARCH_TRUST_ENV' loaded from the latest database entry 2025-06-26 16:23:28.104 | INFO [open_webui.env] 'SEARXNG_QUERY_URL' loaded from the latest database entry 2025-06-26 16:23:28.104 | INFO [open_webui.env] 'YACY_QUERY_URL' loaded from the latest database entry 2025-06-26 16:23:28.104 | INFO [open_webui.env] 'YACY_USERNAME' loaded from the latest database entry 2025-06-26 16:23:28.104 | INFO [open_webui.env] 'YACY_PASSWORD' loaded from the latest database entry 2025-06-26 16:23:28.105 | INFO [open_webui.env] 'GOOGLE_PSE_API_KEY' loaded from the latest database entry 2025-06-26 16:23:28.105 | INFO [open_webui.env] 'GOOGLE_PSE_ENGINE_ID' loaded from the latest database entry 2025-06-26 16:23:28.105 | INFO [open_webui.env] 'BRAVE_SEARCH_API_KEY' loaded from the latest database entry 2025-06-26 16:23:28.105 | INFO [open_webui.env] 'KAGI_SEARCH_API_KEY' loaded from the latest database entry 2025-06-26 16:23:28.105 | INFO [open_webui.env] 'MOJEEK_SEARCH_API_KEY' loaded from the latest database entry 2025-06-26 16:23:28.105 | INFO [open_webui.env] 'BOCHA_SEARCH_API_KEY' loaded from the latest database entry 2025-06-26 16:23:28.105 | INFO [open_webui.env] 'SERPSTACK_API_KEY' loaded from the latest database entry 2025-06-26 16:23:28.105 | INFO [open_webui.env] 'SERPSTACK_HTTPS' loaded from the latest database entry 2025-06-26 16:23:28.105 | INFO [open_webui.env] 'SERPER_API_KEY' loaded from the latest database entry 2025-06-26 16:23:28.105 | INFO [open_webui.env] 'SERPLY_API_KEY' loaded from the latest database entry 2025-06-26 16:23:28.105 | INFO [open_webui.env] 'JINA_API_KEY' loaded from the latest database entry 2025-06-26 16:23:28.105 | INFO [open_webui.env] 'SEARCHAPI_API_KEY' loaded from the latest database entry 2025-06-26 16:23:28.105 | INFO [open_webui.env] 'SEARCHAPI_ENGINE' loaded from the latest database entry 2025-06-26 16:23:28.105 | INFO [open_webui.env] 'SERPAPI_API_KEY' loaded from the latest database entry 2025-06-26 16:23:28.105 | INFO [open_webui.env] 'SERPAPI_ENGINE' loaded from the latest database entry 2025-06-26 16:23:28.105 | INFO [open_webui.env] 'BING_SEARCH_V7_ENDPOINT' loaded from the latest database entry 2025-06-26 16:23:28.105 | INFO [open_webui.env] 'BING_SEARCH_V7_SUBSCRIPTION_KEY' loaded from the latest database entry 2025-06-26 16:23:28.105 | INFO [open_webui.env] 'EXA_API_KEY' loaded from the latest database entry 2025-06-26 16:23:28.105 | INFO [open_webui.env] 'PERPLEXITY_API_KEY' loaded from the latest database entry 2025-06-26 16:23:28.105 | INFO [open_webui.env] 'PERPLEXITY_MODEL' loaded from the latest database entry 2025-06-26 16:23:28.105 | INFO [open_webui.env] 'PERPLEXITY_SEARCH_CONTEXT_USAGE' loaded from the latest database entry 2025-06-26 16:23:28.105 | INFO [open_webui.env] 'SOUGOU_API_SID' loaded from the latest database entry 2025-06-26 16:23:28.105 | INFO [open_webui.env] 'SOUGOU_API_SK' loaded from the latest database entry 2025-06-26 16:23:28.105 | INFO [open_webui.env] 'TAVILY_API_KEY' loaded from the latest database entry 2025-06-26 16:23:28.105 | INFO [open_webui.env] 'TAVILY_EXTRACT_DEPTH' loaded from the latest database entry 2025-06-26 16:23:28.105 | INFO [open_webui.env] 'PLAYWRIGHT_WS_URL' loaded from the latest database entry 2025-06-26 16:23:28.105 | INFO [open_webui.env] 'PLAYWRIGHT_TIMEOUT' loaded from the latest database entry 2025-06-26 16:23:28.105 | INFO [open_webui.env] 'FIRECRAWL_API_KEY' loaded from the latest database entry 2025-06-26 16:23:28.105 | INFO [open_webui.env] 'FIRECRAWL_API_BASE_URL' loaded from the latest database entry 2025-06-26 16:23:28.105 | INFO [open_webui.env] 'EXTERNAL_WEB_SEARCH_URL' loaded from the latest database entry 2025-06-26 16:23:28.105 | INFO [open_webui.env] 'EXTERNAL_WEB_SEARCH_API_KEY' loaded from the latest database entry 2025-06-26 16:23:28.105 | INFO [open_webui.env] 'EXTERNAL_WEB_LOADER_URL' loaded from the latest database entry 2025-06-26 16:23:28.105 | INFO [open_webui.env] 'EXTERNAL_WEB_LOADER_API_KEY' loaded from the latest database entry 2025-06-26 16:23:28.105 | INFO [open_webui.env] 'IMAGE_GENERATION_ENGINE' loaded from the latest database entry 2025-06-26 16:23:28.106 | INFO [open_webui.env] 'ENABLE_IMAGE_GENERATION' loaded from the latest database entry 2025-06-26 16:23:28.106 | INFO [open_webui.env] 'ENABLE_IMAGE_PROMPT_GENERATION' loaded from the latest database entry 2025-06-26 16:23:28.106 | INFO [open_webui.env] 'AUTOMATIC1111_BASE_URL' loaded from the latest database entry 2025-06-26 16:23:28.106 | INFO [open_webui.env] 'AUTOMATIC1111_API_AUTH' loaded from the latest database entry 2025-06-26 16:23:28.106 | INFO [open_webui.env] 'COMFYUI_BASE_URL' loaded from the latest database entry 2025-06-26 16:23:28.106 | INFO [open_webui.env] 'COMFYUI_WORKFLOW' loaded from the latest database entry 2025-06-26 16:23:28.106 | INFO [open_webui.env] 'COMFYUI_WORKFLOW' loaded from the latest database entry 2025-06-26 16:23:28.106 | INFO [open_webui.env] 'IMAGES_OPENAI_API_BASE_URL' loaded from the latest database entry 2025-06-26 16:23:28.106 | INFO [open_webui.env] 'IMAGES_OPENAI_API_KEY' loaded from the latest database entry 2025-06-26 16:23:28.106 | INFO [open_webui.env] 'IMAGES_GEMINI_API_BASE_URL' loaded from the latest database entry 2025-06-26 16:23:28.106 | INFO [open_webui.env] 'IMAGES_GEMINI_API_KEY' loaded from the latest database entry 2025-06-26 16:23:28.106 | INFO [open_webui.env] 'WHISPER_MODEL' loaded from the latest database entry 2025-06-26 16:23:28.106 | INFO [open_webui.env] 'AUDIO_STT_OPENAI_API_BASE_URL' loaded from the latest database entry 2025-06-26 16:23:28.106 | INFO [open_webui.env] 'AUDIO_STT_OPENAI_API_KEY' loaded from the latest database entry 2025-06-26 16:23:28.106 | INFO [open_webui.env] 'AUDIO_STT_ENGINE' loaded from the latest database entry 2025-06-26 16:23:28.106 | INFO [open_webui.env] 'AUDIO_STT_MODEL' loaded from the latest database entry 2025-06-26 16:23:28.106 | INFO [open_webui.env] 'AUDIO_TTS_OPENAI_API_BASE_URL' loaded from the latest database entry 2025-06-26 16:23:28.106 | INFO [open_webui.env] 'AUDIO_TTS_OPENAI_API_KEY' loaded from the latest database entry 2025-06-26 16:23:28.106 | INFO [open_webui.env] 'AUDIO_TTS_API_KEY' loaded from the latest database entry 2025-06-26 16:23:28.106 | INFO [open_webui.env] 'AUDIO_TTS_ENGINE' loaded from the latest database entry 2025-06-26 16:23:28.106 | INFO [open_webui.env] 'AUDIO_TTS_MODEL' loaded from the latest database entry 2025-06-26 16:23:28.106 | INFO [open_webui.env] 'AUDIO_TTS_VOICE' loaded from the latest database entry 2025-06-26 16:23:28.106 | INFO [open_webui.env] 'AUDIO_TTS_SPLIT_ON' loaded from the latest database entry 2025-06-26 16:23:28.106 | INFO [open_webui.env] 'AUDIO_TTS_AZURE_SPEECH_REGION' loaded from the latest database entry 2025-06-26 16:23:28.106 | INFO [open_webui.env] 'AUDIO_TTS_AZURE_SPEECH_OUTPUT_FORMAT' loaded from the latest database entry 2025-06-26 16:23:28.106 | INFO [open_webui.env] 'ENABLE_LDAP' loaded from the latest database entry 2025-06-26 16:23:28.958 | WARNI [langchain_community.utils.user_agent] USER_AGENT environment variable not set, consider setting it to identify your requests. 2025-06-26 16:23:34.255 | 2025-06-26 16:23:34.255 | ██████╗ ██████╗ ███████╗███╗ ██╗ ██╗ ██╗███████╗██████╗ ██╗ ██╗██╗ 2025-06-26 16:23:34.255 | ██╔═══██╗██╔══██╗██╔════╝████╗ ██║ ██║ ██║██╔════╝██╔══██╗██║ ██║██║ 2025-06-26 16:23:34.255 | ██║ ██║██████╔╝█████╗ ██╔██╗ ██║ ██║ █╗ ██║█████╗ ██████╔╝██║ ██║██║ 2025-06-26 16:23:34.255 | ██║ ██║██╔═══╝ ██╔══╝ ██║╚██╗██║ ██║███╗██║██╔══╝ ██╔══██╗██║ ██║██║ 2025-06-26 16:23:34.255 | ╚██████╔╝██║ ███████╗██║ ╚████║ ╚███╔███╔╝███████╗██████╔╝╚██████╔╝██║ 2025-06-26 16:23:34.255 | ╚═════╝ ╚═╝ ╚══════╝╚═╝ ╚═══╝ ╚══╝╚══╝ ╚══════╝╚═════╝ ╚═════╝ ╚═╝ 2025-06-26 16:23:34.255 | 2025-06-26 16:23:34.255 | 2025-06-26 16:23:34.255 | v0.6.15 - building the best AI user interface. 2025-06-26 16:23:34.255 | 2025-06-26 16:23:34.255 | https://github.com/open-webui/open-webui 2025-06-26 16:23:34.255 | 2025-06-26 16:23:34.260 | 2025-06-26 16:23:34.260 | Fetching 13 files: 0%| | 0/13 [00:00<?, ?it/s] 2025-06-26 16:23:34.260 | Fetching 13 files: 100%|██████████| 13/13 [00:00<00:00, 2500.73it/s] 2025-06-26 16:23:46.582 | INFO: Started server process [1] 2025-06-26 16:23:46.583 | INFO: Waiting for application startup. 2025-06-26 16:23:46.588 | 2025-06-26 23:23:46.588 | INFO | open_webui.utils.logger:start_logger:140 - GLOBAL_LOG_LEVEL: INFO - {} 2025-06-26 16:23:46.588 | 2025-06-26 23:23:46.588 | INFO | open_webui.main:lifespan:514 - Installing external dependencies of functions and tools... - {} 2025-06-26 16:23:46.605 | 2025-06-26 23:23:46.605 | INFO | open_webui.utils.plugin:install_frontmatter_requirements:241 - No requirements found in frontmatter. - {} 2025-06-26 16:23:57.988 | 2025-06-26 23:23:57.988 | INFO | uvicorn.protocols.http.httptools_impl:send:476 - 172.17.0.1:54198 - "GET /_app/version.json HTTP/1.1" 200 - {} ``` Ollama `server.log`: ```console time=2025-06-26T15:59:32.786-07:00 level=INFO source=routes.go:1235 msg="server config" env="map[CUDA_VISIBLE_DEVICES: GPU_DEVICE_ORDINAL: HIP_VISIBLE_DEVICES: HSA_OVERRIDE_GFX_VERSION: HTTPS_PROXY: HTTP_PROXY: NO_PROXY: OLLAMA_CONTEXT_LENGTH:4096 OLLAMA_DEBUG:DEBUG OLLAMA_FLASH_ATTENTION:false OLLAMA_GPU_OVERHEAD:0 OLLAMA_HOST:http://0.0.0.0:11434 OLLAMA_INTEL_GPU:false OLLAMA_KEEP_ALIVE:0s OLLAMA_KV_CACHE_TYPE: OLLAMA_LLM_LIBRARY: OLLAMA_LOAD_TIMEOUT:5m0s OLLAMA_MAX_LOADED_MODELS:1 OLLAMA_MAX_QUEUE:512 OLLAMA_MODELS:K:\\Ollama\\models\\ OLLAMA_MULTIUSER_CACHE:false OLLAMA_NEW_ENGINE:false OLLAMA_NOHISTORY:false OLLAMA_NOPRUNE:false OLLAMA_NUM_PARALLEL:0 OLLAMA_ORIGINS:[http://localhost https://localhost http://localhost:* https://localhost:* http://127.0.0.1 https://127.0.0.1 http://127.0.0.1:* https://127.0.0.1:* http://0.0.0.0 https://0.0.0.0 http://0.0.0.0:* https://0.0.0.0:* app://* file://* tauri://* vscode-webview://* vscode-file://*] OLLAMA_SCHED_SPREAD:false ROCR_VISIBLE_DEVICES:]" time=2025-06-26T15:59:32.789-07:00 level=INFO source=images.go:476 msg="total blobs: 43" time=2025-06-26T15:59:32.790-07:00 level=INFO source=images.go:483 msg="total unused blobs removed: 0" time=2025-06-26T15:59:32.791-07:00 level=INFO source=routes.go:1288 msg="Listening on [::]:11434 (version 0.9.3)" time=2025-06-26T15:59:32.791-07:00 level=DEBUG source=sched.go:108 msg="starting llm scheduler" time=2025-06-26T15:59:32.791-07:00 level=INFO source=gpu.go:217 msg="looking for compatible GPUs" time=2025-06-26T15:59:32.791-07:00 level=INFO source=gpu_windows.go:167 msg=packages count=1 time=2025-06-26T15:59:32.791-07:00 level=INFO source=gpu_windows.go:214 msg="" package=0 cores=16 efficiency=0 threads=32 time=2025-06-26T15:59:32.791-07:00 level=DEBUG source=gpu.go:98 msg="searching for GPU discovery libraries for NVIDIA" time=2025-06-26T15:59:32.791-07:00 level=DEBUG source=gpu.go:501 msg="Searching for GPU library" name=nvml.dll CUDA driver version: 12.8 calling cuDeviceGetCount device count 1 time=2025-06-26T15:59:32.819-07:00 level=DEBUG source=gpu.go:125 msg="detected GPUs" count=1 library=C:\Windows\system32\nvcuda.dll [GPU-3db3732a-8ee8-c1ec-d5d3-64737acab9b8] CUDA totalMem 24563mb [GPU-3db3732a-8ee8-c1ec-d5d3-64737acab9b8] CUDA freeMem 22994mb [GPU-3db3732a-8ee8-c1ec-d5d3-64737acab9b8] Compute Capability 8.9 time=2025-06-26T15:59:32.944-07:00 level=DEBUG source=amd_windows.go:34 msg="unable to load amdhip64_6.dll, please make sure to upgrade to the latest amd driver: The specified module could not be found." releasing cuda driver library releasing nvml library time=2025-06-26T15:59:32.945-07:00 level=INFO source=types.go:130 msg="inference compute" id=GPU-3db3732a-8ee8-c1ec-d5d3-64737acab9b8 library=cuda variant=v12 compute=8.9 driver=12.8 name="NVIDIA GeForce RTX 4090" total="24.0 GiB" available="22.5 GiB" time=2025-06-26T15:59:32.972-07:00 level=DEBUG source=gpu.go:391 msg="updating system memory data" before.total="127.9 GiB" before.free="94.1 GiB" before.free_swap="102.8 GiB" now.total="127.9 GiB" now.free="94.0 GiB" now.free_swap="102.6 GiB" time=2025-06-26T15:59:32.990-07:00 level=DEBUG source=gpu.go:441 msg="updating cuda memory data" gpu=GPU-3db3732a-8ee8-c1ec-d5d3-64737acab9b8 name="NVIDIA GeForce RTX 4090" overhead="0 B" before.total="24.0 GiB" before.free="22.5 GiB" now.total="24.0 GiB" now.free="17.2 GiB" now.used="6.8 GiB" releasing nvml library ``` **VRAM Usage on Docker container startup:** https://github.com/user-attachments/assets/0953d683-185a-4bb2-965c-c89bb867ed26 ### Additional Information **VRAM Usage also increases each time I click "Save"** in `Admin Panel > Settings > Documents` window: https://drive.google.com/file/d/16yof8N1mN9VSy94uDIHZf-EynO4fnzNh/view?usp=sharing The VRAM is never released afterwards until the OWUI docker container is stopped.
GiteaMirror added the bug label 2025-11-11 16:27:20 -06:00
Author
Owner

@tjbck commented on GitHub (Jun 27, 2025):

Intended behaviour with built-in sentence transformer embedding engine, I'd suggest you to use external embedding engines if you would like to change the current behaviour.

@tjbck commented on GitHub (Jun 27, 2025): Intended behaviour with built-in sentence transformer embedding engine, I'd suggest you to use external embedding engines if you would like to change the current behaviour.
Sign in to join this conversation.
1 Participants
Notifications
Due Date
No due date set.
Dependencies

No dependencies set.

Reference: github-starred/open-webui#5650