mirror of
https://github.com/open-webui/open-webui.git
synced 2026-05-06 19:08:59 -05:00
[PR #15902] [CLOSED] feat: Add configurable API URL and additional_config for Datalab Marker API Doc Parser #10775
📋 Pull Request Information
Original PR: https://github.com/open-webui/open-webui/pull/15902
Author: @Hisma
Created: 7/21/2025
Status: ❌ Closed
Base: dev ← Head: marker-api-content-extraction
📝 Commits (8)
- 5fbfe2b Merge pull request #15879 from open-webui/dev
- b234b20 Update catalan translation.json
- 241d2c8 refac: memory handling
- 413b19a fix: dev.sh
- ba5b554 refac/fix: channel messages
- 5e91375 feat: add additional_config parameter
- bcc9e43 feat: add datalab_marker_api_base_url feature
- 32f290f fix: url not being passed through backend flow
📊 Changes
11 files changed (+197 additions, -118 deletions)
📝 backend/dev.sh (+1 -1)
📝 backend/open_webui/config.py (+10 -4)
📝 backend/open_webui/main.py (+4 -2)
📝 backend/open_webui/models/memories.py (+13 -4)
📝 backend/open_webui/retrieval/loaders/datalab_marker.py (+9 -5)
📝 backend/open_webui/retrieval/loaders/main.py (+6 -1)
📝 backend/open_webui/routers/channels.py (+18 -14)
📝 backend/open_webui/routers/memories.py (+4 -0)
📝 backend/open_webui/routers/retrieval.py (+16 -8)
📝 src/lib/components/admin/Settings/Documents.svelte (+52 -15)
📝 src/lib/i18n/locales/ca-ES/translation.json (+64 -64)
📄 Description
Changelog Entry
Description
Reference issue - https://github.com/open-webui/open-webui/issues/13137#issuecomment-3014486758
This PR enhances the Datalab Marker API integration by adding support for a configurable, self-hosted Marker API URL and by replacing the deprecated language selection feature with the new additional_config parameter. Users can now specify custom Datalab Marker API endpoints and control the latest Marker API processing options.
Added
- Configurable Datalab Marker API URL, defaulting to https://www.datalab.to/api/v1/marker when left empty
- additional_config options named in the PR:
  - disable_links, keep_pageheader_in_output, keep_pagefooter_in_output
  - filter_blank_pages, drop_repeated_text, layout_coverage_threshold
  - merge_threshold, height_tolerance, gap_threshold, image_threshold
  - min_line_length, level_count, default_level
Changed
Deprecated
Removed
Security
Additional Information
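As a rough illustration of the behavior described in this PR (fall back to the hosted endpoint when the URL field is left empty, and pass extra Marker options through additional_config), here is a minimal sketch. It is not the code from the PR: the function names resolve_marker_api_url and parse_additional_config are hypothetical, and it assumes additional_config is entered as a JSON object string, which the description does not confirm.

```python
import json

# Hosted endpoint named in the PR description as the default.
DEFAULT_MARKER_API_URL = "https://www.datalab.to/api/v1/marker"


def resolve_marker_api_url(configured_url):
    """Hypothetical helper: use the configured URL, or the hosted default when empty."""
    url = (configured_url or "").strip()
    return url or DEFAULT_MARKER_API_URL


def parse_additional_config(raw):
    """Hypothetical helper: parse a JSON object of Marker options.

    Assumes the setting is stored as a JSON string; empty input yields {}.
    """
    if not raw or not raw.strip():
        return {}
    parsed = json.loads(raw)
    if not isinstance(parsed, dict):
        raise ValueError("additional_config must be a JSON object")
    return parsed


if __name__ == "__main__":
    # Empty field falls back to the hosted API; a self-hosted URL is kept as-is.
    print(resolve_marker_api_url(""))
    print(resolve_marker_api_url("http://localhost:8000/marker"))
    # Option names are taken from the PR's "Added" list; values are made up.
    print(parse_additional_config('{"disable_links": true, "level_count": 3}'))
```

The localhost URL and the option values above are placeholders for illustration only.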
https://www.datalab.to/api/v1/marker (can be overridden with self-hosted API)
Key Enhancements:
Testing Environment:
docker build --build-arg USE_CUDA=true --build-arg USE_CUDA_VER=cu121 --build-arg USE_OLLAMA=false -t openwebui-custom:dev .
docker.io/hisma/openwebui:dev
Backend Logs Confirm Success:
Screenshots
Contributor License Agreement
By submitting this pull request, I confirm that I have read and fully agree to the Contributor License Agreement (CLA), and I am providing my contributions under its terms.
🔄 This issue represents a GitHub Pull Request. It cannot be merged through Gitea due to API limitations.