mirror of
https://github.com/open-webui/open-webui.git
synced 2026-03-10 15:54:15 -05:00
issue: WebLoader unable to handle non-html links #5861
Originally created by @ifgreulich on GitHub (Jul 24, 2025).
Check Existing Issues
Installation Method
Docker
Open WebUI Version
0.6.18
Ollama Version (if applicable)
0.9.6 but not relevant
Operating System
Ubuntu 24.04
Browser (if applicable)
not relevant
Confirmation
Expected Behavior
If Web Search is configured with Google PSE, Open WebUI should be able to handle every link the search engine returns in its result list, even when a link does not point to an HTML document. For example, if a link points to a PDF document, the document should be downloaded and processed (e.g. with Apache Tika), and its content used as if it were the content of an HTML page.
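To make the expectation concrete, here is a minimal sketch of how a loader could decide what to do with a search-result link. It is not Open WebUI's actual code: the function name `choose_route`, the `ROUTES` table, and the route labels are all hypothetical, and it assumes the Content-Type header has already been obtained elsewhere.

```python
from typing import Optional
from urllib.parse import urlparse

# Hypothetical mapping from MIME type to a processing route. The route
# names are illustrative only and are not Open WebUI identifiers.
ROUTES = {
    "text/html": "html",
    "application/xhtml+xml": "html",
    "application/pdf": "document",
}


def choose_route(url: str, content_type: Optional[str]) -> str:
    """Pick a processing route for a search-result link.

    Prefers the Content-Type header and falls back to the file extension
    in the URL path when the header is missing or too generic.
    """
    # Drop parameters such as "; charset=utf-8" and normalize case.
    mime = (content_type or "").split(";", 1)[0].strip().lower()
    if mime in ROUTES:
        return ROUTES[mime]

    # Fallback: inspect the path only, so query strings do not interfere.
    path = urlparse(url).path.lower()
    if path.endswith(".pdf"):
        return "document"
    if path.endswith((".html", ".htm")):
        return "html"

    # Unrecognized content: let the caller decide whether to skip it.
    return "unknown"
```

With a split like this, a "document" result could be handed to an extraction engine such as Apache Tika, while "html" results would keep going through the normal page loader.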
Actual Behavior
Currently, search-result links that point to a non-HTML document (at least PDFs) fail to load: the request runs into a timeout (see attached logs), and an error message appears in the web UI.
Steps to Reproduce
Logs & Screenshots
Open-Webui.log
Additional Information
No response
@tjbck commented on GitHub (Jul 24, 2025):
This has nothing to do with WebLoader; it is caused by your reverse proxy timeout config.