[GH-ISSUE #21598] bug: Images uploaded to chat are sent as file ID, not accessible URL, causing vision models to fail #19529

New Issue

GiteaMirror · 2026-04-20T02:00:25-05:00

GiteaMirror commented

2026-04-20 02:00:25 -05:00

Originally created by @kpasechnikov2 on GitHub (Feb 19, 2026).
Original GitHub issue: https://github.com/open-webui/open-webui/issues/21598

Check Existing Issues

I have searched for any existing and/or related issues.
I have searched for any existing and/or related discussions.
I have also searched in the CLOSED issues AND CLOSED discussions and found no related items (your issue might already be addressed on the development branch!).
I am using the latest version of Open WebUI.

Installation Method

Docker

Open WebUI Version

0.8.2

Ollama Version (if applicable)

No response

Operating System

Ubuntu 22.04, browser upload (reproducible with Docker or git clone)

Browser (if applicable)

Chrome latest

Confirmation

I have read and followed all instructions in README.md.
I am using the latest version of both Open WebUI and Ollama.
I have included the browser console logs.
I have included the Docker container logs.
I have provided every relevant configuration, setting, and environment variable used in my setup.
I have clearly listed every relevant configuration, custom setting, environment variable, and command-line option that influences my setup (such as Docker Compose overrides, .env values, browser settings, authentication configurations, etc).
I have documented step-by-step reproduction instructions that are precise, sequential, and leave nothing to interpretation. My steps:
Start with the initial platform/version/OS and dependencies used,
Specify exact install/launch/configure commands,
List URLs visited, user input (incl. example values/emails/passwords if needed),
Describe all options and toggles enabled or changed,
Include any files or environmental changes,
Identify the expected and actual result at each stage,
Ensure any reasonably skilled user can follow and hit the same issue.

Expected Behavior

When sending an image in chat (via browser upload), the image metadata's url property is only a file ID, not a full URL. As a result, when the chat message is sent to a vision model (e.g. GPT-4o, GPT-5.x, Azure, OpenAI etc), the backend/model cannot access the image and only gets the file ID, not the image bytes or a valid URL.

The current frontend code uploads successfully but records file.url=1465c96b-e52d-4b41-a06a-223144f838ae instead of /files/1465c96b-e52d-4b41-a06a-223144f838ae/content or a data:image/...;base64,... string.
When preparing the payload for the model, the code does not convert file IDs to retrievable URLs.
If you run a vision model (e.g. "GPT-4o") with an OpenAI-compatible backend (Azure, vLLM, Foundry, etc), you get a message like: I can help, but I need the actual image pixels to run OCR. The attachment you sent shows only a file reference (1465c96b-e52d-4b41-a06a-223144f838ae) and I can't access it from here.
This happens even though the frontend UI correctly reconstructs the file URL for displaying the image preview.

Actual Behavior

After uploading and sending an image, the model receives only a file ID as its url field, not a real URL or base64 image.
Vision models (Azure, GPT-4/5, etc) do not see an image at all because they cannot dereference the file id.
Human UI users can see the image, but vision LLMs cannot.
Workarounds (like uploading to a public server and pasting a real URL) do work, so this is a file reference/URL issue, not an auth/network issue.

Steps to Reproduce

Start Open-WebUI (v0.8.2). Connect a vision-capable OpenAI-compatible model (Azure with vision, Foundry, vLLM+vision, etc)
In chat, upload an image via the browser.
Send message: "recognize text on this image"
Model responds: "I can't access file attachment — only received reference ID, not actual image."

Open the browser devtools and inspect the network/chat payload: the user message file block contains

"files": [{"type": "file", ... "url": "1465c96b-e52d-4b41-a06a-223144f838ae", ...}]

instead of

"files": [{"type": "image", ... "url": "/files/1465c96b-e52d-4b41-a06a-223144f838ae/content", ...}]

or...

"files": [{"type": "image", ... "url": "data:image/png;base64,...", ...}]

As a result, vision models cannot "see" the uploaded image attachments and always reply needing a public link/base64 upload.

Logs & Screenshots

See network payload for file upload and final chat submission. Screenshot attached of model response:

I can help, but I need the actual image pixels to run OCR. The attachment you sent shows only a file reference (1465c96b-e52d-4b41-a06a-223144f838ae) and I can't access it from here.

Try with Azure Foundry, GPT-4o, vLLM, or any OpenAI-compatible vision endpoint. You will see only a file ID where a URL or image data is required.

Additional Information

How to fix:

When preparing messages with image/files for sending to the model, reconstruct the file URL using:
- ${WEBUI_API_BASE_URL}/files/${file.url}/content if the URL is a file ID
- Or, alternatively, upload as data URL if the provider supports inline base64 images
Ensure the model always receives a retrievable URL or base64-encoded image in the chat payload

References:

Frontend: src/lib/components/chat/Chat.svelte (code link)
UI reconstruction logic: src/lib/components/chat/MessageInput.svelte (code link)

Originally created by @kpasechnikov2 on GitHub (Feb 19, 2026). Original GitHub issue: https://github.com/open-webui/open-webui/issues/21598 ### Check Existing Issues - [x] I have searched for any existing and/or related issues. - [x] I have searched for any existing and/or related discussions. - [x] I have also searched in the CLOSED issues AND CLOSED discussions and found no related items (your issue might already be addressed on the development branch!). - [x] I am using the latest version of Open WebUI. ### Installation Method Docker ### Open WebUI Version 0.8.2 ### Ollama Version (if applicable) _No response_ ### Operating System Ubuntu 22.04, browser upload (reproducible with Docker or git clone) ### Browser (if applicable) Chrome latest ### Confirmation - [x] I have read and followed all instructions in `README.md`. - [x] I am using the latest version of **both** Open WebUI and Ollama. - [x] I have included the browser console logs. - [x] I have included the Docker container logs. - [x] I have **provided every relevant configuration, setting, and environment variable used in my setup.** - [x] I have clearly **listed every relevant configuration, custom setting, environment variable, and command-line option that influences my setup** (such as Docker Compose overrides, .env values, browser settings, authentication configurations, etc). - [x] I have documented **step-by-step reproduction instructions that are precise, sequential, and leave nothing to interpretation**. My steps: - Start with the initial platform/version/OS and dependencies used, - Specify exact install/launch/configure commands, - List URLs visited, user input (incl. example values/emails/passwords if needed), - Describe all options and toggles enabled or changed, - Include any files or environmental changes, - Identify the expected and actual result at each stage, - Ensure any reasonably skilled user can follow and hit the same issue. ### Expected Behavior **When sending an image in chat (via browser upload), the image metadata's `url` property is only a file ID, not a full URL. As a result, when the chat message is sent to a vision model (e.g. GPT-4o, GPT-5.x, Azure, OpenAI etc), the backend/model cannot access the image and only gets the file ID, not the image bytes or a valid URL.** - The current frontend code uploads successfully but records file.url=`1465c96b-e52d-4b41-a06a-223144f838ae` instead of `/files/1465c96b-e52d-4b41-a06a-223144f838ae/content` or a `data:image/...;base64,...` string. - When preparing the payload for the model, the code does not convert file IDs to retrievable URLs. - If you run a vision model (e.g. "GPT-4o") with an OpenAI-compatible backend (Azure, vLLM, Foundry, etc), you get a message like: `I can help, but I need the actual image pixels to run OCR. The attachment you sent shows only a file reference (1465c96b-e52d-4b41-a06a-223144f838ae) and I can't access it from here.` - This happens even though the frontend UI correctly reconstructs the file URL for displaying the image preview. ### Actual Behavior - After uploading and sending an image, the model receives only a file ID as its `url` field, not a real URL or base64 image. - Vision models (Azure, GPT-4/5, etc) **do not see an image at all** because they cannot dereference the file id. - Human UI users can see the image, but vision LLMs cannot. - Workarounds (like uploading to a public server and pasting a real URL) do work, so this is a file reference/URL issue, not an auth/network issue. ### Steps to Reproduce 1. Start Open-WebUI (v0.8.2). Connect a vision-capable OpenAI-compatible model (Azure with vision, Foundry, vLLM+vision, etc) 2. In chat, upload an image via the browser. 3. Send message: "recognize text on this image" 4. Model responds: "I can't access file attachment — only received reference ID, not actual image." 5. Open the browser devtools and inspect the network/chat payload: the user message file block contains ```json "files": [{"type": "file", ... "url": "1465c96b-e52d-4b41-a06a-223144f838ae", ...}] ``` instead of ```json "files": [{"type": "image", ... "url": "/files/1465c96b-e52d-4b41-a06a-223144f838ae/content", ...}] ``` or... ```json "files": [{"type": "image", ... "url": "data:image/png;base64,...", ...}] ``` 6. As a result, vision models cannot "see" the uploaded image attachments and always reply needing a public link/base64 upload. ### Logs & Screenshots See network payload for file upload and final chat submission. Screenshot attached of model response: > I can help, but I need the actual image pixels to run OCR. The attachment you sent shows only a file reference (1465c96b-e52d-4b41-a06a-223144f838ae) and I can't access it from here. Try with Azure Foundry, GPT-4o, vLLM, or any OpenAI-compatible vision endpoint. You will see only a file ID where a URL or image data is required. ### Additional Information **How to fix:** - When preparing messages with image/files for sending to the model, reconstruct the file URL using: - `${WEBUI_API_BASE_URL}/files/${file.url}/content` if the URL is a file ID - Or, alternatively, upload as data URL if the provider supports inline base64 images - Ensure the model always receives a retrievable URL or base64-encoded image in the chat payload **References:** - Frontend: `src/lib/components/chat/Chat.svelte` ([code link](https://github.com/open-webui/open-webui/blob/main/src/lib/components/chat/Chat.svelte#L792-L825)) - UI reconstruction logic: `src/lib/components/chat/MessageInput.svelte` ([code link](https://github.com/open-webui/open-webui/blob/main/src/lib/components/chat/MessageInput.svelte#L1195-L1214))

GiteaMirror added the bug label 2026-04-20 02:00:25 -05:00

GiteaMirror closed this issue

2026-04-20 02:00:26 -05:00

GiteaMirror commented

2026-04-20 02:00:27 -05:00

@pr-validator-bot commented on GitHub (Feb 19, 2026):

⚠️ Missing Issue Title Prefix

@kpasechnikov2, your issue title is missing a prefix (e.g., bug:, feat:, docs:).

Please update your issue title to include one of the following prefixes:

bug: Bug report or error you've encountered
feat: Feature request or enhancement suggestion
docs: Documentation issue or improvement request
question: Question about usage or functionality
help: Request for help or support

Example: bug: Login fails when using special characters in password

@pr-validator-bot commented on GitHub (Feb 19, 2026): # ⚠️ Missing Issue Title Prefix @kpasechnikov2, your issue title is missing a prefix (e.g., `bug:`, `feat:`, `docs:`). Please update your issue title to include one of the following prefixes: - **bug**: Bug report or error you've encountered - **feat**: Feature request or enhancement suggestion - **docs**: Documentation issue or improvement request - **question**: Question about usage or functionality - **help**: Request for help or support Example: `bug: Login fails when using special characters in password`

GiteaMirror commented

2026-04-20 02:00:27 -05:00

@Classic298 commented on GitHub (Feb 19, 2026):

fixed in 0.8.3 read the changelog, also duplicate issue use the search

@Classic298 commented on GitHub (Feb 19, 2026): fixed in 0.8.3 read the changelog, also duplicate issue use the search

GiteaMirror referenced this issue

2026-04-20 05:50:21 -05:00

[PR #19529] [CLOSED] feat: Toggle disable folders visibility from admin general settings in features configuration #25241

GiteaMirror referenced this issue

2026-04-25 13:15:56 -05:00

[PR #19529] [CLOSED] feat: Toggle disable folders visibility from admin general settings in features configuration #40871

GiteaMirror referenced this issue

2026-04-29 23:54:53 -05:00

[PR #19529] [CLOSED] feat: Toggle disable folders visibility from admin general settings in features configuration #48289

GiteaMirror referenced this issue

2026-05-06 09:24:24 -05:00

[PR #19529] [CLOSED] feat: Toggle disable folders visibility from admin general settings in features configuration #64097