[GH-ISSUE #14496] Ollama Cloud: 404 on /v1/embeddings endpoint using OpenAI client #35162

Closed
opened 2026-04-22 19:28:13 -05:00 by GiteaMirror · 3 comments
Owner

Originally created by @kishensjain on GitHub (Feb 27, 2026).
Original GitHub issue: https://github.com/ollama/ollama/issues/14496

404 Error When Calling /v1/embeddings with Ollama Cloud Using OpenAI Python Client

Issue Description

When using Ollama Cloud with the OpenAI Python client and attempting to call the /v1/embeddings endpoint, the server returns a 404 error:

404 - {'error': 'path "/v1/embeddings" not found'}


Reproduction Code

from openai import OpenAI
import os

client = OpenAI(
base_url="https://ollama.com/v1",
api_key=os.getenv("OLLAMA_API_KEY")
)

response = client.embeddings.create(
model="qwen3-vl:235b-cloud",
input="test"
)


Observed Behavior

The request fails with:

openai.NotFoundError: Error code: 404 - {'error': 'path "/v1/embeddings" not found'}


Documentation Reference

The documentation mentions:

"OpenAI API Compatibility: support for the /v1/embeddings OpenAI-compatible endpoint (Coming soon)"


Questions

  1. Is there currently a supported method for generating embeddings using Ollama Cloud?
  2. If not, is there an estimated timeline for /v1/embeddings support?
Originally created by @kishensjain on GitHub (Feb 27, 2026). Original GitHub issue: https://github.com/ollama/ollama/issues/14496 # 404 Error When Calling /v1/embeddings with Ollama Cloud Using OpenAI Python Client ## Issue Description When using Ollama Cloud with the OpenAI Python client and attempting to call the /v1/embeddings endpoint, the server returns a 404 error: 404 - {'error': 'path "/v1/embeddings" not found'} --- ## Reproduction Code from openai import OpenAI import os client = OpenAI( base_url="https://ollama.com/v1", api_key=os.getenv("OLLAMA_API_KEY") ) response = client.embeddings.create( model="qwen3-vl:235b-cloud", input="test" ) --- ## Observed Behavior The request fails with: openai.NotFoundError: Error code: 404 - {'error': 'path "/v1/embeddings" not found'} --- ## Documentation Reference The documentation mentions: "OpenAI API Compatibility: support for the /v1/embeddings OpenAI-compatible endpoint (Coming soon)" --- ## Questions 1. Is there currently a supported method for generating embeddings using Ollama Cloud? 2. If not, is there an estimated timeline for /v1/embeddings support?
GiteaMirror added the feature request label 2026-04-22 19:28:13 -05:00
Author
Owner

@rick-github commented on GitHub (Feb 27, 2026):

Is there currently a supported method for generating embeddings using Ollama Cloud?

No, there are no embedding models in Ollama Cloud. There is currently no public roadmap for adding embeddings to Cloud.

<!-- gh-comment-id:3972680400 --> @rick-github commented on GitHub (Feb 27, 2026): > Is there currently a supported method for generating embeddings using Ollama Cloud? No, there are no embedding models in Ollama Cloud. There is currently no public roadmap for adding embeddings to Cloud.
Author
Owner

@kishensjain commented on GitHub (Feb 27, 2026):

Is there currently a supported method for generating embeddings using Ollama Cloud?

No, there are no embedding models in Ollama Cloud. There is currently no public roadmap for adding embeddings to Cloud.

The documentation says it is coming soon. Do you have any idea how soon it will be available?
Link to the documentation page. It is mentioned at the end of the page.

<!-- gh-comment-id:3972716650 --> @kishensjain commented on GitHub (Feb 27, 2026): > > Is there currently a supported method for generating embeddings using Ollama Cloud? > > No, there are no embedding models in Ollama Cloud. There is currently no public roadmap for adding embeddings to Cloud. The documentation says it is coming soon. Do you have any idea how soon it will be available? [Link](https://ollama.com/blog/embedding-models) to the documentation page. It is mentioned at the end of the page.
Author
Owner

@rick-github commented on GitHub (Feb 27, 2026):

Embeddings are supported on local models. There is currently no public roadmap for adding embeddings to Cloud.

<!-- gh-comment-id:3972790340 --> @rick-github commented on GitHub (Feb 27, 2026): Embeddings are supported on local models. There is currently no public roadmap for adding embeddings to Cloud.
Sign in to join this conversation.
1 Participants
Notifications
Due Date
No due date set.
Dependencies

No dependencies set.

Reference: github-starred/ollama#35162