[GH-ISSUE #15741] Kimi models require pro ? #72094

New Issue

GiteaMirror · 2026-05-05T03:27:12-05:00

GiteaMirror commented

2026-05-05 03:27:12 -05:00

Originally created by @Azecko on GitHub (Apr 22, 2026).
Original GitHub issue: https://github.com/ollama/ollama/issues/15741

What is the issue?

I've always used kimi-k2.5:cloud for Ollama. Mainly with the Hermes Agent.

Today, when I tried to message my Hermes, I got the error HTTP 403: Error code: 403 - {'error': 'this model requires a subscription, upgrade for access: https://ollama.com/upgrade.

I tried to prompt kimi using just ollama run, same thing.
Using the new Kimi CLI from v0.21.1 (because I also tried to update Ollama), same thing.
With kimi-k2.6:cloud, I also get the same error.
But I do not get this error when I try with other cloud models, like gemma.

I'm correctly signed in when I run ollama signin.

My Cloud usage for this week is at 37%, and 0% for the session at the time I write this issue.

Am I the only one who get this error ? Is there something that I missed and now Kimi models require Ollama pro ?

Relevant log output

Error: 403 Forbidden: this model requires a subscription, upgrade for access: https://ollama.com/upgrade

OS

Linux

GPU

No response

CPU

Intel

Ollama version

0.21.1

Originally created by @Azecko on GitHub (Apr 22, 2026). Original GitHub issue: https://github.com/ollama/ollama/issues/15741 ### What is the issue? I've always used `kimi-k2.5:cloud` for Ollama. Mainly with the Hermes Agent. Today, when I tried to message my Hermes, I got the error `HTTP 403: Error code: 403 - {'error': 'this model requires a subscription, upgrade for access: https://ollama.com/upgrade`. I tried to prompt kimi using just `ollama run`, same thing. Using the new Kimi CLI from `v0.21.1` (because I also tried to update Ollama), same thing. With `kimi-k2.6:cloud`, I also get the same error. But I do not get this error when I try with other cloud models, like gemma. I'm correctly signed in when I run `ollama signin`. My Cloud usage for this week is at 37%, and 0% for the session at the time I write this issue. Am I the only one who get this error ? Is there something that I missed and now Kimi models require Ollama pro ? ### Relevant log output ```shell Error: 403 Forbidden: this model requires a subscription, upgrade for access: https://ollama.com/upgrade ``` ### OS Linux ### GPU _No response_ ### CPU Intel ### Ollama version 0.21.1

GiteaMirror added the bug label 2026-05-05 03:27:12 -05:00

GiteaMirror commented

2026-05-05 03:27:14 -05:00

@sruizcarmona commented on GitHub (Apr 22, 2026):

Same here, no updates from ollama anywhere...

@sruizcarmona commented on GitHub (Apr 22, 2026): Same here, no updates from ollama anywhere...

GiteaMirror commented

2026-05-05 03:27:14 -05:00

@AngelSanchezB commented on GitHub (Apr 22, 2026):

Same here since last night...

@AngelSanchezB commented on GitHub (Apr 22, 2026): Same here since last night...

GiteaMirror commented

2026-05-05 03:27:16 -05:00

@ACheshirov commented on GitHub (Apr 22, 2026):

They disabled for free users almost all big models.

@ACheshirov commented on GitHub (Apr 22, 2026): They disabled for free users almost all big models.

GiteaMirror commented

2026-05-05 03:27:17 -05:00

@LeoLP1 commented on GitHub (Apr 22, 2026):

@ACheshirov Is that something you think, or did you read it somewhere?

@LeoLP1 commented on GitHub (Apr 22, 2026): @ACheshirov Is that something you think, or did you read it somewhere?

GiteaMirror commented

2026-05-05 03:27:18 -05:00

@ACheshirov commented on GitHub (Apr 22, 2026):

@LeoLP1 I tried a lot of their large cloud models - all of them are disabled for free users.

People on their Discord server says that it's not confirmed that the restrictions are permanent...
However, considering the number of people with paid subscriptions who have complained about interruptions and very slow responses from the models, this free tier restriction was expected...

More likely, they will think of another way to give access to free users that is more difficult to abuse.

@ACheshirov commented on GitHub (Apr 22, 2026): @LeoLP1 I tried a lot of their large cloud models - all of them are disabled for free users. People on their Discord server says that it's not confirmed that the restrictions are permanent... However, considering the number of people with paid subscriptions who have complained about interruptions and very slow responses from the models, this free tier restriction was expected... More likely, they will think of another way to give access to free users that is more difficult to abuse.

GiteaMirror commented

2026-05-05 03:27:19 -05:00

@asahu commented on GitHub (Apr 23, 2026):

Same here :(

@asahu commented on GitHub (Apr 23, 2026): Same here :(

GiteaMirror commented

2026-05-05 03:27:20 -05:00

@Azecko commented on GitHub (Apr 23, 2026):

I'm glad to see that I'm not the only one encountering this issue.
I hope that we will have a communication asap about the situation from the Ollama team, because this is really weird.

@Azecko commented on GitHub (Apr 23, 2026): I'm glad to see that I'm not the only one encountering this issue. I hope that we will have a communication asap about the situation from the Ollama team, because this is really weird.

GiteaMirror commented

2026-05-05 03:27:22 -05:00

@25kgozon commented on GitHub (Apr 23, 2026):

Free models got pay walled bc some users were abusing them too much apparently. I'm having the same 403 issue still too

@25kgozon commented on GitHub (Apr 23, 2026): Free models got pay walled bc some users were abusing them too much apparently. I'm having the same 403 issue still too

GiteaMirror commented

2026-05-05 03:27:23 -05:00

@MCOoost commented on GitHub (Apr 23, 2026):

Same issue here. Both kimi-k2.5:cloud and kimi-k2.6:cloud were working perfectly fine for me yesterday (April 22), and today they both return:

Error: this model requires a subscription, upgrade for access

I'm still well within my free cloud usage limits, and other cloud models (e.g., Gemma) continue to work without any issues on the same account.

I completely understand that business models evolve, but the lack of transparency here is disappointing. A silent, unannounced paywall on models that were freely accessible 24 hours ago creates a frustrating experience for users who have come to rely on them. If a subscription is now required, that is fair, but some advance notice or a clear changelog would have been greatly appreciated.

Could the team please clarify:

Was this an intentional change for Kimi models specifically?
Is this a temporary issue or a permanent requirement going forward?

Thank you for the otherwise excellent work on Ollama.

@MCOoost commented on GitHub (Apr 23, 2026): Same issue here. Both kimi-k2.5:cloud and kimi-k2.6:cloud were working perfectly fine for me yesterday (April 22), and today they both return: ` Error: this model requires a subscription, upgrade for access ` I'm still well within my free cloud usage limits, and other cloud models (e.g., Gemma) continue to work without any issues on the same account. I completely understand that business models evolve, but the lack of transparency here is disappointing. A silent, unannounced paywall on models that were freely accessible 24 hours ago creates a frustrating experience for users who have come to rely on them. If a subscription is now required, that is fair, but some advance notice or a clear changelog would have been greatly appreciated. Could the team please clarify: - Was this an intentional change for Kimi models specifically? - Is this a temporary issue or a permanent requirement going forward? Thank you for the otherwise excellent work on Ollama.

GiteaMirror commented

2026-05-05 03:27:25 -05:00

@siathalysedI commented on GitHub (Apr 23, 2026):

Same issue here, kimi-k2.6-cloud and I got:
Error code: 403 - {'error': 'this model requires a subscription, upgrade for access: https://ollama.com/upgrade (ref: 9d6c4cd4-b628-4882-b652-f71f638e117c)'}

I can't understand why this happens and with no further notice and such obscured information about it.

@siathalysedI commented on GitHub (Apr 23, 2026): Same issue here, kimi-k2.6-cloud and I got: `Error code: 403 - {'error': 'this model requires a subscription, upgrade for access: https://ollama.com/upgrade (ref: 9d6c4cd4-b628-4882-b652-f71f638e117c)'}` I can't understand why this happens and with no further notice and such obscured information about it.

GiteaMirror commented

2026-05-05 03:27:26 -05:00

@dnky-1 commented on GitHub (Apr 24, 2026):

same here trying to use glm-5.1 model

@dnky-1 commented on GitHub (Apr 24, 2026): same here trying to use glm-5.1 model

GiteaMirror commented

2026-05-05 03:27:27 -05:00

@FALLEN-01 commented on GitHub (Apr 25, 2026):

I am facing the same issues

@FALLEN-01 commented on GitHub (Apr 25, 2026): I am facing the same issues

GiteaMirror commented

2026-05-05 03:27:30 -05:00

@mahiarirani commented on GitHub (Apr 25, 2026):

same issue trying to use kimi-k2.6:cloud

@mahiarirani commented on GitHub (Apr 25, 2026): same issue trying to use kimi-k2.6:cloud

GiteaMirror commented

2026-05-05 03:27:31 -05:00

@freerider7777 commented on GitHub (Apr 26, 2026):

"There's no such thing as a free lunch."

@freerider7777 commented on GitHub (Apr 26, 2026): "There's no such thing as a free lunch."

GiteaMirror commented

2026-05-05 03:27:33 -05:00

@gry321 commented on GitHub (May 1, 2026):

I am facing the same issues

@gry321 commented on GitHub (May 1, 2026): I am facing the same issues

GiteaMirror commented

2026-05-05 03:27:35 -05:00

@Azecko commented on GitHub (May 2, 2026):

Now same thing with qwen2.5. This is getting annoying.

@Azecko commented on GitHub (May 2, 2026): Now same thing with qwen2.5. This is getting annoying.

GiteaMirror commented

2026-05-05 03:27:39 -05:00

@hiteshseth commented on GitHub (May 2, 2026):

Anyone has a list of models which actually work now? Ollama website lists all which ofcourse is not true

@hiteshseth commented on GitHub (May 2, 2026): Anyone has a list of models which actually work now? Ollama website lists all which ofcourse is not true

GiteaMirror commented

2026-05-05 03:27:45 -05:00

@AccidentalJedi commented on GitHub (May 2, 2026):

OS: Windows 11
MAX Plan (upgraded yesterday)

Latest Ollama update:

CONSTANT 503 service unavailable errors

I haven't decided how much longer I can afford to wait for something reliable. the service hit a pinnacle, and then has become completely useless. I do mean that. COMPLETELY useless. How long do we continue to pay for services we aren't getting? are there going to be compensations, or is it just going to be "our loss"?

@AccidentalJedi commented on GitHub (May 2, 2026): OS: Windows 11 MAX Plan (upgraded yesterday) Latest Ollama update: CONSTANT 503 service unavailable errors I haven't decided how much longer I can afford to wait for something reliable. the service hit a pinnacle, and then has become completely useless. I do mean that. COMPLETELY useless. How long do we continue to pay for services we aren't getting? are there going to be compensations, or is it just going to be "our loss"?

GiteaMirror commented

2026-05-05 03:27:47 -05:00

@zakoche commented on GitHub (May 2, 2026):

same problem but with all model's

@zakoche commented on GitHub (May 2, 2026): same problem but with all model's

GiteaMirror commented

2026-05-05 03:27:50 -05:00

@okankayci commented on GitHub (May 3, 2026):

Forbidden: this model requires a subscription, upgrade for access:

I can't use it at all.

@okankayci commented on GitHub (May 3, 2026): Forbidden: this model requires a subscription, upgrade for access: I can't use it at all.

GiteaMirror commented

2026-05-05 03:27:51 -05:00

@vinnytherobot commented on GitHub (May 3, 2026):

Same error here, I've already tried with GLM-5.1, Minimax-2.7, kimi-2.5 and they all return the same error.

@vinnytherobot commented on GitHub (May 3, 2026): Same error here, I've already tried with GLM-5.1, Minimax-2.7, kimi-2.5 and they all return the same error.

GiteaMirror commented

2026-05-05 03:27:53 -05:00

@hiteshseth commented on GitHub (May 3, 2026):

Even gemma is not working!

Yahoo Mail: Search, Organize, Conquer

On Sun, May 3, 2026 at 8:36, João @.***> wrote: vinnytherobot left a comment (ollama/ollama#15741)
Same error here, I've already tried with GLM-5.1, Minimax-2.7, kimi-2.5 and they all return the same error.

—
Reply to this email directly, view it on GitHub, or unsubscribe.
Triage notifications on the go with GitHub Mobile for iOS or Android.
You are receiving this because you are subscribed to this thread.Message ID: @.***>

@hiteshseth commented on GitHub (May 3, 2026): Even gemma is not working! Yahoo Mail: Search, Organize, Conquer On Sun, May 3, 2026 at 8:36, João ***@***.***> wrote: vinnytherobot left a comment (ollama/ollama#15741) Same error here, I've already tried with GLM-5.1, Minimax-2.7, kimi-2.5 and they all return the same error. — Reply to this email directly, view it on GitHub, or unsubscribe. Triage notifications on the go with GitHub Mobile for iOS or Android. You are receiving this because you are subscribed to this thread.Message ID: ***@***.***>

GiteaMirror commented

2026-05-05 03:27:55 -05:00

@ACheshirov commented on GitHub (May 3, 2026):

Folks keep calling this an "issue", but that’s not really an issue. It’s intentional. They’ve deliberately cut off all the major models for free tier accounts, and there’s a pretty good chance they’re never coming back. :)
From here on out, Ollama doesn’t have much left to offer compared to other similar tools. LM Studio is just miles ahead (more options, better performance, a wider model selection, and a cleaner UI). The only good thing Ollama was better was cloud models.

@ACheshirov commented on GitHub (May 3, 2026): Folks keep calling this an "issue", but that’s not really an issue. It’s intentional. They’ve deliberately cut off all the major models for free tier accounts, and there’s a pretty good chance they’re never coming back. :) From here on out, Ollama doesn’t have much left to offer compared to other similar tools. LM Studio is just miles ahead (more options, better performance, a wider model selection, and a cleaner UI). The only good thing Ollama was better was cloud models.

GiteaMirror commented

2026-05-05 03:27:57 -05:00

@romabysen commented on GitHub (May 4, 2026):

I would say the issue is the complete lack of communication. No announcement for anything. The pricing page says nothing about model restrictions on the free plan.

@romabysen commented on GitHub (May 4, 2026): I would say the issue is the complete lack of communication. No announcement for anything. The pricing page says nothing about model restrictions on the free plan.

GiteaMirror commented

2026-05-05 03:27:59 -05:00

@somera commented on GitHub (May 4, 2026):

[...] The only good thing Ollama was better was cloud models.

And Ollama is not running the cloud models with Ollama. ;) I heard that.

@somera commented on GitHub (May 4, 2026): > [...] The only good thing Ollama was better was cloud models. And Ollama is not running the cloud models with Ollama. ;) I heard that.

GiteaMirror commented

2026-05-05 03:28:00 -05:00

@bam93 commented on GitHub (May 4, 2026):

@Azecko Same here for minimax 2.7 cloud. Worked like a charm for many weeks. I am properly signed in. Also tried to sign out back in, not working. Any news? Unless this is a deliberate policy change, but then it should be officially announced. As I'm hitting the same issue, but with minimax-m2.7:cloud rather than Kimi models — this really appears to be broader than the title suggests -> should the title of the bug be changed to make that clear (cc @Azecko ) ?

Error:

Error: 403 Forbidden: this model requires a subscription, upgrade for access: https://ollama.com/upgrade

Context:

My free cloud credits are not exhausted (weekly and session usage both well within limits)
I am correctly signed in (ollama signin)
Signing out and back in (ollama signout / ollama signin) does not resolve it
This started recently — the same model worked without issue last week

There is no official announcement from Ollama about restricting specific models to paid plans, so I believe this is a bug in how certain models validate credentials on the backend, not an intentional tier change.

@bam93 commented on GitHub (May 4, 2026): @Azecko Same here for minimax 2.7 cloud. Worked like a charm for many weeks. I am properly signed in. Also tried to sign out back in, not working. Any news? Unless this is a deliberate policy change, but then it should be officially announced. As I'm hitting the same issue, but with `minimax-m2.7:cloud` rather than Kimi models — this really appears to be broader than the title suggests -> should the title of the bug be changed to make that clear (cc @Azecko ) ? **Error:** ``` Error: 403 Forbidden: this model requires a subscription, upgrade for access: https://ollama.com/upgrade ``` **Context:** - My free cloud credits are not exhausted (weekly and session usage both well within limits) - I am correctly signed in (`ollama signin`) - Signing out and back in (`ollama signout` / `ollama signin`) does not resolve it - This started recently — the same model worked without issue last week There is no official announcement from Ollama about restricting specific models to paid plans, so I believe this is a bug in how certain models validate credentials on the backend, not an intentional tier change.

GiteaMirror commented

2026-05-05 03:28:02 -05:00

@Azecko commented on GitHub (May 4, 2026):

Updated the issue title since we know see that the problem do not appear only on kimi models.

@Azecko commented on GitHub (May 4, 2026): Updated the issue title since we know see that the problem do not appear only on kimi models.

GiteaMirror commented

2026-05-05 03:28:03 -05:00

@AccidentalJedi commented on GitHub (May 4, 2026):

I'm about to set a deadline that if these issues aren't solved... I'll be cancelling my max plan and moving on. Deepseek V4 models are STILL unusable. why bother listing them if you aren't set up to serve them? That's AFTER today's latest update as well.

@AccidentalJedi commented on GitHub (May 4, 2026): I'm about to set a deadline that if these issues aren't solved... I'll be cancelling my max plan and moving on. Deepseek V4 models are STILL unusable. why bother listing them if you aren't set up to serve them? That's AFTER today's latest update as well.

GiteaMirror commented

2026-05-05 03:28:05 -05:00

@bam93 commented on GitHub (May 4, 2026):

@AccidentalJedi yes for me it's the other way round: I was very seriously considering getting a paid plan with ollama, because I was quite satisfied with the free tier service (just not enough obviously, for my needs), but now if they make this move, and on top of that totally unannounced, out of the blue, that's a show stopper for me. And I thought I had found the rare provider that doesn't mess with the users. I hope they reply properly and officially to this issue here, then I can make an informed decision.

But what is weird: the issue is 2 weeks old.. for me, for the last 2 weeks, minimax 2.7 was working fine. Just when it reset last night, no joy any more. Maybe they are tuning which models to include in the exclusion to tune their cost model? Still there could/should have been an announcement and response by the time now. 😞

@bam93 commented on GitHub (May 4, 2026): @AccidentalJedi yes for me it's the other way round: I was very seriously considering getting a paid plan with ollama, because I was quite satisfied with the free tier service (just not enough obviously, for my needs), but now if they make this move, and on top of that totally unannounced, out of the blue, that's a show stopper for me. And I thought I had found the rare provider that doesn't mess with the users. I hope they reply properly and officially to this issue here, then I can make an informed decision. But what is weird: the issue is 2 weeks old.. for me, for the last 2 weeks, minimax 2.7 was working fine. Just when it reset last night, no joy any more. Maybe they are tuning which models to include in the exclusion to tune their cost model? Still there could/should have been an announcement and response by the time now. 😞

GiteaMirror commented

2026-05-05 03:28:07 -05:00

@ACheshirov commented on GitHub (May 4, 2026):

I already wrote this in Discord, but I’ll post it here as well - even though it looks like there are no admins around, or they just don’t care, since nobody responds anyway.

The main reason I’m holding off on upgrading to Pro is the complete lack of transparency from the team. That’s also why I prefer to just load tokens directly in z.ai for GLM 5.1. It honestly feels like they operate with a “we don’t owe you explanations” mindset.

Literally overnight, they cut off free-tier users from access to almost all cloud models, and didn’t even bother to post an announcement explaining how long this would last, why it was necessary, or anything like that.

My other issue is, again, tied to the same thing - lack of transparency. Most APIs work on a token-based system, where you know exactly how much you’re paying per million tokens. That way, everything is clear and there’s no room for surprises.

Right now, we have no idea how usage is actually being calculated. How am I supposed to trust that tomorrow they won’t just decide the current usage limits don’t work for them anymore and quietly reduce them? And judging by how things look, they’re not exactly the type of persons to explain their decisions - so something like that could easily happen without anyone even knowing.

@ACheshirov commented on GitHub (May 4, 2026): I already wrote this in Discord, but I’ll post it here as well - even though it looks like there are no admins around, or they just don’t care, since nobody responds anyway. The main reason I’m holding off on upgrading to Pro is the complete lack of transparency from the team. That’s also why I prefer to just load tokens directly in z.ai for GLM 5.1. It honestly feels like they operate with a “we don’t owe you explanations” mindset. Literally overnight, they cut off free-tier users from access to almost all cloud models, and didn’t even bother to post an announcement explaining how long this would last, why it was necessary, or anything like that. My other issue is, again, tied to the same thing - lack of transparency. Most APIs work on a token-based system, where you know exactly how much you’re paying per million tokens. That way, everything is clear and there’s no room for surprises. Right now, we have no idea how usage is actually being calculated. How am I supposed to trust that tomorrow they won’t just decide the current usage limits don’t work for them anymore and quietly reduce them? And judging by how things look, they’re not exactly the type of persons to explain their decisions - so something like that could easily happen without anyone even knowing.

GiteaMirror commented

2026-05-05 03:28:09 -05:00

@quarthex commented on GitHub (May 4, 2026):

Updated the issue title since we know see that the problem do not appear only on kimi models.

I believe, that you can restate the title as ~~Big~~Cloud models requires pro?

@quarthex commented on GitHub (May 4, 2026): > Updated the issue title since we know see that the problem do not appear only on kimi models. I believe, that you can restate the title as *~~Big~~Cloud models requires pro?*

GiteaMirror commented

2026-05-05 03:28:13 -05:00

@DanielAraldi commented on GitHub (May 4, 2026):

A complete lack of respect and transparency towards users. Honestly, I'm disgusted.

@DanielAraldi commented on GitHub (May 4, 2026): A complete lack of respect and transparency towards users. Honestly, I'm disgusted.

GiteaMirror commented

2026-05-05 03:28:14 -05:00

@sdev138 commented on GitHub (May 4, 2026):

Having the same issue while testing GLM 5.1 and deepseek-v4-flash. Kept getting 403s where I would need to upgrade despite literally having 0% quota usage atm.

@sdev138 commented on GitHub (May 4, 2026): Having the same issue while testing GLM 5.1 and deepseek-v4-flash. Kept getting 403s where I would need to upgrade despite literally having 0% quota usage atm.

GiteaMirror commented

2026-05-05 03:28:15 -05:00

@nccongg commented on GitHub (May 5, 2026):

nah, i can't use any cloud model

@nccongg commented on GitHub (May 5, 2026): nah, i can't use any cloud model

Sign in to join this conversation.

Branches Tags

main

dhiltgen/ci

dhiltgen/llama-runner

hoyyeva/anthropic-local-image-path

hoyyeva/anthropic-reference-images-path

parth-anthropic-reference-images-path

brucemacd/download-before-remove

hoyyeva/editor-config-repair

parth-mlx-decode-checkpoints

parth-launch-codex-app

hoyyeva/fix-codex-model-metadata-warning

hoyyeva/qwen

parth/hide-claude-desktop-till-release

hoyyeva/opencode-image-modality

parth-add-claude-code-autoinstall

release_v0.22.0

pdevine/manifest-list

codex/fix-codex-model-metadata-warning

pdevine/addressable-manifest

brucemacd/launch-fetch-reccomended

jmorganca/llama-compat

launch-copilot-cli

hoyyeva/opencode-thinking

release_v0.20.7

parth-auto-save-backup

parth-test

jmorganca/gemma4-audio-replacements

fix-manifest-digest-on-pull

hoyyeva/vscode-improve

brucemacd/install-server-wait

parth/update-claude-docs

brucemac/start-ap-install

pdevine/mlx-update

pdevine/qwen35_vision

drifkin/api-show-fallback

mintlify/image-generation-1773352582

hoyyeva/server-context-length-local-config

jmorganca/faster-reptition-penalties

jmorganca/convert-nemotron

parth-pi-thinking

pdevine/sampling-penalties

jmorganca/fix-create-quantization-memory

dongchen/resumable_transfer_fix

pdevine/sampling-cache-error

jessegross/mlx-usage

hoyyeva/openclaw-config

hoyyeva/app-html

pdevine/qwen3next

brucemacd/sign-sh-install

brucemacd/tui-update

brucemacd/usage-api

jmorganca/launch-empty

fix-app-dist-embed

mxyng/mlx-compile

mxyng/mlx-quant

mxyng/mlx-glm4.7

mxyng/mlx

brucemacd/simplify-model-picker

jmorganca/qwen3-concurrent

fix-glm-4.7-flash-mla-config

drifkin/qwen3-coder-opening-tag

brucemacd/usage-cli

fix-cuda12-fattn-shmem

ollama-imagegen-docs

parth/fix-multiline-inputs

brucemacd/config-docs

mxyng/model-files

mxyng/simple-execute

fix-imagegen-ollama-models

mxyng/async-upload

jmorganca/lazy-no-dtype-changes

imagegen-auto-detect-create

parth/decrease-concurrent-download-hf

fix-mlx-quantize-init

jmorganca/x-cleanup

usage

imagegen-readme

jmorganca/glm-image

mlx-gpu-cd

jmorganca/imagegen-modelfile

parth/agent-skills

parth/agent-allowlist

parth/signed-in-offline

parth/agents

parth/fix-context-chopping

improve-cloud-flow

parth/add-models-websearch

parth/prompt-renderer-mcp

jmorganca/native-settings

jmorganca/download-stream-hash

jmorganca/client2-rebased

brucemacd/oai-chat-req-multipart

jessegross/multi_chunk_reserve

grace/additional-omit-empty

grace/mistral-3-large

mxyng/tokenizer2

mxyng/tokenizer

jessegross/flash

hoyyeva/windows-nacked-app

mxyng/cleanup-attention

grace/deepseek-parser

hoyyeva/remember-unsent-prompt

parth/add-lfs-pointer-error-conversion

parth/olmo2-test2

hoyyeva/ollama-launchagent-plist

nicole/olmo-model

parth/olmo-test

mxyng/remove-embedded

parth/render-template

jmorganca/intellect-3

parth/remove-prealloc-linter

jmorganca/cmd-eval

nicole/nomic-embed-text-fix

mxyng/lint-2

hoyyeva/add-gemini-3-pro-preview

hoyyeva/load-model-list

mxyng/expand-path

mxyng/environ-2

hoyyeva/deeplink-json-encoding

parth/improve-tool-calling-tests

hoyyeva/conversation

hoyyeva/assistant-edit-response

hoyyeva/thinking

origin/brucemacd/invalid-char-i-err

parth/improve-tool-calling

jmorganca/required-omitempty

grace/qwen3-vl-tests

mxyng/iter-client

parth/docs-readme

nicole/embed-test

pdevine/integration-benchstat

parth/remove-generate-cmd

parth/add-toolcall-id

mxyng/server-tests

jmorganca/glm-4.6

jmorganca/gin-h-compat

drifkin/stable-tool-args

pdevine/qwen3-more-thinking

parth/add-websearch-client

nicole/websearch_local

jmorganca/qwen3-coder-updates

grace/deepseek-v3-migration-tests

mxyng/fix-create

jmorganca/cloud-errors

pdevine/parser-tidy

revert-12233-parth/simplify-entrypoints-runner

parth/enable-so-gpt-oss

brucemacd/qwen3vl

jmorganca/readme-simplify

parth/gpt-oss-structured-outputs

revert-12039-jmorganca/tools-braces

mxyng/embeddings

mxyng/gguf

mxyng/benchmark

mxyng/types-null

parth/move-parsing

mxyng/gemma2

jmorganca/docs

mxyng/16-bit

mxyng/create-stdin

pdevine/authorizedkeys

mxyng/quant

parth/opt-in-error-context-window

brucemacd/cache-models

brucemacd/runner-completion

jmorganca/llama-update-6

brucemacd/benchmark-list

brucemacd/partial-read-caps

parth/deepseek-r1-tools

mxyng/omit-array

parth/tool-prefix-temp

brucemacd/runner-test

jmorganca/qwen25vl

brucemacd/model-forward-test-ext

parth/python-function-parsing

jmorganca/cuda-compression-none

drifkin/num-parallel

drifkin/chat-truncation-fix

jmorganca/sync

parth/python-tools-calling

drifkin/array-head-count

brucemacd/create-no-loop

parth/server-enable-content-stream-with-tools

qwen25omni

mxyng/v3

brucemacd/ropeconfig

jmorganca/silence-tokenizer

parth/sample-so-test

parth/sampling-structured-outputs

brucemacd/doc-go-engine

parth/constrained-sampling-json

jmorganca/mistral-wip

brucemacd/mistral-small-convert

parth/sample-unmarshal-json-for-params

brucemacd/jomorganca/mistral

pdevine/bfloat16

jmorganca/mistral

brucemacd/mistral

pdevine/logging

parth/sample-correctness-fix

parth/sample-fix-sorting

jmorgan/sample-fix-sorting-extras

jmorganca/temp-0-images

brucemacd/parallel-embed-models

brucemacd/shim-grammar

jmorganca/fix-gguf-error

bmizerany/nameswork

jmorganca/faster-releases

bmizerany/validatenames

brucemacd/err-no-vocab

brucemacd/rope-config

brucemacd/err-hint

brucemacd/qwen2_5

brucemacd/logprobs

brucemacd/new_runner_graph_bench

progress-flicker

brucemacd/forward-test

brucemacd/go_qwen2

pdevine/gemma2

jmorganca/add-missing-symlink-eval

mxyng/next-debug

parth/set-context-size-openai

brucemacd/next-bpe-bench

brucemacd/next-bpe-test

brucemacd/new_runner_e2e

brucemacd/new_runner_qwen2

pdevine/convert-cohere2

brucemacd/convert-cli

parth/log-probs

mxyng/next-mlx

mxyng/cmd-history

parth/templating

parth/tokenize-detokenize

brucemacd/check-key-register

bmizerany/grammar

jmorganca/vendor-081b29bd

mxyng/func-checks

jmorganca/fix-null-format

parth/fix-default-to-warn-json

jmorganca/qwen2vl

jmorganca/no-concat

parth/cmd-cleanup-SO

brucemacd/check-key-register-structured-err

parth/openai-stream-usage

parth/fix-referencing-so

stream-tools-stop

jmorganca/degin-1

brucemacd/install-path-clean

brucemacd/push-name-validation

brucemacd/browser-key-register

jmorganca/openai-fix-first-message

jmorganca/fix-proxy

jessegross/sample

parth/disallow-streaming-tools

dhiltgen/remove_submodule

jmorganca/ga

jmorganca/mllama

pdevine/newlines

pdevine/geems-2b

jmorganca/llama-bump

mxyng/modelname-7

mxyng/gin-slog

mxyng/modelname-6

jyan/convert-prog

jyan/quant5

paligemma-support

pdevine/import-docs

jmorganca/openai-context

jyan/paligemma

jyan/p2

jyan/palitest

bmizerany/embedspeedup

jmorganca/llama-vit

brucemacd/allow-ollama

royh/ep-methods

royh/whisper

mxyng/api-models

mxyng/fix-memory

jyan/q4_4/8

jyan/ollama-v

royh/stream-tools

roy-embed-parallel

bmizerany/hrm

revert-5963-revert-5924-mxyng/llama3.1-rope

royh/embed-viz

jyan/local2

jyan/auth

jyan/local

jyan/parse-temp

jmorganca/template-mistral

jyan/reord-g

royh-openai-suffixdocs

royh-imgembed

royh-embed-parallel

jyan/quant4

royh-precision

jyan/progress

pdevine/fix-template

jyan/quant3

pdevine/ggla

mxyng/update-registry-domain

jmorganca/ggml-static

mxyng/create-context

jyan/v0.146

mxyng/layers-from-files

build_dist

bmizerany/noseek

royh-ls

royh-name

timeout

mxyng/server-timestamp

bmizerany/nosillyggufslurps

royh-params

jmorganca/llama-cpp-7c26775

royh-openai-delete

royh-show-rigid

jmorganca/enable-fa

jmorganca/no-error-template

jyan/format

royh-testdelete

bmizerany/fastverify

language_support

pdevine/ps-glitches

brucemacd/tokenize

bruce/iq-quants

bmizerany/filepathwithcoloninhost

mxyng/split-bin

bmizerany/client-registry

jmorganca/if-none-match

native

jmorganca/native

jmorganca/batch-embeddings

jmorganca/initcmake

jmorganca/mm

pdevine/showggmlinfo

modenameenforcealphanum

bmizerany/modenameenforcealphanum

jmorganca/done-reason

jmorganca/llama-cpp-8960fe8

ollama.com

bmizerany/filepathnobuild

bmizerany/types/model/defaultfix

rmdisplaylong

nogogen

bmizerany/x

modelfile-readme

bmizerany/replacecolon

jmorganca/limit

jmorganca/execstack

jmorganca/replace-assets

mxyng/tune-concurrency

jmorganca/testing

whitespace-detection

jmorganca/options

upgrade-all

scratch

cuda-search

mattw/airenamer

mattw/allmodelsonhuggingface

mattw/quantcontext

mattw/whatneedstorun

brucemacd/llama-mem-calc

mattw/faq-context

mattw/communitylinks

mattw/noprune

mattw/python-functioncalling

rename

mxyng/install

pulse

remove-first

editor

mattw/selfqueryingretrieval

cgo

mattw/howtoquant

api

matt/streamingapi

format-config

mxyng/extra-args

shell

update-nous-hermes

cp-model

upload-progress

fix-unknown-model

fix-model-names

delete-fix

insecure-registry

ls

deletemodels

progressbar

readme-updates

license-layers

skip-list

list-models

modelpath

matt/examplemodelfiles

distribution

go-opts

1 Participants

Notifications

Due Date

No due date set.

Dependencies

No dependencies set.

Reference: github-starred/ollama#72094