[GH-ISSUE #3023] Mamba State Space Models Integration #1859

Open
opened 2026-04-12 11:55:21 -05:00 by GiteaMirror · 7 comments
Owner

Originally created by @MarcellM01 on GitHub (Mar 9, 2024).
Original GitHub issue: https://github.com/ollama/ollama/issues/3023

There has been a completed merge of mamba model support over at Ilama.ccp, would it be possible to implement these into Ollama as well?

Merged PR: https://github.com/ggerganov/llama.cpp/pull/5328

Model examples: https://huggingface.co/collections/Q-bert/mamba-65869481595e25821853d20d

Originally created by @MarcellM01 on GitHub (Mar 9, 2024). Original GitHub issue: https://github.com/ollama/ollama/issues/3023 There has been a completed merge of mamba model support over at Ilama.ccp, would it be possible to implement these into Ollama as well? Merged PR: https://github.com/ggerganov/llama.cpp/pull/5328 Model examples: https://huggingface.co/collections/Q-bert/mamba-65869481595e25821853d20d
GiteaMirror added the model label 2026-04-12 11:55:21 -05:00
Author
Owner

@ekianjo commented on GitHub (Mar 10, 2024):

Yeah, waiting for this integration as well

<!-- gh-comment-id:1987110449 --> @ekianjo commented on GitHub (Mar 10, 2024): Yeah, waiting for this integration as well
Author
Owner

@BradKML commented on GitHub (Mar 12, 2024):

Seconding this, also with RWKV (Eagle) and other fast LLMs like RetNet or H3/S4 https://github.com/ollama/ollama/issues/1612

P.S. Jamba is related to this as well https://github.com/ollama/ollama/issues/3399

<!-- gh-comment-id:1990963471 --> @BradKML commented on GitHub (Mar 12, 2024): Seconding this, also with RWKV (Eagle) and other fast LLMs like RetNet or H3/S4 https://github.com/ollama/ollama/issues/1612 P.S. Jamba is related to this as well https://github.com/ollama/ollama/issues/3399
Author
Owner

@skyler14 commented on GitHub (May 2, 2024):

Has there been any movement on this in the last month?

<!-- gh-comment-id:2089756620 --> @skyler14 commented on GitHub (May 2, 2024): Has there been any movement on this in the last month?
Author
Owner

@skyler14 commented on GitHub (May 16, 2024):

There is a one file implementation of base Mamba that was written here, might be very easy to build an OpenAI style wrapper around it.

https://github.com/johnma2006/mamba-minimal

Since there are some rising architectures that seem to do token free extraction it might be interesting to extend a small amount to those tasks

<!-- gh-comment-id:2115452856 --> @skyler14 commented on GitHub (May 16, 2024): There is a one file implementation of base Mamba that was written here, might be very easy to build an OpenAI style wrapper around it. https://github.com/johnma2006/mamba-minimal Since there are some rising architectures that seem to do token free extraction it might be interesting to extend a small amount to those tasks
Author
Owner

@aretrace commented on GitHub (Sep 9, 2024):

Any update?

<!-- gh-comment-id:2336962047 --> @aretrace commented on GitHub (Sep 9, 2024): Any update?
Author
Owner

@hg0428 commented on GitHub (Oct 1, 2024):

Any update?

<!-- gh-comment-id:2384671869 --> @hg0428 commented on GitHub (Oct 1, 2024): Any update?
Author
Owner

@gabe-l-hart commented on GitHub (Jul 14, 2025):

Hi folks. I've got a draft bumping llama.cpp that extends support for mamba, and adds support for mamba2, falcon-h1, granite4 and a number of other recurrent/hybrid-recurrent models. As with all llama.cpp changes, there are a number of additional changes along for the ride, so any additional help testing models on the draft would be much appreciated.

<!-- gh-comment-id:3069938024 --> @gabe-l-hart commented on GitHub (Jul 14, 2025): Hi folks. I've got [a draft](https://github.com/ollama/ollama/pull/11195) bumping `llama.cpp` that extends support for `mamba`, and adds support for `mamba2`, `falcon-h1`, `granite4` and a number of other recurrent/hybrid-recurrent models. As with all `llama.cpp` changes, there are a number of additional changes along for the ride, so any additional help testing models on the draft would be much appreciated.
Sign in to join this conversation.
1 Participants
Notifications
Due Date
No due date set.
Dependencies

No dependencies set.

Reference: github-starred/ollama#1859