[GH-ISSUE #8184] Falcon3 10B in 1.58bit format #67282

Closed
opened 2026-05-04 09:46:22 -05:00 by GiteaMirror · 2 comments
Owner

Originally created by @thiswillbeyourgithub on GitHub (Dec 20, 2024).
Original GitHub issue: https://github.com/ollama/ollama/issues/8184

I'n sort of surprised that all variants of falcon3 have been added very quickly but not the 1.58bit one and nobody seems to have asked for it.

The full 10B model is only 3.99Go in 1,58bit format according to their hf repo so I think it would be interesting to play with!

Originally created by @thiswillbeyourgithub on GitHub (Dec 20, 2024). Original GitHub issue: https://github.com/ollama/ollama/issues/8184 I'n sort of surprised that all variants of falcon3 have been added very quickly but not the 1.58bit one and nobody seems to have asked for it. The full 10B model is only 3.99Go in 1,58bit format according to [their hf repo](https://huggingface.co/tiiuae/Falcon3-10B-Instruct-1.58bit/tree/main) so I think it would be interesting to play with!
GiteaMirror added the model label 2026-05-04 09:46:22 -05:00
Author
Owner

@rick-github commented on GitHub (Dec 20, 2024):

Ollama doesn't support 1.58 bit quants at the moment. There is an open PR (#7318) but it hasn't been tested with this new model yet.

<!-- gh-comment-id:2557207914 --> @rick-github commented on GitHub (Dec 20, 2024): Ollama doesn't support 1.58 bit quants at the moment. There is an open PR (#7318) but it hasn't been tested with this new model yet.
Author
Owner

@rick-github commented on GitHub (Dec 21, 2024):

Sadly the safetensors for this model contain tensors that llama.cpp doesn't recognize, so converting to a GGUF is currently not possible.

<!-- gh-comment-id:2557976647 --> @rick-github commented on GitHub (Dec 21, 2024): Sadly the safetensors for this model contain tensors that llama.cpp doesn't recognize, so converting to a GGUF is currently not possible.
Sign in to join this conversation.
1 Participants
Notifications
Due Date
No due date set.
Dependencies

No dependencies set.

Reference: github-starred/ollama#67282