[GH-ISSUE #3735] Can you support llama3? #64338

Closed
opened 2026-05-03 17:08:26 -05:00 by GiteaMirror · 27 comments
Owner

Originally created by @ICLXL on GitHub (Apr 18, 2024).
Original GitHub issue: https://github.com/ollama/ollama/issues/3735

https://llama.meta.com/llama3/

GiteaMirror added the feature request label 2026-05-03 17:08:27 -05:00

@selva221724 commented on GitHub (Apr 18, 2024):

Bro, I was about to make that comment? 😂


@ikramhasan commented on GitHub (Apr 18, 2024):

I saw this in the library: https://ollama.com/library/llama3


@sputnick commented on GitHub (Apr 18, 2024):

Instruct models just generate endless text? Is there a missing config file or something or are those models not actually "instruct"?


@ikramhasan commented on GitHub (Apr 18, 2024):

@sputnick does the base model hallucinate less? I'm also using the instruct model.


@taozhiyuai commented on GitHub (Apr 18, 2024):

> Instruct models just generate endless text? Is there a missing config file or something or are those models not actually "instruct"?

Same here. And it doesn't support Chinese.

![WechatIMG4](https://github.com/ollama/ollama/assets/146583103/fc98466c-8376-4332-b721-a25dd3b6fb87)


@DavidBates commented on GitHub (Apr 18, 2024):

> Instruct models just generate endless text? Is there a missing config file or something or are those models not actually "instruct"?

Use `ollama show --modelfile llama3` to get your current model file,
add `PARAMETER stop """assistant"""` at the bottom of that file with your favorite text editor,
save and create with `ollama create l3 -f /path/to/mf`,
then run `ollama run l3`.

From Discord
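Put together, the steps above amount to something like this (a sketch; `llama3.Modelfile` is just a scratch filename):

```
# Dump the current Modelfile to a file
ollama show --modelfile llama3 > llama3.Modelfile

# Append the suggested stop parameter at the bottom
echo 'PARAMETER stop """assistant"""' >> llama3.Modelfile

# Rebuild under a new name and run it
ollama create l3 -f llama3.Modelfile
ollama run l3
```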


@pdevine commented on GitHub (Apr 18, 2024):

It's here peeps. `ollama run llama3`. We had some problems with the vocabulary earlier, but it should be working now. The other quantizations are coming, as are the `text` and 70b models.


@ImVexed commented on GitHub (Apr 18, 2024):

> > Instruct models just generate endless text? Is there a missing config file or something or are those models not actually "instruct"?
>
> Use `ollama show --modelfile llama3` to get your current model file, add `PARAMETER stop """assistant"""` at the bottom of that file with your favorite text editor, save and create with `ollama create l3 -f /path/to/mf`, then run `ollama run l3`.
>
> From Discord

Didn't seem to fix it for me. Not very familiar with the Modelfile format, but tried various combinations of `"`'s around `assistant` and still getting endless output.
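The templates shared later in this thread (and the `<|eot_id|>assistant` text leaking into broken outputs) suggest the token Llama 3 actually emits at the end of each turn is `<|eot_id|>`, so stopping on that token may work better than stopping on the literal word "assistant". A sketch under that assumption, reusing the scratch `llama3.Modelfile` from above:

```
# Append Llama 3's end-of-turn token as the stop sequence instead
echo 'PARAMETER stop "<|eot_id|>"' >> llama3.Modelfile
ollama create l3 -f llama3.Modelfile
ollama run l3
```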


@taozhiyuai commented on GitHub (Apr 18, 2024):

Please update the model information officially:

https://llama.meta.com/docs/model-cards-and-prompt-formats/meta-llama-3


@artem-zinnatullin commented on GitHub (Apr 19, 2024):

There is a good discussion in llama.cpp about the chat format changes in llama3; it seems like both llama.cpp and ollama will need to be updated to fix the endless output, output repetitions, and `"assistant"`, `start_header_id`, and other internal output slipping out to the user.

Links:

  • llama.cpp llama3 issue: https://github.com/ggerganov/llama.cpp/issues/6747
  • llama.cpp PR "Support Llama 3 conversion": https://github.com/ggerganov/llama.cpp/pull/6745
  • llama3 ChatFormat: https://github.com/meta-llama/llama3/blob/359887376f0aaf30e433f23e25df858d8c2a9833/llama/tokenizer.py#L202
  • vllm PR "Add conversation template for llama3-instrcut": https://github.com/vllm-project/vllm/pull/4178

@artem-zinnatullin commented on GitHub (Apr 19, 2024):

Here is how the output looks right now, running on Linux | AMD 7900 XTX 24GB | ollama 0.1.31-rocm (0.1.32-rocm crashes, but that's reported separately in #3693).

Issues:

  • Repeated output
  • Infinite output loops
  • Nonsense in the output like numbers, etc
  • Internal system output like `assistant`, `start_header_id`
![image](https://github.com/ollama/ollama/assets/967132/7a5b3561-cdc1-416a-b945-6ca7f6bb7fc8)
![image](https://github.com/ollama/ollama/assets/967132/4580ead3-0bbd-43a3-ac35-14fc4c2bb901)
![image](https://github.com/ollama/ollama/assets/967132/e1728145-132d-4530-979d-5986b3858346)

@vk2r commented on GitHub (Apr 19, 2024):

This post on [Reddit](https://www.reddit.com/r/LocalLLaMA/comments/1c7dkxh/tutorial_how_to_make_llama3instruct_ggufs_less/) explains how to resolve this problem.


@pdevine commented on GitHub (Apr 19, 2024):

@artem-zinnatullin and @vk2r can you re-pull the image you're using? I'm wondering if you pulled before we fixed the problem earlier today.


@artem-zinnatullin commented on GitHub (Apr 19, 2024):

@pdevine I've just double-checked the hashes of the models I had pulled minutes after release and their behavior as of right now against https://ollama.com/library/llama3/tags

> Tell me a random fun fact about the Roman Empire

  • ✅ `llama3:8b` `a3f3f745d9ef` (upstream hash indeed updated to `71a106a91016`; deleted, re-pulled): seems to work well now!
  • ❌ `llama3:8b-text-q4_1` `9bb55287063f` (same hash upstream): broken output
  • ❌ `llama3:8b-text-q5_0` `3ee7b4839a12` (same hash upstream): repetitive output
  • ✅ `llama3:70b-text` `4872fbd164cc` (same hash upstream): works okay
  • ❌ ✅ `llama3:70b` `bcfb190ca3a7` (same hash upstream): `70b-instruct` has the same hash; I think `70b` is supposed to match `70b-text`, not `-instruct`? Ollama serve froze, didn't release VRAM, and denied requests to other models; I had to restart it.

Environment:

  • AMD Radeon 7900 XTX 24GB VRAM
  • ollama/ollama:0.1.31-rocm docker image

So I think some variations of the models still need to be updated to the fixed versions, and `70b` should point to `70b-text`, not `70b-instruct`?


@taozhiyuai commented on GitHub (Apr 19, 2024):

> @artem-zinnatullin and @vk2r can you re-pull the image you're using? I'm wondering if you pulled before we fixed the problem earlier today.

On my Mac, it always speaks English regardless of what language I speak to it. Even when I set the system prompt to "always reply in Chinese", it does not work. :(

By the way, how do I check if my llama 3 download is up to date?
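One way to check, along the lines of the hash comparison @artem-zinnatullin did above:

```
# Show local models and their digests (the ID column)
ollama list

# Compare the digest against the ones listed at
# https://ollama.com/library/llama3/tags
# If they differ, re-pull; ollama should only download the changed layers:
ollama pull llama3
```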


@MoonRide303 commented on GitHub (Apr 19, 2024):

@taozhiyuai I currently use Q6_K from https://huggingface.co/QuantFactory/Meta-Llama-3-8B-Instruct-GGUF and this Modelfile:

```
FROM ./Meta-Llama-3-8B-Instruct-Q6_K.gguf
TEMPLATE """{{ if .System }}<|start_header_id|>system<|end_header_id|>

{{ .System }}<|eot_id|>{{ end }}<|start_header_id|>user<|end_header_id|>

{{ .Prompt }}<|eot_id|><|start_header_id|>assistant<|end_header_id|>

{{ .Response }}"""
SYSTEM """You are a helpful assistant."""
PARAMETER num_ctx 8192
PARAMETER num_gpu 99
```

and it seems it can speak Chinese:

![image](https://github.com/ollama/ollama/assets/130458190/256ff2b4-e0b6-47fa-b957-26d6fc5e9038)

or Polish:

![image](https://github.com/ollama/ollama/assets/130458190/78d1a0f4-4a01-438b-ab20-cb51329e870c)
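For anyone following along, building and running from a Modelfile like the one above looks roughly like this (the name `llama3-8b-q6k` is just an example):

```
# Build a local model from the Modelfile in the current directory
# (it must sit next to the GGUF referenced by its FROM line)
ollama create llama3-8b-q6k -f Modelfile
ollama run llama3-8b-q6k
```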


@chrisbward commented on GitHub (Apr 19, 2024):

Hi, using `Meta-Llama-3-8B-Instruct.Q5_K_M.gguf` from https://huggingface.co/PrunaAI/Meta-Llama-3-8B-Instruct-GGUF-smashed

Following @MoonRide303's Modelfile, I wrote this:

```
FROM ./Meta-Llama-3-8B-Instruct.Q5_K_M.gguf

TEMPLATE """{{ if .System }}<|start_header_id|>system<|end_header_id|>

{{ .System }}<|eot_id|>{{ end }}<|start_header_id|>user<|end_header_id|>

{{ .Prompt }}<|eot_id|><|start_header_id|>assistant<|end_header_id|>

{{ .Response }}"""
SYSTEM """You are a helpful assistant."""
PARAMETER num_ctx 8192
PARAMETER num_gpu 99
```

ends up with a stream of text?


@MoonRide303 commented on GitHub (Apr 19, 2024):

@chrisbward With the corrected GGUF from QuantFactory it just stops after each answer, as it should:

![image](https://github.com/ollama/ollama/assets/130458190/d3867336-86a9-40e4-931d-994239462c4f)

@chrisbward commented on GitHub (Apr 19, 2024):

I did that as a test and I got:

```
➜  ~ ollama run llama3-8B-instruct-gguf-q6-k
>>> hello
Hello! It's nice to meet you. Is there something I can help you with, or
would you like to chat?<|eot_id|>assistant

I'm happy to chat with you if you'd like. We could talk about your
interests, hobbies, or anything else that's on your mind. If you're
feeling stuck or need some advice, I'm here to listen and offer guidance.

Or, if you have a specific question or topic in mind, feel free to ask me
anything!<|eot_id|>assistant

That sounds great! I've been thinking about learning more about AI and
machine learning. Do you know of any good resources for beginners?

Also, I've been hearing a lot about the importance of mental health and
self-care lately. What are some ways that people can prioritize their
well-being?<|eot_id|>assistant
```

etc.

Same model and Modelfile


@chrisbward commented on GitHub (Apr 19, 2024):

```
>>> /show modelfile
# Modelfile generated by "ollama show"
# To build a new Modelfile based on this one, replace the FROM line with:
# FROM llama3-8B-instruct-gguf-q6-k:latest

FROM /usr/share/ollama/.ollama/models/blobs/sha256-13c5c30a3c9404af369a7b66ce1027097ce02a6b5cc0b17a8df5e414c62d93f6
TEMPLATE """{{ if .System }}<|start_header_id|>system<|end_header_id|>

{{ .System }}<|eot_id|>{{ end }}<|start_header_id|>user<|end_header_id|>

{{ .Prompt }}<|eot_id|><|start_header_id|>assistant<|end_header_id|>

{{ .Response }}"""
SYSTEM """You are a helpful assistant."""
PARAMETER num_ctx 8192
PARAMETER num_gpu 99
```

@MoonRide303 commented on GitHub (Apr 19, 2024):

@chrisbward It means you're using a GGUF with a broken tokenizer. Try one from the https://huggingface.co/QuantFactory/Meta-Llama-3-8B-Instruct-GGUF repository (I use Q6_K); new versions with a properly working tokenizer were uploaded there today.


@chrisbward commented on GitHub (Apr 19, 2024):

Interesting! The model I downloaded does not match the sha256 checksum.
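A quick way to verify that on a Linux shell (the filename here is the Q5_K_M file mentioned earlier; substitute whichever GGUF was downloaded):

```
# Compute the local file's checksum...
sha256sum Meta-Llama-3-8B-Instruct.Q5_K_M.gguf
# ...and compare it with the SHA256 shown for that file on its
# Hugging Face "Files" page before importing it with a Modelfile.
```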


@chrisbward commented on GitHub (Apr 19, 2024):

Apologies - seems like I grabbed the quant from https://huggingface.co/lmstudio-community/Meta-Llama-3-8B-Instruct-GGUF


@chrisbward commented on GitHub (Apr 19, 2024):

@MoonRide303 perfect - confirming that https://huggingface.co/QuantFactory/Meta-Llama-3-8B-Instruct-GGUF is the way to go, with your Modelfile config, thank you!


@taozhiyuai commented on GitHub (Apr 19, 2024):

> @taozhiyuai I currently use Q6_K from https://huggingface.co/QuantFactory/Meta-Llama-3-8B-Instruct-GGUF and this Modelfile:
>
> ```
> FROM ./Meta-Llama-3-8B-Instruct-Q6_K.gguf
> TEMPLATE """{{ if .System }}<|start_header_id|>system<|end_header_id|>
>
> {{ .System }}<|eot_id|>{{ end }}<|start_header_id|>user<|end_header_id|>
>
> {{ .Prompt }}<|eot_id|><|start_header_id|>assistant<|end_header_id|>
>
> {{ .Response }}"""
> SYSTEM """You are a helpful assistant."""
> PARAMETER num_ctx 8192
> PARAMETER num_gpu 99
> ```
>
> and it seems it can speak Chinese: ![image](https://github.com/ollama/ollama/assets/130458190/256ff2b4-e0b6-47fa-b957-26d6fc5e9038)
>
> or Polish: ![image](https://github.com/ollama/ollama/assets/130458190/78d1a0f4-4a01-438b-ab20-cb51329e870c)

Yes, it is a solution, but normally it is not necessary for most other LLMs; it should reply in the language the user speaks. Strange. If you do not set the system prompt, can it speak Polish? @MoonRide303

By the way, I pulled the model from ollama, not imported it from HF. The Modelfile is pre-set, which is different from what you use.


@MoonRide303 commented on GitHub (Apr 19, 2024):

@taozhiyuai It can do it, but not really reliably (2nd answer is not Polish):

![image](https://github.com/ollama/ollama/assets/130458190/bff8b54c-f603-45ac-8bcf-368b59ad4b4b)

AFAIK L3 wasn't trained much on other languages; I've seen information that most of it (like 95%) was English. But I cannot find the source of this information now, so I am not sure how reliable that was - I guess it's best to wait for the full official paper on this release.


@jmorganca commented on GitHub (Apr 19, 2024):

Hi all! Llama 3 has been added: https://ollama.com/library/llama3

Please share if you're seeing more oddities in the output, in case there are still some minor issues with the tokenizer
