[GH-ISSUE #8618] Support Janus-Pro-7b for vision models #31337

New Issue

GiteaMirror · 2026-04-22T11:42:31-05:00

GiteaMirror commented

2026-04-22 11:42:31 -05:00

Originally created by @franz101 on GitHub (Jan 27, 2025).
Original GitHub issue: https://github.com/ollama/ollama/issues/8618

Just announced and performing great with OCR
https://huggingface.co/deepseek-ai/Janus-Pro-7B

Originally created by @franz101 on GitHub (Jan 27, 2025). Original GitHub issue: https://github.com/ollama/ollama/issues/8618 Just announced and performing great with OCR https://huggingface.co/deepseek-ai/Janus-Pro-7B

GiteaMirror added the feature request label 2026-04-22 11:42:32 -05:00

GiteaMirror commented

2026-04-22 11:42:33 -05:00

@skytodmoon commented on GitHub (Jan 28, 2025):

Mark +1

@skytodmoon commented on GitHub (Jan 28, 2025): Mark +1

GiteaMirror commented

2026-04-22 11:42:34 -05:00

@libing64 commented on GitHub (Jan 28, 2025):

+1

@libing64 commented on GitHub (Jan 28, 2025): +1

GiteaMirror commented

2026-04-22 11:42:34 -05:00

@kattatzu commented on GitHub (Jan 28, 2025):

+1

@kattatzu commented on GitHub (Jan 28, 2025): +1

GiteaMirror commented

2026-04-22 11:42:35 -05:00

@dengber commented on GitHub (Jan 28, 2025):

+1

@dengber commented on GitHub (Jan 28, 2025): +1

GiteaMirror commented

2026-04-22 11:42:35 -05:00

@random-zhu commented on GitHub (Jan 28, 2025):

Mark +1

@random-zhu commented on GitHub (Jan 28, 2025): Mark +1

GiteaMirror commented

2026-04-22 11:42:35 -05:00

@sakujor commented on GitHub (Jan 28, 2025):

+1

@sakujor commented on GitHub (Jan 28, 2025): +1

GiteaMirror commented

2026-04-22 11:42:36 -05:00

@DhairyaNxtgen commented on GitHub (Jan 28, 2025):

+1

@DhairyaNxtgen commented on GitHub (Jan 28, 2025): +1

GiteaMirror commented

2026-04-22 11:42:36 -05:00

@TheurgicDuke771 commented on GitHub (Jan 28, 2025):

+1

@TheurgicDuke771 commented on GitHub (Jan 28, 2025): +1

GiteaMirror commented

2026-04-22 11:42:37 -05:00

@philogicae commented on GitHub (Jan 28, 2025):

+1

@philogicae commented on GitHub (Jan 28, 2025): +1

GiteaMirror commented

2026-04-22 11:42:38 -05:00

@ImranR98 commented on GitHub (Jan 28, 2025):

Commenting "+1" sends an unnecessary email to everyone who is subscribed to the issue. Probably a better idea to just add a thumbs up to the original post.

@ImranR98 commented on GitHub (Jan 28, 2025): Commenting "+1" sends an unnecessary email to everyone who is subscribed to the issue. Probably a better idea to just add a thumbs up to the original post.

GiteaMirror commented

2026-04-22 11:42:39 -05:00

@edgett commented on GitHub (Jan 28, 2025):

+1

@edgett commented on GitHub (Jan 28, 2025): +1

GiteaMirror commented

2026-04-22 11:42:39 -05:00

@BrokenByteOfCode commented on GitHub (Jan 28, 2025):

+1

@BrokenByteOfCode commented on GitHub (Jan 28, 2025): +1

GiteaMirror commented

2026-04-22 11:42:40 -05:00

@movitecc commented on GitHub (Jan 28, 2025):

+1

@movitecc commented on GitHub (Jan 28, 2025): +1

GiteaMirror commented

2026-04-22 11:42:41 -05:00

@iammrbt commented on GitHub (Jan 28, 2025):

+1

@iammrbt commented on GitHub (Jan 28, 2025): +1

GiteaMirror commented

2026-04-22 11:42:41 -05:00

@cmheong commented on GitHub (Jan 28, 2025):

+1

@cmheong commented on GitHub (Jan 28, 2025): +1

GiteaMirror commented

2026-04-22 11:42:42 -05:00

@wwek commented on GitHub (Jan 29, 2025):

+1

@wwek commented on GitHub (Jan 29, 2025): +1

GiteaMirror commented

2026-04-22 11:42:43 -05:00

@OverStruck commented on GitHub (Jan 29, 2025):

+1

@OverStruck commented on GitHub (Jan 29, 2025): +1

GiteaMirror commented

2026-04-22 11:42:44 -05:00

@4austinpowers commented on GitHub (Jan 29, 2025):

+1

@4austinpowers commented on GitHub (Jan 29, 2025): +1

GiteaMirror commented

2026-04-22 11:42:44 -05:00

@deadprogram commented on GitHub (Jan 29, 2025):

How about also https://huggingface.co/deepseek-ai/Janus-Pro-1B for whoever has the correct setup also to import this, please.

@deadprogram commented on GitHub (Jan 29, 2025): How about also https://huggingface.co/deepseek-ai/Janus-Pro-1B for whoever has the correct setup also to import this, please.

GiteaMirror commented

2026-04-22 11:42:44 -05:00

@tobalo commented on GitHub (Jan 29, 2025):

+1

@tobalo commented on GitHub (Jan 29, 2025): +1

GiteaMirror commented

2026-04-22 11:42:45 -05:00

@nurena24 commented on GitHub (Jan 30, 2025):

+1

@nurena24 commented on GitHub (Jan 30, 2025): +1

GiteaMirror commented

2026-04-22 11:42:46 -05:00

@xindoreen commented on GitHub (Jan 30, 2025):

+1

@xindoreen commented on GitHub (Jan 30, 2025): +1

GiteaMirror commented

2026-04-22 11:42:46 -05:00

@toplinuxsir commented on GitHub (Jan 30, 2025):

+1

@toplinuxsir commented on GitHub (Jan 30, 2025): +1

GiteaMirror commented

2026-04-22 11:42:47 -05:00

@zytoh0 commented on GitHub (Jan 30, 2025):

Just announced and performing great with OCR https://huggingface.co/deepseek-ai/Janus-Pro-7B
Not just 7B but also 1B :)
https://huggingface.co/deepseek-ai/Janus-Pro-1B
https://huggingface.co/deepseek-ai/Janus-Pro-7B

@zytoh0 commented on GitHub (Jan 30, 2025): > Just announced and performing great with OCR https://huggingface.co/deepseek-ai/Janus-Pro-7B Not just 7B but also 1B :) https://huggingface.co/deepseek-ai/Janus-Pro-1B https://huggingface.co/deepseek-ai/Janus-Pro-7B

GiteaMirror commented

2026-04-22 11:42:47 -05:00

@MIC-BO commented on GitHub (Jan 31, 2025):

+1

@MIC-BO commented on GitHub (Jan 31, 2025): +1

GiteaMirror commented

2026-04-22 11:42:48 -05:00

@snt1017 commented on GitHub (Jan 31, 2025):

+1

@snt1017 commented on GitHub (Jan 31, 2025): +1

GiteaMirror commented

2026-04-22 11:42:48 -05:00

@jorgevespa commented on GitHub (Jan 31, 2025):

+1

@jorgevespa commented on GitHub (Jan 31, 2025): +1

GiteaMirror commented

2026-04-22 11:42:48 -05:00

@isaacasancheza commented on GitHub (Jan 31, 2025):

+1

@isaacasancheza commented on GitHub (Jan 31, 2025): +1

GiteaMirror commented

2026-04-22 11:42:49 -05:00

@wlsoft2006 commented on GitHub (Feb 1, 2025):

+1

@wlsoft2006 commented on GitHub (Feb 1, 2025): +1

GiteaMirror commented

2026-04-22 11:42:49 -05:00

@kongkang commented on GitHub (Feb 1, 2025):

+1

@kongkang commented on GitHub (Feb 1, 2025): +1

GiteaMirror commented

2026-04-22 11:42:50 -05:00

@jackwang2 commented on GitHub (Feb 3, 2025):

+1

@jackwang2 commented on GitHub (Feb 3, 2025): +1

GiteaMirror commented

2026-04-22 11:42:51 -05:00

@maddinek commented on GitHub (Feb 3, 2025):

+1

@maddinek commented on GitHub (Feb 3, 2025): +1

GiteaMirror commented

2026-04-22 11:42:51 -05:00

@jangrewe commented on GitHub (Feb 4, 2025):

Please STOP COMMENTING +1, use the 👍 reaction to the original post instead!

@jangrewe commented on GitHub (Feb 4, 2025): ### Please **STOP COMMENTING +1**, use the 👍 reaction to the original post instead!

GiteaMirror commented

2026-04-22 11:42:52 -05:00

@philogicae commented on GitHub (Feb 4, 2025):

Please STOP COMMENTING +1, use the 👍 reaction to the original post instead!

No.

@philogicae commented on GitHub (Feb 4, 2025): > ### Please **STOP COMMENTING +1**, use the 👍 reaction to the original post instead! No. ![Image](https://github.com/user-attachments/assets/b32f941a-4768-45f5-b9fd-77a3eac7e446)

GiteaMirror commented

2026-04-22 11:42:53 -05:00

@jangrewe commented on GitHub (Feb 4, 2025):

No.

What kind of special ~~idiot~~... individual are you? This is not about notifications, but about useless noise that adds nothing to the discussion.

@jangrewe commented on GitHub (Feb 4, 2025): > No. What kind of special ~idiot~... _individual_ are you? This is not about notifications, but about useless noise that adds nothing to the discussion.

GiteaMirror commented

2026-04-22 11:42:53 -05:00

@svaningelgem commented on GitHub (Feb 4, 2025):

What kind of special idiot are you?

Let's keep things professional, even though other people might annoy you...

What would be most useful to me is a guide on how to create & upload such model. I'd do this myself then...

@svaningelgem commented on GitHub (Feb 4, 2025): > What kind of special idiot are you? Let's keep things professional, even though other people might annoy you... What would be most useful to me is a guide on how to create & upload such model. I'd do this myself then...

GiteaMirror commented

2026-04-22 11:42:54 -05:00

@jangrewe commented on GitHub (Feb 4, 2025):

Let's keep things professional

fixed.

@jangrewe commented on GitHub (Feb 4, 2025): > Let's keep things professional fixed.

GiteaMirror commented

2026-04-22 11:42:55 -05:00

@dandv commented on GitHub (Feb 4, 2025):

Let's keep things professional

No, but seriously, what kind of people who can:

use GitHub
are interested in a CLI tool
to run inference locally

Don't already know to NOT SPAM WITH STUPDID +1s

AND

Keep doing it after commends advising very nicely NOT TO DO SO.

Are these bots? An influx of complete and utter GitHub n00bs?

@dandv commented on GitHub (Feb 4, 2025): > Let's keep things professional No, but seriously, what kind of people who can: - use GitHub - are interested in a CLI tool - to run inference locally Don't already know to NOT SPAM WITH STUPDID `+1`s AND Keep doing it after commends advising [very nicely](https://github.com/ollama/ollama/issues/8618#issuecomment-2619487542) NOT TO DO SO. Are these bots? An influx of complete and utter GitHub n00bs?

GiteaMirror commented

2026-04-22 11:42:55 -05:00

@vertago1 commented on GitHub (Feb 4, 2025):

Let's keep things professional

No, but seriously, what kind of people who can:

use GitHub

are interested in a CLI tool

to run inference locally

Don't already know to NOT SPAM WITH STUPDID +1s

AND

Keep doing it after commends advising very nicely NOT TO DO SO.

Are these bots? An influx of complete and utter GitHub n00bs?

They must not be devs or they would realize that kind of thing leads to turning off notifications for a thread and it going off the devs radar which is counterproductive if they really want this added.

@vertago1 commented on GitHub (Feb 4, 2025): > > Let's keep things professional > > No, but seriously, what kind of people who can: > > * use GitHub > * are interested in a CLI tool > * to run inference locally > > Don't already know to NOT SPAM WITH STUPDID `+1`s > > AND > > Keep doing it after commends advising [very nicely](https://github.com/ollama/ollama/issues/8618#issuecomment-2619487542) NOT TO DO SO. > > Are these bots? An influx of complete and utter GitHub n00bs? They must not be devs or they would realize that kind of thing leads to turning off notifications for a thread and it going off the devs radar which is counterproductive if they really want this added.

GiteaMirror commented

2026-04-22 11:42:56 -05:00

@cmheong commented on GitHub (Feb 5, 2025):

I got to this thread because a Google search directed me here, so this is probably not the place to post this comment, so my apologies in advance to the irritable ones on the mailing list. The reason everyone is here is we want to use Janus-Pro-7b from ollama. I get it, it is not supported as of now. Now I only got ollama last week so I am definitely a newbie. I simply asked Deepseek how to run Janus-Pro-7b-LM from ollama, and the instructions it gave actually worked. I am now running it from ollama. For those who are interested, the instructions are:
Download the gguf from https://huggingface.co/mradermacher/Janus-Pro-7B-LM-GGUF/blob/main/Janus-Pro-7B-LM.Q4_K_M.gguf
Copy it to your docker ollama container. I used 'docker cp'
Make the file Modelfile in the same directory containing the line:
./Janus-Pro-7B-LM.Q4_K_M.gguf
From your docker container, run the command
ollama create janus-pro-7b-lm -f Modelfile
Then run
ollama run janus-pro-7b-lm
That is all. Have fun with janus-pro-7b. I sure am.

@cmheong commented on GitHub (Feb 5, 2025): I got to this thread because a Google search directed me here, so this is probably not the place to post this comment, so my apologies in advance to the irritable ones on the mailing list. The reason everyone is here is we want to use Janus-Pro-7b from ollama. I get it, it is not supported as of now. Now I only got ollama last week so I am definitely a newbie. I simply asked Deepseek how to run Janus-Pro-7b-LM from ollama, and the instructions it gave actually worked. I am now running it from ollama. For those who are interested, the instructions are: Download the gguf from https://huggingface.co/mradermacher/Janus-Pro-7B-LM-GGUF/blob/main/Janus-Pro-7B-LM.Q4_K_M.gguf Copy it to your docker ollama container. I used 'docker cp' Make the file Modelfile in the same directory containing the line: ./Janus-Pro-7B-LM.Q4_K_M.gguf From your docker container, run the command ollama create janus-pro-7b-lm -f Modelfile Then run ollama run janus-pro-7b-lm That is all. Have fun with janus-pro-7b. I sure am.

GiteaMirror commented

2026-04-22 11:42:56 -05:00

@davrot commented on GitHub (Feb 5, 2025):

@cmheong Could you share the working Modelfile with us? Thanks!

@davrot commented on GitHub (Feb 5, 2025): @cmheong Could you share the working Modelfile with us? Thanks!

GiteaMirror commented

2026-04-22 11:42:57 -05:00

@jangrewe commented on GitHub (Feb 5, 2025):

@davrot Uhm... he says what you need to put in there? Those files are not rocket surgery, but just to make sure:

FROM  /path/to/Janus-Pro-7B-LM.Q4_K_M.gguf

For your reference: https://github.com/ollama/ollama/blob/main/docs/modelfile.md

@jangrewe commented on GitHub (Feb 5, 2025): @davrot Uhm... he says what you need to put in there? Those files are not rocket surgery, but just to make sure: ``` FROM /path/to/Janus-Pro-7B-LM.Q4_K_M.gguf ``` For your reference: https://github.com/ollama/ollama/blob/main/docs/modelfile.md

GiteaMirror commented

2026-04-22 11:42:57 -05:00

@jangrewe commented on GitHub (Feb 5, 2025):

@davrot Open WebUI != Ollama

@jangrewe commented on GitHub (Feb 5, 2025): @davrot Open WebUI != Ollama

GiteaMirror commented

2026-04-22 11:42:58 -05:00

@sealad886 commented on GitHub (Feb 6, 2025):

@jangrewe Can tell me how do you send images to "ollama run janus-pro-7b-lm" ?

Multimodal Models are described in the main README.md, near the bottom.

If you're having issues with a specific non-Ollama tool/frontend that connects to the Ollama API, see the documentation for that tool separately.

@sealad886 commented on GitHub (Feb 6, 2025): > [@jangrewe](https://github.com/jangrewe) Can tell me how do you send images to "ollama run janus-pro-7b-lm" ? > > ![Image](https://github.com/user-attachments/assets/64c426f4-a657-48a4-a9c4-3035d701c17b) [Multimodal Models](https://github.com/ollama/ollama?tab=readme-ov-file#multimodal-models) are described in the main README.md, near the bottom. If you're having issues with a specific non-Ollama tool/frontend that _connects to_ the Ollama API, see the documentation for that tool separately.

GiteaMirror commented

2026-04-22 11:42:58 -05:00

@davrot commented on GitHub (Feb 6, 2025):

ollama run janus-pro-7b-lm "What do you see in the image /data_1/deepseek/kohlfahrt0015.jpg"
?**

I don't see an image, I see a question asking me to provide information about a specific image or data file that may contain
a unique identifier and name format, possibly related to "deepseek" and "kohlfahrt". However, there is no actual visual
content associated with this request. It seems like the text contains placeholder characters, which might be due to encoding
issues or incomplete instructions. If you could provide more context or clarify what you're trying to achieve by asking
about an image or data file based on a specific name and identifier, I'd be happy to assist further!

ollama run llama3.2-vision:11b "What do you see in the image /data_1/deepseek/kohlfahrt0015.jpg"
Added image '/data_1/deepseek/kohlfahrt0015.jpg'
The image shows a group of people walking together, with trees and buildings visible in the background.

A group of people are walking together.
+ There are approximately 10 individuals in the group.
+ They appear to be walking on a sidewalk or path.
+ Some of them are looking at something off-camera, while others seem to be engaged in conversation.
The group is made up of both men and women.
+ The men are wearing casual clothing such as jeans and t-shirts.
+ The women are also dressed casually, with some wearing dresses or skirts.
They are all wearing similar jackets or coats.
+ The jackets are dark-colored and appear to be waterproof or windproof.
+ Some of the individuals have their hands in their pockets, while others are holding onto bags or other items.

Overall, the image suggests that the group is on a casual outing or hike, possibly enjoying the outdoors together.

@davrot commented on GitHub (Feb 6, 2025): > ollama run janus-pro-7b-lm "What do you see in the image /data_1/deepseek/kohlfahrt0015.jpg" ?** I don't see an image, I see a question asking me to provide information about a specific image or data file that may contain a unique identifier and name format, possibly related to "deepseek" and "kohlfahrt". However, there is no actual visual content associated with this request. It seems like the text contains placeholder characters, which might be due to encoding issues or incomplete instructions. If you could provide more context or clarify what you're trying to achieve by asking about an image or data file based on a specific name and identifier, I'd be happy to assist further! > ollama run llama3.2-vision:11b "What do you see in the image /data_1/deepseek/kohlfahrt0015.jpg" Added image '/data_1/deepseek/kohlfahrt0015.jpg' The image shows a group of people walking together, with trees and buildings visible in the background. * A group of people are walking together. + There are approximately 10 individuals in the group. + They appear to be walking on a sidewalk or path. + Some of them are looking at something off-camera, while others seem to be engaged in conversation. * The group is made up of both men and women. + The men are wearing casual clothing such as jeans and t-shirts. + The women are also dressed casually, with some wearing dresses or skirts. * They are all wearing similar jackets or coats. + The jackets are dark-colored and appear to be waterproof or windproof. + Some of the individuals have their hands in their pockets, while others are holding onto bags or other items. Overall, the image suggests that the group is on a casual outing or hike, possibly enjoying the outdoors together.

GiteaMirror commented

2026-04-22 11:42:58 -05:00

@sealad886 commented on GitHub (Feb 6, 2025):

Hey @davrot thanks for pasting from the shell terminal there. If you could, if would be very helpful to use the Markdown tags for indicating scripting, etc, so that that output is a bit clearer in terms of what commands you gave and what the output was, vs your own exposition (if any--based on the text, I'm assuming that's 100% LLM generated).

As another resource, you can check out the Llama3.2-Vision blog post that has usage information for that model, or the LLaVA announcement post that uses a slightly different method to interact with the model.

Overall, CLI-based multimodal interaction doesn't appear to be consistent across models. All models should be able to accept an image through the API, it seems. Refer back to those blog posts (in particular the Llama3.2-Vision one) for links to the docs.

@sealad886 commented on GitHub (Feb 6, 2025): Hey @davrot thanks for pasting from the shell terminal there. If you could, if would be very helpful to use the Markdown tags for indicating scripting, etc, so that that output is a bit clearer in terms of what commands you gave and what the output was, vs your own exposition (if any--based on the text, I'm assuming that's 100% LLM generated). As another resource, you can check out the [Llama3.2-Vision](https://ollama.com/blog/llama3.2-vision) blog post that has usage information for that model, or the [LLaVA announcement post](https://ollama.com/blog/vision-models) that uses a slightly different method to interact with the model. Overall, CLI-based multimodal interaction doesn't appear to be consistent across models. All models should be able to accept an image through the API, it seems. Refer back to those blog posts (in particular the Llama3.2-Vision one) for links to the docs.

GiteaMirror commented

2026-04-22 11:42:58 -05:00

@sealad886 commented on GitHub (Feb 6, 2025):

It doesn't appear that the GGUF available from HF actually works.

input:

response: ollama.ChatResponse = ollama.chat(model=model, messages=[
    {
            'role': 'user',
            'contents': 'Tell me about this image.',
            'images': ['/path/to/local/image.webp']
    }
])

print(response.message.content):

 * Hello, World!</div>
        <p id="text-1" class="para">Lorem ipsum dolor sit amet, consectetur adipiscing elit. Pellentesque eget arcu quis sapien euismod bibendum.</p>
        <p id="text-2" class="para">Nunc et orci non libero luctus convallis nec vel quam. Aliquam erat volutpat. Suspendisse sit amet ante ut nunc tristique aliquet.</p>
      </div>
    </body>
  </html>

To be fair, I don't know if the webp format is supported in this model or in the conversion to what I assume is base64, so that may be one thing causing issues here. But suffice it to say that that response is a wildly inappropriate response to the query posed.

@sealad886 commented on GitHub (Feb 6, 2025): It doesn't appear that the GGUF available from HF actually works. input: ```python response: ollama.ChatResponse = ollama.chat(model=model, messages=[ { 'role': 'user', 'contents': 'Tell me about this image.', 'images': ['/path/to/local/image.webp'] } ]) ``` `print(response.message.content)`: ```python * Hello, World!</div> <p id="text-1" class="para">Lorem ipsum dolor sit amet, consectetur adipiscing elit. Pellentesque eget arcu quis sapien euismod bibendum.</p> <p id="text-2" class="para">Nunc et orci non libero luctus convallis nec vel quam. Aliquam erat volutpat. Suspendisse sit amet ante ut nunc tristique aliquet.</p> </div> </body> </html> ``` To be fair, I don't know if the webp format is supported in this model or in the conversion to what I assume is base64, so that may be one thing causing issues here. But suffice it to say that that response is a wildly inappropriate response to the query posed.

GiteaMirror commented

2026-04-22 11:42:59 -05:00

@davrot commented on GitHub (Feb 6, 2025):

It seems that llama.cpp is working on it:

Add supports for Janus vision encoder and projector [WIP] #11646
https://github.com/ggerganov/llama.cpp/pull/11646

@davrot commented on GitHub (Feb 6, 2025): It seems that llama.cpp is working on it: > Add supports for Janus vision encoder and projector [WIP] #11646 > https://github.com/ggerganov/llama.cpp/pull/11646

GiteaMirror commented

2026-04-22 11:42:59 -05:00

@ravenouse commented on GitHub (Feb 6, 2025):

From my understanding, the current GGUF models available on Hugging Face do not include the vision encoder and projector components—only the language model. This means that the Janus model lacks image understanding when running with Ollama.

I have submitted a PR to llama.cpp and am working on adding support for the Janus vision encoder and projector. The main challenge is the customized code used by the DeepSeek team, along with potential modifications to the clip model architecture in C++. As a result, this PR may take some time to complete.

@ravenouse commented on GitHub (Feb 6, 2025): From my understanding, the current GGUF models available on Hugging Face do not include the vision encoder and projector components—only the language model. This means that the Janus model lacks image understanding when running with Ollama. I have submitted a [PR](https://github.com/ggerganov/llama.cpp/pull/11646) to llama.cpp and am working on adding support for the Janus vision encoder and projector. The main challenge is the customized code used by the DeepSeek team, along with potential modifications to the clip model architecture in C++. As a result, this PR may take some time to complete.

GiteaMirror commented

2026-04-22 11:43:00 -05:00

@S4GU4R0 commented on GitHub (Feb 8, 2025):

Are these bots? An influx of complete and utter GitHub n00bs?

It seems like it, or they're literally children. Having worked with kids in an online context, enthusiasm sometimes comes across as spam and bot-like behavior.

@S4GU4R0 commented on GitHub (Feb 8, 2025): > Are these bots? An influx of complete and utter GitHub n00bs? It seems like it, or they're literally children. Having worked with kids in an online context, enthusiasm sometimes comes across as spam and bot-like behavior.

GiteaMirror commented

2026-04-22 11:43:00 -05:00

@Forevery1 commented on GitHub (Feb 14, 2025):

+1

@Forevery1 commented on GitHub (Feb 14, 2025): +1

GiteaMirror commented

2026-04-22 11:43:00 -05:00

@DarkAlchy commented on GitHub (Feb 16, 2025):

Janus-7B is the best vision model I have tried to date locally as I gave it an image, and what it described I fed to Flux. The output Flux dev gave back was almost a verbatim copy. It did mess up the woman (a silhouette) to be a man, but the room was almost identical even to the images on the walls. Jaw dropped. Llama-3.2-vision is not even close, and the other ones I used to use are rubbish in comparison.

@DarkAlchy commented on GitHub (Feb 16, 2025): Janus-7B is the best vision model I have tried to date locally as I gave it an image, and what it described I fed to Flux. The output Flux dev gave back was almost a verbatim copy. It did mess up the woman (a silhouette) to be a man, but the room was almost identical even to the images on the walls. Jaw dropped. Llama-3.2-vision is not even close, and the other ones I used to use are rubbish in comparison.

GiteaMirror commented

2026-04-22 11:43:01 -05:00

@byjlw commented on GitHub (Feb 16, 2025):

Janus-7B is the best vision model I have tried to date locally as I gave it an image, and what it described I fed to Flux. The output Flux dev gave back was almost a verbatim copy. It did mess up the woman (a silhouette) to be a man, but the room was almost identical even to the images on the walls. Jaw dropped. Llama-3.2-vision is not even close, and the other ones I used to use are rubbish in comparison.

How did you run it? Can you describe exact steps?
Every other comment suggests that image input doesn't work with Ollama

@byjlw commented on GitHub (Feb 16, 2025): > Janus-7B is the best vision model I have tried to date locally as I gave it an image, and what it described I fed to Flux. The output Flux dev gave back was almost a verbatim copy. It did mess up the woman (a silhouette) to be a man, but the room was almost identical even to the images on the walls. Jaw dropped. Llama-3.2-vision is not even close, and the other ones I used to use are rubbish in comparison. How did you run it? Can you describe exact steps? Every other comment suggests that image input doesn't work with Ollama

GiteaMirror commented

2026-04-22 11:43:02 -05:00

@DarkAlchy commented on GitHub (Feb 17, 2025):

I would like to use Ollama with it, but I used it in Comfy UI.

@DarkAlchy commented on GitHub (Feb 17, 2025): I would like to use Ollama with it, but I used it in Comfy UI. ![Image](https://github.com/user-attachments/assets/d581aabc-a710-4e58-895c-e88881e5c622)

GiteaMirror commented

2026-04-22 11:43:02 -05:00

@snailfrying commented on GitHub (Feb 18, 2025):

https://ollama.com/gguf/DeepSeek-Janus-Pro-7B This website can be deployed, and the corresponding huggingface also has corresponding files that support ollama, as well as commands for using the model. However, I deployed it but did not use it properly。

@snailfrying commented on GitHub (Feb 18, 2025): https://ollama.com/gguf/DeepSeek-Janus-Pro-7B This website can be deployed, and the corresponding huggingface also has corresponding files that support ollama, as well as commands for using the model. However, I deployed it but did not use it properly。

GiteaMirror commented

2026-04-22 11:43:02 -05:00

@ghmole commented on GitHub (Mar 22, 2025):

+1

@ghmole commented on GitHub (Mar 22, 2025): +1

Sign in to join this conversation.

Branches Tags

main

parth-update-hermes-launch

parth-agent-system-prompt-cwd

hoyyeva/vscode-extension-docs-update

parth-gemma4-chat-template-renderer

parth-fix-claude-model-picker

parth-api-status-context-length

docs/vscode-extension-setup

hoyyeva/wire-up-context-length

hoyyeva/claude-code-context-doc

jmorganca/investigate-issue-17046

hoyyeva/hermes-docs

jmorganca/agent-loop-style

hoyyeva/openclaw

parth-agent-loop

hoyyeva/ollama-vscode-extension

brucemacd/cache-metrics

brucemacd/hermes-desktop

hoyyeva/docs-vscode

parth-input-style-experiment

brucemacd/docs-glm52

hoyyeva/poc-docs

Parth/mlx-launch-recommendations

parth-first-time-app-cli-experience

test/darwin-xcode-pin

improve-cloud-model-recommendations

hoyyeva/goose-docs

jmorganca/context-limit-fixes

hoyyeva/qwen-doc

hoyyeva/vscode-docs

jmorganca/remove-mlx-imagegen-code

parth-copilot-token-length-defaults

hoyyeva/poolside-windows

laguna-support

jmorganca/harden-markdown-rendering

laguna-renderer-parser

laguna-llamacpp

codex/make-integration-hidden-and-lunchable

brucemacd/omp-docs

pdevine/gguf-mtp-oldstyle

hoyyeva/migrate-pi

hoyyeva/anthropic-local-image-path

parth-launch-codex-app

hoyyeva/anthropic-reference-images-path

parth-anthropic-reference-images-path

brucemacd/download-before-remove

hoyyeva/editor-config-repair

parth-mlx-decode-checkpoints

parth/hide-claude-desktop-till-release

parth-add-claude-code-autoinstall

release_v0.22.0

pdevine/manifest-list

codex/fix-codex-model-metadata-warning

pdevine/addressable-manifest

brucemacd/launch-fetch-reccomended

jmorganca/llama-compat

launch-copilot-cli

release_v0.20.7

parth-auto-save-backup

parth-test

jmorganca/gemma4-audio-replacements

fix-manifest-digest-on-pull

hoyyeva/vscode-improve

brucemacd/install-server-wait

parth/update-claude-docs

brucemac/start-ap-install

pdevine/mlx-update

pdevine/qwen35_vision

drifkin/api-show-fallback

mintlify/image-generation-1773352582

hoyyeva/server-context-length-local-config

jmorganca/faster-reptition-penalties

jmorganca/convert-nemotron

parth-pi-thinking

pdevine/sampling-penalties

jmorganca/fix-create-quantization-memory

dongchen/resumable_transfer_fix

pdevine/sampling-cache-error

jessegross/mlx-usage

hoyyeva/openclaw-config

hoyyeva/app-html

pdevine/qwen3next

brucemacd/sign-sh-install

brucemacd/tui-update

brucemacd/usage-api

jmorganca/launch-empty

fix-app-dist-embed

mxyng/mlx-compile

mxyng/mlx-quant

mxyng/mlx-glm4.7

mxyng/mlx

brucemacd/simplify-model-picker

jmorganca/qwen3-concurrent

fix-glm-4.7-flash-mla-config

drifkin/qwen3-coder-opening-tag

brucemacd/usage-cli

fix-cuda12-fattn-shmem

ollama-imagegen-docs

parth/fix-multiline-inputs

brucemacd/config-docs

mxyng/model-files

mxyng/simple-execute

fix-imagegen-ollama-models

mxyng/async-upload

jmorganca/lazy-no-dtype-changes

imagegen-auto-detect-create

parth/decrease-concurrent-download-hf

fix-mlx-quantize-init

jmorganca/x-cleanup

usage

imagegen-readme

jmorganca/glm-image

mlx-gpu-cd

jmorganca/imagegen-modelfile

parth/agent-skills

parth/agent-allowlist

parth/signed-in-offline

parth/agents

parth/fix-context-chopping

improve-cloud-flow

parth/add-models-websearch

parth/prompt-renderer-mcp

jmorganca/native-settings

jmorganca/download-stream-hash

jmorganca/client2-rebased

brucemacd/oai-chat-req-multipart

jessegross/multi_chunk_reserve

grace/additional-omit-empty

grace/mistral-3-large

mxyng/tokenizer2

mxyng/tokenizer

jessegross/flash

hoyyeva/windows-nacked-app

mxyng/cleanup-attention

grace/deepseek-parser

hoyyeva/remember-unsent-prompt

parth/add-lfs-pointer-error-conversion

parth/olmo2-test2

hoyyeva/ollama-launchagent-plist

nicole/olmo-model

parth/olmo-test

mxyng/remove-embedded

parth/render-template

jmorganca/intellect-3

parth/remove-prealloc-linter

jmorganca/cmd-eval

nicole/nomic-embed-text-fix

mxyng/lint-2

hoyyeva/add-gemini-3-pro-preview

hoyyeva/load-model-list

mxyng/expand-path

mxyng/environ-2

hoyyeva/deeplink-json-encoding

parth/improve-tool-calling-tests

hoyyeva/conversation

hoyyeva/assistant-edit-response

hoyyeva/thinking

origin/brucemacd/invalid-char-i-err

parth/improve-tool-calling

jmorganca/required-omitempty

grace/qwen3-vl-tests

mxyng/iter-client

parth/docs-readme

nicole/embed-test

pdevine/integration-benchstat

parth/remove-generate-cmd

parth/add-toolcall-id

mxyng/server-tests

jmorganca/glm-4.6

jmorganca/gin-h-compat

drifkin/stable-tool-args

pdevine/qwen3-more-thinking

parth/add-websearch-client

nicole/websearch_local

jmorganca/qwen3-coder-updates

grace/deepseek-v3-migration-tests

mxyng/fix-create

jmorganca/cloud-errors

pdevine/parser-tidy

revert-12233-parth/simplify-entrypoints-runner

parth/enable-so-gpt-oss

brucemacd/qwen3vl

jmorganca/readme-simplify

parth/gpt-oss-structured-outputs

revert-12039-jmorganca/tools-braces

mxyng/embeddings

mxyng/gguf

mxyng/benchmark

mxyng/types-null

parth/move-parsing

mxyng/gemma2

jmorganca/docs

mxyng/16-bit

mxyng/create-stdin

pdevine/authorizedkeys

mxyng/quant

parth/opt-in-error-context-window

brucemacd/cache-models

brucemacd/runner-completion

jmorganca/llama-update-6

brucemacd/benchmark-list

brucemacd/partial-read-caps

parth/deepseek-r1-tools

mxyng/omit-array

parth/tool-prefix-temp

brucemacd/runner-test

jmorganca/qwen25vl

brucemacd/model-forward-test-ext

parth/python-function-parsing

jmorganca/cuda-compression-none

drifkin/num-parallel

drifkin/chat-truncation-fix

jmorganca/sync

parth/python-tools-calling

drifkin/array-head-count

brucemacd/create-no-loop

parth/server-enable-content-stream-with-tools

qwen25omni

mxyng/v3

brucemacd/ropeconfig

jmorganca/silence-tokenizer

parth/sample-so-test

parth/sampling-structured-outputs

brucemacd/doc-go-engine

parth/constrained-sampling-json

jmorganca/mistral-wip

brucemacd/mistral-small-convert

parth/sample-unmarshal-json-for-params

brucemacd/jomorganca/mistral

pdevine/bfloat16

jmorganca/mistral

brucemacd/mistral

pdevine/logging

parth/sample-correctness-fix

parth/sample-fix-sorting

jmorgan/sample-fix-sorting-extras

jmorganca/temp-0-images

brucemacd/parallel-embed-models

brucemacd/shim-grammar

jmorganca/fix-gguf-error

bmizerany/nameswork

jmorganca/faster-releases

bmizerany/validatenames

brucemacd/err-no-vocab

brucemacd/rope-config

brucemacd/err-hint

brucemacd/qwen2_5

brucemacd/logprobs

brucemacd/new_runner_graph_bench

progress-flicker

brucemacd/forward-test

brucemacd/go_qwen2

pdevine/gemma2

jmorganca/add-missing-symlink-eval

mxyng/next-debug

parth/set-context-size-openai

brucemacd/next-bpe-bench

brucemacd/next-bpe-test

brucemacd/new_runner_e2e

brucemacd/new_runner_qwen2

pdevine/convert-cohere2

brucemacd/convert-cli

parth/log-probs

mxyng/next-mlx

mxyng/cmd-history

parth/templating

parth/tokenize-detokenize

brucemacd/check-key-register

bmizerany/grammar

jmorganca/vendor-081b29bd

mxyng/func-checks

jmorganca/fix-null-format

parth/fix-default-to-warn-json

jmorganca/qwen2vl

jmorganca/no-concat

parth/cmd-cleanup-SO

brucemacd/check-key-register-structured-err

parth/openai-stream-usage

parth/fix-referencing-so

stream-tools-stop

jmorganca/degin-1

brucemacd/install-path-clean

brucemacd/push-name-validation

brucemacd/browser-key-register

jmorganca/openai-fix-first-message

jmorganca/fix-proxy

jessegross/sample

parth/disallow-streaming-tools

dhiltgen/remove_submodule

jmorganca/ga

jmorganca/mllama

pdevine/newlines

pdevine/geems-2b

jmorganca/llama-bump

mxyng/modelname-7

mxyng/gin-slog

mxyng/modelname-6

jyan/convert-prog

jyan/quant5

paligemma-support

pdevine/import-docs

jmorganca/openai-context

jyan/paligemma

jyan/p2

jyan/palitest

bmizerany/embedspeedup

jmorganca/llama-vit

brucemacd/allow-ollama

royh/ep-methods

royh/whisper

mxyng/api-models

mxyng/fix-memory

jyan/q4_4/8

jyan/ollama-v

royh/stream-tools

roy-embed-parallel

bmizerany/hrm

revert-5963-revert-5924-mxyng/llama3.1-rope

royh/embed-viz

jyan/local2

jyan/auth

jyan/local

jyan/parse-temp

jmorganca/template-mistral

jyan/reord-g

royh-openai-suffixdocs

royh-imgembed

royh-embed-parallel

jyan/quant4

royh-precision

jyan/progress

pdevine/fix-template

jyan/quant3

pdevine/ggla

mxyng/update-registry-domain

jmorganca/ggml-static

mxyng/create-context

jyan/v0.146

mxyng/layers-from-files

build_dist

bmizerany/noseek

royh-ls

royh-name

timeout

mxyng/server-timestamp

bmizerany/nosillyggufslurps

royh-params

jmorganca/llama-cpp-7c26775

royh-openai-delete

royh-show-rigid

jmorganca/enable-fa

jmorganca/no-error-template

jyan/format

royh-testdelete

bmizerany/fastverify

language_support

pdevine/ps-glitches

brucemacd/tokenize

bruce/iq-quants

bmizerany/filepathwithcoloninhost

mxyng/split-bin

bmizerany/client-registry

jmorganca/if-none-match

native

jmorganca/native

jmorganca/batch-embeddings

jmorganca/initcmake

jmorganca/mm

pdevine/showggmlinfo

modenameenforcealphanum

bmizerany/modenameenforcealphanum

jmorganca/done-reason

jmorganca/llama-cpp-8960fe8

ollama.com

bmizerany/filepathnobuild

bmizerany/types/model/defaultfix

rmdisplaylong

nogogen

bmizerany/x

modelfile-readme

bmizerany/replacecolon

jmorganca/limit

jmorganca/execstack

jmorganca/replace-assets

mxyng/tune-concurrency

jmorganca/testing

whitespace-detection

jmorganca/options

upgrade-all

scratch

cuda-search

mattw/airenamer

mattw/allmodelsonhuggingface

mattw/quantcontext

mattw/whatneedstorun

brucemacd/llama-mem-calc

mattw/faq-context

mattw/communitylinks

mattw/noprune

mattw/python-functioncalling

rename

mxyng/install

pulse

remove-first

editor

mattw/selfqueryingretrieval

cgo

mattw/howtoquant

api

matt/streamingapi

format-config

mxyng/extra-args

shell

update-nous-hermes

cp-model

upload-progress

fix-unknown-model

fix-model-names

delete-fix

insecure-registry

ls

deletemodels

progressbar

readme-updates

license-layers

skip-list

list-models

modelpath

matt/examplemodelfiles

distribution

go-opts

1 Participants

Notifications

Due Date

No due date set.

Dependencies

No dependencies set.

Reference: github-starred/ollama#31337