[GH-ISSUE #8333] GOT-OCR and voice model support #5341

Closed
opened 2026-04-12 16:32:27 -05:00 by GiteaMirror · 1 comment
Owner

Originally created by @Elton-Yang on GitHub (Jan 7, 2025).
Original GitHub issue: https://github.com/ollama/ollama/issues/8333

Could the GOT-OCR image model be supported by Ollama? I like the way ollama deploys and runs model very much, and really hope ollama can support more types of model including image and audio models, specifically two models I use a lot: GOT-OCR2.0 (image OCR) and SenseVoiceSmall (audio STT). Thanks all developers for your hard work! really appreciate this project.

Originally created by @Elton-Yang on GitHub (Jan 7, 2025). Original GitHub issue: https://github.com/ollama/ollama/issues/8333 Could the GOT-OCR image model be supported by Ollama? I like the way ollama deploys and runs model very much, and really hope ollama can support more types of model including image and audio models, specifically two models I use a lot: GOT-OCR2.0 (image OCR) and SenseVoiceSmall (audio STT). Thanks all developers for your hard work! really appreciate this project.
GiteaMirror added the model label 2026-04-12 16:32:27 -05:00
Author
Owner

@rick-github commented on GitHub (Jan 7, 2025):

#3265, #7485

<!-- gh-comment-id:2574748370 --> @rick-github commented on GitHub (Jan 7, 2025): #3265, #7485
Sign in to join this conversation.
1 Participants
Notifications
Due Date
No due date set.
Dependencies

No dependencies set.

Reference: github-starred/ollama#5341