[GH-ISSUE #7616] Please add microsoft/OmniParser model #30619

Open
opened 2026-04-22 10:26:47 -05:00 by GiteaMirror · 1 comment
Owner

Originally created by @craftslab on GitHub (Nov 11, 2024).
Original GitHub issue: https://github.com/ollama/ollama/issues/7616

OmniParser is a general screen parsing tool, which interprets/converts UI screenshot to structured format, to improve existing LLM based UI agent. Training Datasets include: 1) an interactable icon detection dataset, which was curated from popular web pages and automatically annotated to highlight clickable and actionable regions, and 2) an icon description dataset, designed to associate each UI element with its corresponding function.

https://huggingface.co/microsoft/OmniParser

Thanks!

Originally created by @craftslab on GitHub (Nov 11, 2024). Original GitHub issue: https://github.com/ollama/ollama/issues/7616 OmniParser is a general screen parsing tool, which interprets/converts UI screenshot to structured format, to improve existing LLM based UI agent. Training Datasets include: 1) an interactable icon detection dataset, which was curated from popular web pages and automatically annotated to highlight clickable and actionable regions, and 2) an icon description dataset, designed to associate each UI element with its corresponding function. https://huggingface.co/microsoft/OmniParser Thanks!
GiteaMirror added the model label 2026-04-22 10:26:47 -05:00
Author
Owner

@stunney commented on GitHub (Mar 20, 2025):

https://github.com/ollama/ollama/issues/9144

<!-- gh-comment-id:2741059137 --> @stunney commented on GitHub (Mar 20, 2025): https://github.com/ollama/ollama/issues/9144
Sign in to join this conversation.
1 Participants
Notifications
Due Date
No due date set.
Dependencies

No dependencies set.

Reference: github-starred/ollama#30619