[GH-ISSUE #7238] Ollama document intelligence engine #51108

Closed
opened 2026-04-28 18:24:16 -05:00 by GiteaMirror · 4 comments
Owner

Originally created by @dcasota on GitHub (Oct 17, 2024).
Original GitHub issue: https://github.com/ollama/ollama/issues/7238

Has anyone Ollama examples for document intelligence?

In comparison, Azure AI comes with an engine which extracts elements from documents and makes them eligible passing the data into customizeable workflows. However, there are culprits even in the payed tier.

Azure AI Document Intelligence requirements

  • Microsoft Office files, or
  • JPEG, PNG, BMP, TIFF, or PDF format
  • PDF documents must have dimensions less than 17 x 17 inches or A3 paper size.
  • PDF documents must not be protected with a password.
  • Images must have dimensions between 50 x 50 pixels and 10,000 x 10,000 pixels.
  • file size less than 500 MB (payed standard tier)
  • only the first 2000 pages are analyzed (payed standard tier)

Some information about supported languages

Which Ollama examples fit to this extraction+passing-to-external-workflow demand?

Originally created by @dcasota on GitHub (Oct 17, 2024). Original GitHub issue: https://github.com/ollama/ollama/issues/7238 Has anyone Ollama examples for document intelligence? In comparison, Azure AI comes with an engine which extracts elements from documents and makes them eligible passing the data into customizeable workflows. However, there are culprits even in the payed tier. Azure AI Document Intelligence requirements - Microsoft Office files, or - JPEG, PNG, BMP, TIFF, or PDF format - PDF documents must have dimensions less than 17 x 17 inches or A3 paper size. - PDF documents must not be protected with a password. - Images must have dimensions between 50 x 50 pixels and 10,000 x 10,000 pixels. - file size less than 500 MB (payed standard tier) - only the first 2000 pages are analyzed (payed standard tier) Some information about [supported languages](https://learn.microsoft.com/en-us/azure/ai-services/document-intelligence/language-support/ocr?view=doc-intel-4.0.0&preserve-view=true&tabs=read-print%2Clayout-print%2Cgeneral) Which Ollama examples fit to this extraction+passing-to-external-workflow demand?
GiteaMirror added the feature request label 2026-04-28 18:24:16 -05:00
Author
Owner

@mitja commented on GitHub (Oct 18, 2024):

I think this is not in scope of Ollama. Ollama's focus is on local model serving and not integrated solutions. Maybe look at end user tools like OpenWebUI or frameworks like LlamaIndex, which can both use Ollama for model serving.

<!-- gh-comment-id:2421565740 --> @mitja commented on GitHub (Oct 18, 2024): I think this is not in scope of Ollama. Ollama's focus is on local model serving and not integrated solutions. Maybe look at end user tools like OpenWebUI or frameworks like LlamaIndex, which can both use Ollama for model serving.
Author
Owner

@dcasota commented on GitHub (Oct 18, 2024):

Selection is good. Why shouldn't founders and young entrepreneurs think about three years into the future, right? Ollama is actually the best example of an integrated solution in terms of Hugging Face and Python as the preferred language in examples.

<!-- gh-comment-id:2421615541 --> @dcasota commented on GitHub (Oct 18, 2024): Selection is good. Why shouldn't founders and young entrepreneurs think about three years into the future, right? Ollama is actually the best example of an integrated solution in terms of Hugging Face and Python as the preferred language in examples.
Author
Owner

@pdevine commented on GitHub (Oct 24, 2024):

Thanks for the issue @dcasota ! I'm going to consolidate this into #5762; I'd love to do something like this in the future.

<!-- gh-comment-id:2436035842 --> @pdevine commented on GitHub (Oct 24, 2024): Thanks for the issue @dcasota ! I'm going to consolidate this into #5762; I'd love to do something like this in the future.
Author
Owner

@dcasota commented on GitHub (Oct 24, 2024):

@pdevine I wasn‘t aware of #5762. Makes sense. Have a great business plan.

<!-- gh-comment-id:2436087554 --> @dcasota commented on GitHub (Oct 24, 2024): @pdevine I wasn‘t aware of #5762. Makes sense. Have a great business plan.
Sign in to join this conversation.
1 Participants
Notifications
Due Date
No due date set.
Dependencies

No dependencies set.

Reference: github-starred/ollama#51108