mirror of
https://github.com/ollama/ollama.git
synced 2025-12-05 18:46:22 -06:00
-
released this
2025-11-05 14:33:01 -06:00 | 110 commits to main since this release📅 Originally published on GitHub: Wed, 05 Nov 2025 21:41:21 GMT
🏷️ Git tag created: Wed, 05 Nov 2025 20:33:01 GMTollama runnow works with embedding modelsollama runcan now run embedding models to generate vector embeddings from text:ollama run embeddinggemma "Hello world"Content can also be provided to
ollama runvia standard input:echo "Hello world" | ollama run embeddinggemmaWhat's Changed
- Fixed errors when running
qwen3-vl:235bandqwen3-vl:235b-instruct - Enable flash attention for Vulkan (currently needs to be built from source)
- Add Vulkan memory detection for Intel GPU using DXGI+PDH
- Ollama will now return tool call IDs from the
/api/chatAPI - Fixed hanging due to CPU discovery
- Ollama will now show login instructions when switching to a cloud model in interactive mode
- Fix reading stale VRAM data
ollama runnow works with embedding models
New Contributors
- @ryanycoleman made their first contribution in https://github.com/ollama/ollama/pull/11740
- @Rajathbail made their first contribution in https://github.com/ollama/ollama/pull/12929
- @virajwad made their first contribution in https://github.com/ollama/ollama/pull/12664
- @AXYZdong made their first contribution in https://github.com/ollama/ollama/pull/8601
Full Changelog: https://github.com/ollama/ollama/compare/v0.12.9...v0.12.10
Downloads
- Fixed errors when running