[GH-ISSUE #2451] [FEATURE] Add support for Intel Xeon (Sapphire and Emerald Rapids) accelerators and AI features such as AMX and AVX 512. #63471

Closed
opened 2026-05-03 13:39:08 -05:00 by GiteaMirror · 4 comments

Originally created by @scouzi1966 on GitHub (Feb 11, 2024).
Original GitHub issue: https://github.com/ollama/ollama/issues/2451

Originally assigned to: @dhiltgen on GitHub.

Note that Intel is trying to demystify AVX512 with an AVX 10 standard, but they are the same.

AVX512
https://www.intel.com/content/www/us/en/architecture-and-technology/avx-512-overview.html

AMX
https://www.intel.com/content/www/us/en/products/docs/accelerator-engines/advanced-matrix-extensions/overview.html

AVX512 is also being fully implemented by AMD


@pdevine commented on GitHub (Feb 18, 2024):

cc @dhiltgen


@scouzi1966 commented on GitHub (Feb 19, 2024):

I've since analyzed the code base more closely and realized that this probably belongs with the llama.cpp project, from which it would eventually make its way here. There seems to be Intel involvement here as well:

https://github.com/intel/neural-speed

You can close this request if you want from my point of view.


@dhiltgen commented on GitHub (Feb 19, 2024):

This might wind up being a dup of #2205


@jmorganca commented on GitHub (May 11, 2024):

Closing for https://github.com/ollama/ollama/issues/2205


Reference: github-starred/ollama#63471