[GH-ISSUE #7375] use arm64 extensions ? #51199

Open
opened 2026-04-28 18:54:24 -05:00 by GiteaMirror · 9 comments
Owner

Originally created by @fxmbsw7 on GitHub (Oct 26, 2024).
Original GitHub issue: https://github.com/ollama/ollama/issues/7375

like neon64 and way others .. ?

greets ..

Originally created by @fxmbsw7 on GitHub (Oct 26, 2024). Original GitHub issue: https://github.com/ollama/ollama/issues/7375 like neon64 and way others .. ? greets ..
GiteaMirror added the feature request label 2026-04-28 18:54:24 -05:00
Author
Owner

@fxmbsw7 commented on GitHub (Oct 27, 2024):

i added in llama.go in place of the -D arm .. int8 on arm8 -D__ARM_NEON64
and it didnt error out

<!-- gh-comment-id:2439824453 --> @fxmbsw7 commented on GitHub (Oct 27, 2024): i added in llama.go in place of the -D arm .. int8 on arm8 -D__ARM_NEON64 and it didnt error out
Author
Owner

@fxmbsw7 commented on GitHub (Oct 27, 2024):

( and u may know , new phones are not avx less )

<!-- gh-comment-id:2439826358 --> @fxmbsw7 commented on GitHub (Oct 27, 2024): ( and u may know , new phones are not avx less )
Author
Owner

@fxmbsw7 commented on GitHub (Oct 27, 2024):

i saw my snapdragon 8 gen 2 has bf16
and i8mm
i think this i8mm error is missing a package , with header files and stuff

the instructions and info via cpuinfo of my sd8g2 :
processor : 7 BogoMIPS : 38.40 Features : fp asimd evtstrm aes pmull sha1 sha2 crc32 atomics fphp asimdhp cpuid asimdrdm jscvt fcma lrcpc dcpop sha3 sm3 sm4 asimddp sha512 asimdfhm dit uscat ilrcpc flagm ssbs sb paca pacg dcpodp flagm2 frint i8mm bf16 bti CPU implementer : 0x41 CPU architecture: 8 CPU variant : 0x1 CPU part : 0xd4e CPU revision : 0

<!-- gh-comment-id:2439921273 --> @fxmbsw7 commented on GitHub (Oct 27, 2024): i saw my snapdragon 8 gen 2 has bf16 and i8mm i think this i8mm error is missing a package , with header files and stuff the instructions and info via cpuinfo of my sd8g2 : processor : 7 BogoMIPS : 38.40 Features : fp asimd evtstrm aes pmull sha1 sha2 crc32 atomics fphp asimdhp cpuid asimdrdm jscvt fcma lrcpc dcpop sha3 sm3 sm4 asimddp sha512 asimdfhm dit uscat ilrcpc flagm ssbs sb paca pacg dcpodp flagm2 frint i8mm bf16 bti CPU implementer : 0x41 CPU architecture: 8 CPU variant : 0x1 CPU part : 0xd4e CPU revision : 0
Author
Owner

@fxmbsw7 commented on GitHub (Oct 27, 2024):

( the qcom docs say additionally )
Qualcomm® Hexagon™ Processor
• Fused AI Accelerator Architecture
• Hexagon Tensor Accelerator
• Hexagon Vector eXtensions
• Hexagon Scalar Accelerator
• Hexagon Direct Link
• Support for mix precision (INT8+INT16)
• Support for all precisions (INT4, INT8, INT16, FP16)
• Micro Tile Inferencing

i cant currently find if its less.than arm8.6 or newer

<!-- gh-comment-id:2439922886 --> @fxmbsw7 commented on GitHub (Oct 27, 2024): ( the qcom docs say additionally ) Qualcomm® Hexagon™ Processor • Fused AI Accelerator Architecture • Hexagon Tensor Accelerator • Hexagon Vector eXtensions • Hexagon Scalar Accelerator • Hexagon Direct Link • Support for mix precision (INT8+INT16) • Support for all precisions (INT4, INT8, INT16, FP16) • Micro Tile Inferencing i cant currently find if its less.than arm8.6 or newer
Author
Owner

@fxmbsw7 commented on GitHub (Oct 27, 2024):

an ai bot says its arm9
so the arm8.6 cmd in llama.go should work ?
ill try ..

<!-- gh-comment-id:2439923418 --> @fxmbsw7 commented on GitHub (Oct 27, 2024): an ai bot says its arm9 so the arm8.6 cmd in llama.go should work ? ill try ..
Author
Owner

@fxmbsw7 commented on GitHub (Oct 27, 2024):

look into -march=native -mcpu=native -mtune=native

<!-- gh-comment-id:2439925395 --> @fxmbsw7 commented on GitHub (Oct 27, 2024): look into -march=native -mcpu=native -mtune=native
Author
Owner

@fxmbsw7 commented on GitHub (Oct 27, 2024):

-m*=native and armv9 didnt work .. illegal operaton or instruction
armv8 worked
still testing on ..

<!-- gh-comment-id:2440018144 --> @fxmbsw7 commented on GitHub (Oct 27, 2024): -m*=native and armv9 didnt work .. illegal operaton or instruction armv8 worked still testing on ..
Author
Owner

@dhiltgen commented on GitHub (Nov 5, 2024):

Take a look at #7199 which should help lay some foundation to make this a bit easier.

<!-- gh-comment-id:2458130728 --> @dhiltgen commented on GitHub (Nov 5, 2024): Take a look at #7199 which should help lay some foundation to make this a bit easier.
Author
Owner

@fxmbsw7 commented on GitHub (Nov 5, 2024):

looks good ?

On Tue, Nov 5, 2024, 21:53 Daniel Hiltgen @.***> wrote:

Take a look at #7199 https://github.com/ollama/ollama/pull/7199 which
should help lay some foundation to make this a bit easier.


Reply to this email directly, view it on GitHub
https://github.com/ollama/ollama/issues/7375#issuecomment-2458130728,
or unsubscribe
https://github.com/notifications/unsubscribe-auth/AJMLP3IDHFJBJKJKZSEHJKTZ7EV4FAVCNFSM6AAAAABQVCKKKSVHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMZDINJYGEZTANZSHA
.
You are receiving this because you authored the thread.Message ID:
@.***>

<!-- gh-comment-id:2458270555 --> @fxmbsw7 commented on GitHub (Nov 5, 2024): looks good ? On Tue, Nov 5, 2024, 21:53 Daniel Hiltgen ***@***.***> wrote: > Take a look at #7199 <https://github.com/ollama/ollama/pull/7199> which > should help lay some foundation to make this a bit easier. > > — > Reply to this email directly, view it on GitHub > <https://github.com/ollama/ollama/issues/7375#issuecomment-2458130728>, > or unsubscribe > <https://github.com/notifications/unsubscribe-auth/AJMLP3IDHFJBJKJKZSEHJKTZ7EV4FAVCNFSM6AAAAABQVCKKKSVHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMZDINJYGEZTANZSHA> > . > You are receiving this because you authored the thread.Message ID: > ***@***.***> >
Sign in to join this conversation.
1 Participants
Notifications
Due Date
No due date set.
Dependencies

No dependencies set.

Reference: github-starred/ollama#51199