[PR #3900] [CLOSED] use matrix multiplication kernels in more cases #73609

Closed
opened 2026-05-05 05:30:57 -05:00 by GiteaMirror · 0 comments
Owner

📋 Pull Request Information

Original PR: https://github.com/ollama/ollama/pull/3900
Author: @jmorganca
Created: 4/25/2024
Status: Closed

Base: mainHead: jmorganca/mm


📝 Commits (1)

  • 27d88db use matrix multiplcation kernels in more cases

📊 Changes

1 file changed (+45 additions, -0 deletions)

View changed files

llm/patches/04-metal.diff (+45 -0)

📄 Description

TODO:

  • Set opts.NumCTX to $OLLAMA_NUM_PARALLEL * opts.NumCtx
  • Enable continuous batching?

🔄 This issue represents a GitHub Pull Request. It cannot be merged through Gitea due to API limitations.

## 📋 Pull Request Information **Original PR:** https://github.com/ollama/ollama/pull/3900 **Author:** [@jmorganca](https://github.com/jmorganca) **Created:** 4/25/2024 **Status:** ❌ Closed **Base:** `main` ← **Head:** `jmorganca/mm` --- ### 📝 Commits (1) - [`27d88db`](https://github.com/ollama/ollama/commit/27d88dbfdc3ae4516a8cb36f02499472f454d60b) use matrix multiplcation kernels in more cases ### 📊 Changes **1 file changed** (+45 additions, -0 deletions) <details> <summary>View changed files</summary> ➕ `llm/patches/04-metal.diff` (+45 -0) </details> ### 📄 Description TODO: - [ ] Set `opts.NumCTX` to `$OLLAMA_NUM_PARALLEL * opts.NumCtx` - [ ] Enable continuous batching? --- <sub>🔄 This issue represents a GitHub Pull Request. It cannot be merged through Gitea due to API limitations.</sub>
GiteaMirror added the pull-request label 2026-05-05 05:30:57 -05:00
Sign in to join this conversation.
1 Participants
Notifications
Due Date
No due date set.
Dependencies

No dependencies set.

Reference: github-starred/ollama#73609