[PR #11913] [MERGED] Revert "cuda: leverage JIT for smaller footprint (#11635)" #75949

Closed
opened 2026-05-05 08:22:36 -05:00 by GiteaMirror · 0 comments
Owner

📋 Pull Request Information

Original PR: https://github.com/ollama/ollama/pull/11913
Author: @dhiltgen
Created: 8/15/2025
Status: Merged
Merged: 8/15/2025
Merged by: @dhiltgen

Base: mainHead: ptx


📝 Commits (1)

  • 7ccd442 Revert "cuda: leverage JIT for smaller footprint (#11635)"

📊 Changes

1 file changed (+3 additions, -3 deletions)

View changed files

📝 CMakePresets.json (+3 -3)

📄 Description

This reverts commit dc5a645434.

Looks like this is causing problems on some systems. I'll investigate deeper later to see if there's some way to re-enable the savings, but for now, back this out to get a good build.

CUDA error: the provided PTX was compiled with an unsupported toolchain.
  current device: 0, in function ggml_cuda_mul_mat_q at //ml/backend/ggml/ggml/src/ggml-cuda/mmq.cu:129
  cudaGetLastError()

🔄 This issue represents a GitHub Pull Request. It cannot be merged through Gitea due to API limitations.

## 📋 Pull Request Information **Original PR:** https://github.com/ollama/ollama/pull/11913 **Author:** [@dhiltgen](https://github.com/dhiltgen) **Created:** 8/15/2025 **Status:** ✅ Merged **Merged:** 8/15/2025 **Merged by:** [@dhiltgen](https://github.com/dhiltgen) **Base:** `main` ← **Head:** `ptx` --- ### 📝 Commits (1) - [`7ccd442`](https://github.com/ollama/ollama/commit/7ccd4424541d7d8453b171822a748a31e698c004) Revert "cuda: leverage JIT for smaller footprint (#11635)" ### 📊 Changes **1 file changed** (+3 additions, -3 deletions) <details> <summary>View changed files</summary> 📝 `CMakePresets.json` (+3 -3) </details> ### 📄 Description This reverts commit dc5a645434f0ea6364c426c6ba112da1afa40cb2. Looks like this is causing problems on some systems. I'll investigate deeper later to see if there's some way to re-enable the savings, but for now, back this out to get a good build. ``` CUDA error: the provided PTX was compiled with an unsupported toolchain. current device: 0, in function ggml_cuda_mul_mat_q at //ml/backend/ggml/ggml/src/ggml-cuda/mmq.cu:129 cudaGetLastError() ``` --- <sub>🔄 This issue represents a GitHub Pull Request. It cannot be merged through Gitea due to API limitations.</sub>
GiteaMirror added the pull-request label 2026-05-05 08:22:36 -05:00
Sign in to join this conversation.
1 Participants
Notifications
Due Date
No due date set.
Dependencies

No dependencies set.

Reference: github-starred/ollama#75949