[PR #3859] [MERGED] add mixtral 8x7b model conversion #11300

Closed
opened 2026-04-12 23:27:12 -05:00 by GiteaMirror · 0 comments
Owner

📋 Pull Request Information

Original PR: https://github.com/ollama/ollama/pull/3859
Author: @pdevine
Created: 4/24/2024
Status: Merged
Merged: 4/24/2024
Merged by: @pdevine

Base: mainHead: pdevine/mixtral


📝 Commits (2)

  • cfc4eb4 add mixtral model conversion
  • b160680 use mistral's layer handler for attn layers

📊 Changes

3 files changed (+138 additions, -25 deletions)

View changed files

📝 convert/convert.go (+17 -13)
convert/mixtral.go (+96 -0)
📝 convert/safetensors.go (+25 -12)

📄 Description

This change converts Mixtral 8x7b directly into an Ollama model. The 8x22b model will added in a separate PR as it has a different structure for the way the experts get laid out.


🔄 This issue represents a GitHub Pull Request. It cannot be merged through Gitea due to API limitations.

## 📋 Pull Request Information **Original PR:** https://github.com/ollama/ollama/pull/3859 **Author:** [@pdevine](https://github.com/pdevine) **Created:** 4/24/2024 **Status:** ✅ Merged **Merged:** 4/24/2024 **Merged by:** [@pdevine](https://github.com/pdevine) **Base:** `main` ← **Head:** `pdevine/mixtral` --- ### 📝 Commits (2) - [`cfc4eb4`](https://github.com/ollama/ollama/commit/cfc4eb44ccb96444f9ea73ff764ec19e10817c3e) add mixtral model conversion - [`b160680`](https://github.com/ollama/ollama/commit/b160680595b7cc5d7eadcd314163d8f40b8d000f) use mistral's layer handler for attn layers ### 📊 Changes **3 files changed** (+138 additions, -25 deletions) <details> <summary>View changed files</summary> 📝 `convert/convert.go` (+17 -13) ➕ `convert/mixtral.go` (+96 -0) 📝 `convert/safetensors.go` (+25 -12) </details> ### 📄 Description This change converts Mixtral 8x7b directly into an Ollama model. The 8x22b model will added in a separate PR as it has a different structure for the way the experts get laid out. --- <sub>🔄 This issue represents a GitHub Pull Request. It cannot be merged through Gitea due to API limitations.</sub>
GiteaMirror added the pull-request label 2026-04-12 23:27:12 -05:00
Sign in to join this conversation.
1 Participants
Notifications
Due Date
No due date set.
Dependencies

No dependencies set.

Reference: github-starred/ollama#11300