[PR #12607] feat(models): add support for apertus architecture #24425

Open
opened 2026-04-19 17:34:07 -05:00 by GiteaMirror · 0 comments
Owner

📋 Pull Request Information

Original PR: https://github.com/ollama/ollama/pull/12607
Author: @gaetschwartz
Created: 10/14/2025
Status: 🔄 Open

Base: mainHead: feature/apertus-support


📝 Commits (1)

  • 0de87a9 feat(models): add support for apertus architecture

📊 Changes

5 files changed (+227 additions, -0 deletions)

View changed files

📝 docs/modelfile.md (+1 -0)
📝 ml/backend.go (+6 -0)
📝 ml/backend/ggml/ggml.go (+7 -0)
model/models/apertus/model.go (+212 -0)
📝 model/models/models.go (+1 -0)

📄 Description

Apertus uses xIELU activation instead of SILU. Implementation follows the same patterns as other models with GPT2/Tekken tokenizer support.

References


🔄 This issue represents a GitHub Pull Request. It cannot be merged through Gitea due to API limitations.

## 📋 Pull Request Information **Original PR:** https://github.com/ollama/ollama/pull/12607 **Author:** [@gaetschwartz](https://github.com/gaetschwartz) **Created:** 10/14/2025 **Status:** 🔄 Open **Base:** `main` ← **Head:** `feature/apertus-support` --- ### 📝 Commits (1) - [`0de87a9`](https://github.com/ollama/ollama/commit/0de87a92a35f3a4decf55041b949a237aad44e96) feat(models): add support for apertus architecture ### 📊 Changes **5 files changed** (+227 additions, -0 deletions) <details> <summary>View changed files</summary> 📝 `docs/modelfile.md` (+1 -0) 📝 `ml/backend.go` (+6 -0) 📝 `ml/backend/ggml/ggml.go` (+7 -0) ➕ `model/models/apertus/model.go` (+212 -0) 📝 `model/models/models.go` (+1 -0) </details> ### 📄 Description Apertus uses xIELU activation instead of SILU. Implementation follows the same patterns as other models with GPT2/Tekken tokenizer support. ### References - https://huggingface.co/swiss-ai/Apertus-8B-2509 --- <sub>🔄 This issue represents a GitHub Pull Request. It cannot be merged through Gitea due to API limitations.</sub>
GiteaMirror added the pull-request label 2026-04-19 17:34:07 -05:00
Sign in to join this conversation.
1 Participants
Notifications
Due Date
No due date set.
Dependencies

No dependencies set.

Reference: github-starred/ollama#24425