[GH-ISSUE #9548] Support hot-swapping for LoRA adapters #52739

Open
opened 2026-04-29 00:43:32 -05:00 by GiteaMirror · 1 comment
Owner

Originally created by @anzhexe on GitHub (Mar 6, 2025).
Original GitHub issue: https://github.com/ollama/ollama/issues/9548

  • I have searched documentation
  • I have checked issues.

Hi,
Regarding Ollama's recent implementation of multiple LoRA adapters support (PR #7667), I'm curious about the possibility of hot-swapping capabilities for these adapters.

For reference, llama.cpp has already implemented this functionality as demonstrated in the following pull requests:

Are there any plans to implement a similar hot-swapping functionality within Ollama?

Originally created by @anzhexe on GitHub (Mar 6, 2025). Original GitHub issue: https://github.com/ollama/ollama/issues/9548 - [x] I have searched documentation - [x] I have checked issues. Hi, Regarding Ollama's recent implementation of multiple LoRA adapters support (PR #7667), I'm curious about the possibility of hot-swapping capabilities for these adapters. For reference, llama.cpp has already implemented this functionality as demonstrated in the following pull requests: - https://github.com/ggml-org/llama.cpp/pull/8857 - https://github.com/ggml-org/llama.cpp/pull/10994 Are there any plans to implement a similar hot-swapping functionality within Ollama?
GiteaMirror added the feature request label 2026-04-29 00:43:32 -05:00
Author
Owner

@mixxxxus commented on GitHub (Dec 20, 2025):

Bumping this.

Please consider runtime LoRA swapping. Being able to change adapters without merging them into the base model would make agent systems much easier to manage.

Even something simple like selecting adapters per request in '/api/chat' would already be great.

<!-- gh-comment-id:3678082597 --> @mixxxxus commented on GitHub (Dec 20, 2025): Bumping this. Please consider runtime LoRA swapping. Being able to **change adapters without merging them into the base model** would make agent systems much easier to manage. Even something simple like selecting adapters per request in '/api/chat' would already be great.
Sign in to join this conversation.
1 Participants
Notifications
Due Date
No due date set.
Dependencies

No dependencies set.

Reference: github-starred/ollama#52739