ollama/cmd at d319227df01254ba375dbabd1d42e851465e4476 - ollama - Computersurge

github-starred/ollama

mirror of https://github.com/ollama/ollama.git synced 2026-05-05 23:53:43 -05:00

Files

History

Patrick Devine 15e6076d79 mlx: Gemma4 MTP speculative decoding (#15980 )

This change adds support for MTP (multi-token prediction) speculative decoding for the
gemma4 model family.

It includes:
  * support for importing safetensors based gemma4 draft models with `ollama create`
  * a new DRAFT command in the Modelfile for specifying draft models
  * a --quantize-draft flag for the ollama create command to quantize the draft model
  * cache support for speculation
  * changes to the rotating cache to be able to handle MTP correctly
  * sampling support for draft model token prediction

---------

Co-authored-by: Daniel Hiltgen <daniel@ollama.com>

2026-05-05 08:55:04 -07:00

..

Add support for gemma4 (#15214 )

2026-04-02 11:33:33 -07:00

cmd: refactor tui and launch (#14609 )

2026-03-12 18:39:06 -07:00

internal/fileutil

cmd: refactor tui and launch (#14609 )

2026-03-12 18:39:06 -07:00

launch: claude app (#15937 )

2026-05-02 19:19:57 -07:00

Runner for Ollama engine

2025-02-13 17:09:26 -08:00

launch: claude app (#15937 )

2026-05-02 19:19:57 -07:00

background_unix.go

cmd: ollama menu and launch improvements (#14038 )

2026-02-09 11:30:16 -08:00

background_windows.go

cmd: ollama menu and launch improvements (#14038 )

2026-02-09 11:30:16 -08:00

cmd_launcher_test.go

launch: claude app (#15937 )

2026-05-02 19:19:57 -07:00

cmd_test.go

mlx: Gemma4 MTP speculative decoding (#15980 )

2026-05-05 08:55:04 -07:00

cmd.go

mlx: Gemma4 MTP speculative decoding (#15980 )

2026-05-05 08:55:04 -07:00

editor_unix.go

feature: add ctrl-g to allow users to use an editor to edit their prompt (#14197 )

2026-02-11 13:04:41 -08:00

editor_windows.go

feature: add ctrl-g to allow users to use an editor to edit their prompt (#14197 )

2026-02-11 13:04:41 -08:00

interactive_test.go

Add support for gemma4 (#15214 )

2026-04-02 11:33:33 -07:00

interactive.go

modelfiles: fix /save command and add shortname for safetensors based models (#15413 )

2026-04-08 21:05:39 -07:00

start_darwin.go

cmd: ollama launch improvements (#14099 )

2026-02-05 15:08:17 -08:00

start_default.go

…

start_windows.go

spawn desktop quickly (#11011 )

2025-06-08 09:34:52 -07:00

start.go

…

warn_thinking_test.go

add thinking support to the api and cli (#10584 )

2025-05-28 19:38:52 -07:00