ollama

mirror of https://github.com/ollama/ollama.git synced 2026-05-22 13:42:25 -05:00

Author	SHA1	Message	Date
Parth Sareen	91c8e5e1a8	launch: enriched model inventory (#16230)	2026-05-21 11:57:20 -07:00
Daniel Hiltgen	4b2d529966	Reduce startup model hydration (#16215) * Reduce startup model hydration Add a lightweight model list cache for tags and launch inventory, while keeping show cache population lazy. This avoids loading every local model at startup on large model stores. * harden flaky scheduler unit test * remove extra launch model metadata text * review comments * review comments	2026-05-19 15:53:08 -07:00
Parth Sareen	ac7295ccab	launch: codex app integration (#16120)	2026-05-13 17:11:52 -07:00
Parth Sareen	f866e7608f	launch: disable Claude Desktop launch (#16028)	2026-05-07 10:46:18 -07:00
Parth Sareen	bab59072fb	launch: add plan-aware model gating (#16027)	2026-05-06 14:34:26 -07:00
Eva H	7c2c36bda2	cmd/launch: improve integration backup UX (#15907)	2026-05-06 11:32:54 -04:00
Parth Sareen	9ba5a04914	launch: claude app (#15937)	2026-05-02 19:19:57 -07:00
Parth Sareen	b6447caebc	launch: use vram bytes for model recommendations (#15885)	2026-04-29 18:40:14 -07:00
Eva H	bad32c7244	launch/docs: fix title for pool (#15883)	2026-04-29 17:18:44 -04:00
Parth Sareen	321cc8a2ba	server/launch: add model recommendations cache endpoint (#15868)	2026-04-28 17:09:04 -07:00
Daniel Hiltgen	87288ced4f	New models (#15861) * mlx: add laguna model support * convert: support fp8 safetensors import Decode HF F8_E4M3 safetensors with block scale companions into GGUF-supported tensor types, and record which output tensors came from FP8 source weights. Use that source-precision metadata during create quantization: default FP8-sourced GGUFs to Q8_0, keep non-FP8 tensors at their original precision for Q8_0, and promote non-FP8 quantizable tensors to Q8_0 for Q4_K requests. * ggml: add laguna model support * server: preserve generate logprobs with builtin parsers Generate requests were dropping logprob-only chunks whenever a builtin parser buffered visible content. Chat already handled this case, but generate only forwarded chunks with visible response, thinking, or tool-call output. Keep generate chunks that carry logprobs even when the builtin parser has not flushed visible content yet, and add a regression test that exercises the behavior with a generic thinking parser. * review comments - perf improvements * ggml: implement nemotron 3 nano omni * add poolside integration * update poolside doc * adapt to new cache setup * fix test * fix test --------- Co-authored-by: Eva Ho <hoyyeva@gmail.com>	2026-04-28 11:50:12 -07:00
Eva H	b4442c6d17	launch: resave managed integration config when live config drifts (#15776)	2026-04-23 19:32:36 -04:00
Parth Sareen	8e05d734b9	launch: add kimi cli integration with installer flow (#15723)	2026-04-20 15:33:32 -07:00
Parth Sareen	a50ce61c54	launch: skip unchanged managed-single rewrite (#15633)	2026-04-16 16:20:42 -07:00
Mike Wallio	7d271e6dc9	cmd/launch: add Copilot CLI integration (#15583) --------- Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com> Co-authored-by: ParthSareen <parth.sareen@ollama.com>	2026-04-15 17:22:53 -07:00
Parth Sareen	43f90def04	launch: add hermes (#15569)	2026-04-15 12:00:23 -07:00
Eva H	5818001610	launch: skip unchanged integration rewrite configration (#15491)	2026-04-13 17:18:56 -07:00
Parth Sareen	4e16f562c0	launch: add openclaw channels setup (#15407)	2026-04-08 13:25:27 -07:00
Eva H	d64812eb5d	cmd: improve multi-select sorting and selection status (#15200)	2026-04-08 10:39:18 -07:00
Parth Sareen	8e54823fd3	revert context length warnings change (#15121)	2026-03-28 16:43:59 -07:00
Parth Sareen	7c8da5679e	launch: improve multi-select for already added models (#15113)	2026-03-28 13:44:40 -07:00
Eva H	366625a831	launch: warn when server context length is below 64k for local models (#15044) A stop-gap for now to guide users better. We'll add more in-depth recommendations per integration as well. --------- Co-authored-by: Parth Sareen <parth.sareen@ollama.com>	2026-03-27 00:15:53 -07:00
Eva H	7575438366	cmd: ollama launch vscode (#15060) Co-authored-by: Parth Sareen <parth.sareen@ollama.com>	2026-03-25 16:37:02 -04:00
Eva H	b5e7888414	cmd/launch: skip redundant config writes when model unchanged (#14941)	2026-03-18 17:36:52 -04:00
Parth Sareen	870599f5da	launch: remove warning for default policy (#14830)	2026-03-13 15:01:38 -07:00
Parth Sareen	bb867c6fdb	launch: fix headless --yes integration flow and policy scoping (#14815)	2026-03-13 11:45:36 -07:00
Parth Sareen	76925f1284	cmd: TUI model ordering (#14814)	2026-03-13 10:19:22 -07:00
Parth Sareen	af5f7c0a9e	cmd: refactor tui and launch (#14609)	2026-03-12 18:39:06 -07:00

28 Commits