ollama-ollama/server at 86513cb697c9842265863b03ba1e916b959a63c0 - ollama-ollama - Computersurge

github-starred/ollama-ollama

mirror of https://github.com/ollama/ollama.git synced 2026-04-29 17:29:05 -05:00

Jesse Gross a16f96658b mlxrunner: Enforce model context limit

Currently, context length is unbounded - the cache will keep
growing forever independent of the model's trained context
length. This caps it and enforces semantics similar to most
cloud services:
 - Long prompts will result in an error, not truncation.
 - Generation that exceeds the context will be stopped.

2026-02-27 17:29:47 -08:00

aliases.go

add ability to disable cloud (#14221)

2026-02-12 15:47:00 -08:00

auth_test.go

server: reject unexpected auth hosts (#13738)

2026-01-16 14:10:36 -05:00

auth.go

server: reject unexpected auth hosts (#13738)

2026-01-16 14:10:36 -05:00

create_test.go

Clean up the manifest and modelpath (#13807)

2026-01-21 11:46:17 -08:00

create.go

Clean up the manifest and modelpath (#13807)

2026-01-21 11:46:17 -08:00

download.go

Clean up the manifest and modelpath (#13807)

2026-01-21 11:46:17 -08:00

fixblobs_test.go

…

fixblobs.go

…

images_test.go

x/imagegen: add image edit capabilities (#13846)

2026-01-22 20:35:08 -08:00

images.go

mlxrunner: Enforce model context limit

2026-02-27 17:29:47 -08:00

logprob.go

…

model.go

Clean up the manifest and modelpath (#13807)

2026-01-21 11:46:17 -08:00

prompt_test.go

model: support for qwen3.5 architecture (#14378)

2026-02-24 20:08:05 -08:00

prompt.go

mlxrunner: Enforce model context limit

2026-02-27 17:29:47 -08:00

quantization_test.go

model: support for qwen3.5 architecture (#14378)

2026-02-24 20:08:05 -08:00

quantization.go

model: support for qwen3.5 architecture (#14378)

2026-02-24 20:08:05 -08:00

routes_aliases_test.go

add ability to disable cloud (#14221)

2026-02-12 15:47:00 -08:00

routes_aliases.go

cmd: ollama launch improvements (#14099)

2026-02-05 15:08:17 -08:00

routes_cloud_test.go

add ability to disable cloud (#14221)

2026-02-12 15:47:00 -08:00

routes_create_test.go

Clean up the manifest and modelpath (#13807)

2026-01-21 11:46:17 -08:00

routes_debug_test.go

server: use tiered VRAM-based default context length

2026-02-02 10:47:09 -08:00

routes_delete_test.go

Clean up the manifest and modelpath (#13807)

2026-01-21 11:46:17 -08:00

routes_generate_renderer_test.go

server: use tiered VRAM-based default context length

2026-02-02 10:47:09 -08:00

routes_generate_test.go

bugfix: better mlx model scheduling (#14290)

2026-02-17 13:57:05 -08:00

routes_harmony_streaming_test.go

preserve tool definition and call JSON ordering (#13525)

2026-01-05 18:03:36 -08:00

routes_list_test.go

…

routes_options_test.go

server: use tiered VRAM-based default context length

2026-02-02 10:47:09 -08:00

routes_test.go

…

routes.go

mlxrunner: Enforce model context limit

2026-02-27 17:29:47 -08:00

sched_test.go

mlxrunner: Report actual memory usage from runner

2026-02-27 17:29:47 -08:00

sched.go

mlxrunner: Enforce model context limit

2026-02-27 17:29:47 -08:00

sparse_common.go

…

sparse_windows.go

…

test_home_test.go

add ability to disable cloud (#14221)

2026-02-12 15:47:00 -08:00

upload.go

Clean up the manifest and modelpath (#13807)

2026-01-21 11:46:17 -08:00