ollama-ollama/server at df70249520fda991a83f607d485fbf4e64cfe1fd - ollama-ollama - Computersurge

github-starred/ollama-ollama

mirror of https://github.com/ollama/ollama.git synced 2026-03-09 03:12:11 -05:00

Files

History

Jeffrey Morgan df70249520 server: optimize chatPrompt to reduce tokenization calls (#14040 )

Change the truncation algorithm to start with all messages and remove
from the front until it fits, rather than adding messages one at a time
from the back. This reduces tokenization calls from O(n) to O(1) in the
common case where all messages fit in context.

2026-02-04 01:21:31 -08:00

..

docs: fix typos in repository documentation (#10683 )

2025-11-15 20:22:29 -08:00

aliases.go

cmd: claude launch improvements (#14064 )

2026-02-03 19:33:58 -08:00

auth_test.go

server: reject unexpected auth hosts (#13738 )

2026-01-16 14:10:36 -05:00

auth.go

server: reject unexpected auth hosts (#13738 )

2026-01-16 14:10:36 -05:00

create_test.go

Clean up the manifest and modelpath (#13807 )

2026-01-21 11:46:17 -08:00

create.go

Clean up the manifest and modelpath (#13807 )

2026-01-21 11:46:17 -08:00

download.go

Clean up the manifest and modelpath (#13807 )

2026-01-21 11:46:17 -08:00

fixblobs_test.go

…

fixblobs.go

…

images_test.go

x/imagegen: add image edit capabilities (#13846 )

2026-01-22 20:35:08 -08:00

images.go

x/imagegen: add image edit capabilities (#13846 )

2026-01-22 20:35:08 -08:00

logprob.go

logprob: add bytes to logprobs (#13068 )

2025-11-13 13:49:25 -08:00

model.go

Clean up the manifest and modelpath (#13807 )

2026-01-21 11:46:17 -08:00

prompt_test.go

server: optimize chatPrompt to reduce tokenization calls (#14040 )

2026-02-04 01:21:31 -08:00

prompt.go

server: optimize chatPrompt to reduce tokenization calls (#14040 )

2026-02-04 01:21:31 -08:00

quantization_test.go

Reapply "feat: incremental gguf parser (#10822 )" (#11114 ) (#11119 )

2025-06-20 11:11:40 -07:00

quantization.go

model: add qwen3-next architecture (#14051 )

2026-02-03 23:27:21 -08:00

routes_aliases_test.go

cmd: claude launch improvements (#14064 )

2026-02-03 19:33:58 -08:00

routes_aliases.go

cmd: claude launch improvements (#14064 )

2026-02-03 19:33:58 -08:00

routes_create_test.go

Clean up the manifest and modelpath (#13807 )

2026-01-21 11:46:17 -08:00

routes_debug_test.go

server: use tiered VRAM-based default context length

2026-02-02 10:47:09 -08:00

routes_delete_test.go

Clean up the manifest and modelpath (#13807 )

2026-01-21 11:46:17 -08:00

routes_generate_renderer_test.go

server: use tiered VRAM-based default context length

2026-02-02 10:47:09 -08:00

routes_generate_test.go

server: use tiered VRAM-based default context length

2026-02-02 10:47:09 -08:00

routes_harmony_streaming_test.go

preserve tool definition and call JSON ordering (#13525 )

2026-01-05 18:03:36 -08:00

routes_list_test.go

…

routes_options_test.go

server: use tiered VRAM-based default context length

2026-02-02 10:47:09 -08:00

routes_test.go

server: return error when embedding contains NaN or Inf values (#13599 )

2026-01-03 02:20:12 -05:00

routes.go

cmd: claude launch improvements (#14064 )

2026-02-03 19:33:58 -08:00

sched_test.go

server: fix ollama ps showing configured instead of actual context length

2026-02-02 10:47:09 -08:00

sched.go

glm 4.7 flash support on experimental engine (#13838 )

2026-02-02 15:22:11 -08:00

sparse_common.go

…

sparse_windows.go

…

upload.go

Clean up the manifest and modelpath (#13807 )

2026-01-21 11:46:17 -08:00