ollama

mirror of https://github.com/ollama/ollama.git synced 2026-03-09 07:16:38 -05:00

Files

Jesse Gross 638faeac54 mlxrunner: Report actual memory usage from runner

The MLX runner previously reported a static VRAM estimate that was
computed at load time and consisted only of the weights. This is
strictly less than the actual memory usage, as it does not include
the KV cache or compute graph.

2026-02-27 17:29:47 -08:00

llm_darwin.go

Optimize container images for startup (#6547)

2024-09-12 12:10:30 -07:00

llm_linux.go

Optimize container images for startup (#6547)

2024-09-12 12:10:30 -07:00

llm_windows.go

win: lint fix (#10571)

2025-05-05 11:08:12 -07:00

server_test.go

llm: Don't always evict models on CPU-only systems

2025-12-02 10:58:08 -08:00

server.go

mlxrunner: Report actual memory usage from runner

2026-02-27 17:29:47 -08:00

status.go

logs: catch rocm errors (#12888)

2025-10-31 09:54:25 -07:00