Jesse Gross
4d5ff25724
mlxrunner: Report actual memory usage from runner
The MLX runner previously reported a static VRAM estimate that was
computed at load time and consisted only of the weights. This is
strictly less than the actual memory usage, as it does not include
the KV cache or compute graph.
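
The gap between the two figures can be sketched as below. This is only an illustration of the accounting the message describes: the type and method names (`MemoryBreakdown`, `StaticEstimate`, `Actual`) are hypothetical and not taken from the mlxrunner source, and the real runner may obtain these numbers differently.

```go
package main

// MemoryBreakdown is a hypothetical type used for illustration only; it is
// not part of the mlxrunner code. All sizes are in bytes.
type MemoryBreakdown struct {
	Weights uint64 // model weights, known at load time
	KVCache uint64 // key/value cache, allocated at run time
	Graph   uint64 // compute graph buffers
}

// StaticEstimate mirrors the previous behavior: it counts the weights only.
func (m MemoryBreakdown) StaticEstimate() uint64 {
	return m.Weights
}

// Actual adds the components that the static estimate left out.
func (m MemoryBreakdown) Actual() uint64 {
	return m.Weights + m.KVCache + m.Graph
}
```

Whenever the KV cache or compute graph occupies any memory, `Actual` exceeds `StaticEstimate`, which is why the load-time figure under-reported usage.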
2026-02-25 15:06:37 -08:00