Files
ollama/x/mlxrunner
Jesse Gross a60b9adcce mlxrunner: Fix prompt eval timing and count metrics
Only the last token's processing time is included in prompt processing,
giving an artificially high rate. In addition, the number of tokens
only included the tokens that miss the cache, instead of our historic
total tokens.
2026-02-27 17:29:47 -08:00
..
2026-02-26 18:38:27 -08:00
2026-02-13 22:30:42 -08:00