mirror of
https://github.com/ollama/ollama.git
synced 2026-03-11 17:34:04 -05:00
When we later have a large batch running purely on a CPU, this results the error: GGML_ASSERT(talloc->buffer_id >= 0) Disabling this means that we will incrementally reallocate memory as the graph grows. Fixes #10410