mirror of
https://github.com/ollama/ollama.git
synced 2026-03-11 17:34:04 -05:00
If there is a CUDA OOM, we currently don't check the return value and will evetually segfault. This checks for the problem and generates a Go error. At the moment, this will still result in a panic but having the error is the first step to being able to handle it more gracefully.