ollama37/ml/backend/ggml/ggml.go at bab6f34dc0f441c6a18b7cbc2465e1b386cf613e

mirror of https://github.com/dogkeeper886/ollama37.git synced 2025-12-11 08:17:03 +00:00

Michael Yang bab6f34dc0 ml/backend/ggml: update model loading for hybrid/multi backends

use a similar strategy as llama.cpp for deciding where tensors should be
allocated. this will be improved later to be aware of usable memory
before assigning the tensor

2025-03-07 14:08:21 -08:00

20 KiB
