ollama37/server/sched.go at 345420998e90090d2d6fba38ad5c2f3f5512adf4

mirror of https://github.com/dogkeeper886/ollama37.git synced 2025-12-12 00:37:04 +00:00

Files

Daniel Hiltgen 345420998e Prevent partial loading on mixed GPU brands

In mult-brand GPU setups, if we couldn't fully load the model we
would fall through the scheduler and mistakenly try to load across
a mix of brands.  This makes sure we find the set of GPU(s) that
best fit for the partial load.

2024-07-30 11:00:55 -07:00

28 KiB

Raw Blame History

View Raw

28 KiB Raw Blame History

28 KiB

Raw Blame History