ollama37/llm/memory.go at a2cc8571c5b2b8f77a8a5e2f65cb7aaa56482dc4

mirror of https://github.com/dogkeeper886/ollama37.git synced 2025-12-13 09:17:02 +00:00

Jesse Gross a2cc8571c5 llm: Consistently track unassigned model data

In some cases, if we fail to assign a piece of the model to a GPU then
we lose track of this data. Although it doesn't change the memory
allocation, it does affect the total size of the model reported by
tools such as ollama ps (and also the percent offloaded).

This can make it look as if setting num_gpu is not reflected in ollama ps.
The setting does take effect, but the reported offload percentage may
appear unchanged.

Spreading the model across more GPUs will continue to impact the
reported total size of the model.
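
The accounting issue described above can be illustrated with a minimal sketch. This is not the code in memory.go: the `layer` and `estimate` types, the `place` function, and the single-GPU setup are hypothetical and chosen only for illustration. The sketch assumes that data which does not fit on the GPU is added to an overflow counter instead of being dropped, so the total size stays the same while the offload percentage changes.

```go
package main

import "fmt"

// layer is a hypothetical stand-in for one piece of model data.
type layer struct {
	name string
	size uint64 // size in bytes
}

// estimate is a hypothetical summary of where the model data ended up.
type estimate struct {
	gpuBytes uint64 // data assigned to the GPU
	overflow uint64 // data that could not be assigned to the GPU
}

// total reports the full model size, including unassigned data.
func (e estimate) total() uint64 { return e.gpuBytes + e.overflow }

// percentOffloaded reports the share of the model placed on the GPU.
func (e estimate) percentOffloaded() float64 {
	if e.total() == 0 {
		return 0
	}
	return 100 * float64(e.gpuBytes) / float64(e.total())
}

// place walks the layers in order and assigns each one to a single GPU
// with `free` bytes available. A layer that does not fit is counted in
// overflow rather than silently dropped from the accounting.
func place(layers []layer, free uint64) estimate {
	var e estimate
	for _, l := range layers {
		if l.size <= free {
			free -= l.size
			e.gpuBytes += l.size
		} else {
			e.overflow += l.size
		}
	}
	return e
}

func main() {
	layers := []layer{{"blk.0", 100}, {"blk.1", 100}, {"blk.2", 100}, {"output", 100}}
	e := place(layers, 250)
	fmt.Printf("total=%d offloaded=%.0f%%\n", e.total(), e.percentOffloaded())
}
```

If the `else` branch were omitted, the unassigned layers would vanish from `total()`, and the reported percentage would stay at 100 no matter how little of the model was placed on the GPU.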

2025-05-19 09:52:48 -07:00

12 KiB
