ollama37/integration/concurrency_test.go at 1b44d873e74f62de4f53f154da386919c1426f8b

mirror of https://github.com/dogkeeper886/ollama37.git synced 2025-12-10 07:46:59 +00:00

Daniel Hiltgen cc269ba094 Remove no longer supported max vram var

The OLLAMA_MAX_VRAM env var was a temporary workaround for OOM
scenarios. With concurrency support this was no longer wired up, and the
simplistic value doesn't map to multi-GPU setups. Users can still set
`num_gpu` to limit memory usage and avoid OOM if we get our predictions wrong.

2024-07-22 09:08:11 -07:00
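
As a rough illustration of the `num_gpu` alternative mentioned in the commit message, the sketch below builds a JSON request body that passes `num_gpu` under `options`, in the shape accepted by Ollama's generate endpoint. The struct, the helper name `buildRequest`, the model name, and the value `20` are all assumptions made for this example and are not taken from `concurrency_test.go`.

```go
package main

import (
	"encoding/json"
	"fmt"
)

// generateRequest is a hypothetical, trimmed-down request body for this
// sketch; it only carries the fields needed to show the num_gpu option.
type generateRequest struct {
	Model   string         `json:"model"`
	Prompt  string         `json:"prompt"`
	Options map[string]any `json:"options,omitempty"`
}

// buildRequest marshals a request that caps GPU offload via num_gpu.
// numGPU is the number of model layers to place on the GPU; a lower
// value keeps more of the model in system memory.
func buildRequest(model, prompt string, numGPU int) ([]byte, error) {
	req := generateRequest{
		Model:   model,
		Prompt:  prompt,
		Options: map[string]any{"num_gpu": numGPU},
	}
	return json.Marshal(req)
}

func main() {
	// "llama3" and 20 are placeholder values chosen for illustration.
	body, err := buildRequest("llama3", "why is the sky blue?", 20)
	if err != nil {
		panic(err)
	}
	fmt.Println(string(body))
}
```

The resulting body could be sent in a POST to a running server's generate endpoint. Choosing a suitable `num_gpu` value depends on the model and the available VRAM, so treat the number here as a starting point to tune rather than a recommendation.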

6.1 KiB
