Files
ollama37/ml/backend/ggml
Jesse Gross 9a43994c45 ggml: Disable unused pipeline parallelism
We're not currently using it, even in cases where we could. Disabling
it improves generation performance by 10-30% with multiple GPUs.
2025-07-11 13:30:05 -07:00
..
2025-03-11 14:49:19 -07:00
2025-03-11 14:49:19 -07:00