Files
ollama37/llama/patches/0016-add-ollama-vocab-for-grammar-support.patch
Daniel Hiltgen 424810450f Move quantization to new backend (#10363)
* Move quantization logic to GGML via new backend

This moves the model aware logic to Go code and calls GGMLs quantization code for model creation.

* Remove "add model quantizations"

This is no longer needed now that quantization is implemented in Go+GGML code directly.
2025-05-06 11:20:48 -07:00

8.4 KiB