Daniel Hiltgen
424810450f
Move quantization to new backend ( #10363 )
...
* Move quantization logic to GGML via new backend
This moves the model aware logic to Go code and calls GGMLs quantization code for model creation.
* Remove "add model quantizations"
This is no longer needed now that quantization is implemented in Go+GGML code directly.
2025-05-06 11:20:48 -07:00
..
2025-04-16 15:14:01 -07:00
2025-04-16 15:14:01 -07:00
2025-05-06 11:20:48 -07:00
2025-05-06 11:20:48 -07:00
2025-01-29 15:03:38 -08:00
2025-04-16 15:14:01 -07:00
2025-05-01 18:24:09 -07:00
2025-05-01 18:24:09 -07:00
2025-05-01 18:24:09 -07:00
2025-05-01 18:24:09 -07:00
2025-01-29 15:03:38 -08:00
2025-04-16 15:14:01 -07:00
2025-04-24 11:51:19 -07:00
2025-04-24 11:51:19 -07:00
2025-05-01 18:24:09 -07:00
2025-05-01 18:24:09 -07:00
2025-04-16 15:14:01 -07:00
2025-05-01 18:24:09 -07:00
2025-02-26 20:34:44 -08:00
2025-02-26 20:34:44 -08:00
2025-04-16 15:14:01 -07:00
2025-04-16 15:14:01 -07:00
2025-04-24 17:26:02 -07:00
2025-04-16 15:14:01 -07:00
2025-04-16 15:14:01 -07:00
2025-04-16 15:14:01 -07:00
2025-04-16 15:14:01 -07:00
2025-02-26 20:34:44 -08:00
2025-04-16 15:14:01 -07:00
2025-04-16 15:14:01 -07:00
2025-05-06 11:20:48 -07:00
2025-05-01 18:24:09 -07:00
2025-05-06 11:20:48 -07:00
2025-01-29 15:03:38 -08:00
2025-05-01 18:24:09 -07:00
2025-02-26 20:34:44 -08:00
2025-05-01 18:24:09 -07:00
2025-02-26 20:34:44 -08:00
2025-04-16 15:14:01 -07:00
2025-01-31 10:25:39 -08:00
2025-01-29 15:03:38 -08:00
2025-01-29 15:03:38 -08:00
2025-04-16 15:14:01 -07:00
2025-01-29 15:03:38 -08:00