ollama37

matt/ollama37

Fork 0

mirror of https://github.com/dogkeeper886/ollama37.git synced 2025-12-10 15:57:04 +00:00

Commit Graph

Author	SHA1	Message	Date
Daniel Hiltgen	424810450f	Move quantization to new backend (#10363 ) * Move quantization logic to GGML via new backend This moves the model aware logic to Go code and calls GGMLs quantization code for model creation. * Remove "add model quantizations" This is no longer needed now that quantization is implemented in Go+GGML code directly.	2025-05-06 11:20:48 -07:00
Michael Yang	a7835c6716	fix: write gguf padding (#10510 ) * add gguf_test * fix padding padding was being added to offset but not to the running count	2025-04-30 17:59:31 -07:00

Author

SHA1

Message

Date

Daniel Hiltgen

424810450f

Move quantization to new backend (#10363 )

* Move quantization logic to GGML via new backend

This moves the model aware logic to Go code and calls GGMLs quantization code for model creation.

* Remove "add model quantizations"

This is no longer needed now that quantization is implemented in Go+GGML code directly.

2025-05-06 11:20:48 -07:00

Michael Yang

a7835c6716

fix: write gguf padding (#10510 )

* add gguf_test

* fix padding

padding was being added to offset but not to the running count

2025-04-30 17:59:31 -07:00

2 Commits