ggml : quantization refactoring (#3833)

mirror of https://github.com/ggml-org/llama.cpp.git synced 2025-11-07 09:57:00 +00:00

* ggml : factor all quantization code in ggml-quants

ggml-ci

* ggml-quants : fix Zig and Swift builds + quantize tool

ggml-ci

* quantize : --pure option for disabling k-quant mixtures

---------

Co-authored-by: cebtenzzre <cebtenzzre@gmail.com>

This commit is contained in:

Georgi Gerganov

2023-10-29 18:32:28 +02:00

committed by

GitHub

parent ff3bad83e2

commit d69d777c02

11 changed files with 2372 additions and 2385 deletions

5052

k_quants.c

View File

File diff suppressed because it is too large Load Diff

ggml : quantization refactoring (#3833)

5052 k_quants.c View File

5052

k_quants.c

View File