llama.cpp/ggml-quants.c at 2774b0c97427ee3ad3e2ee121354d078794e89d9

mirror of https://github.com/ggml-org/llama.cpp.git synced 2025-11-01 09:01:57 +00:00

Files

Kawrakow 7c4263d426 ggml : make i-quants work with super-blocks of 64 (CPU,Metal) (#5760 )

* WIP: make i-quants work for QK_K = 64

* iq2_xs: attempt to fix AVX dot product for QK_K = 64

Tests pass, but I get gibberish.

* QK_K = 64 tests pass on ARM_NEON and Metal

Sadly, that does not mean it actually works.

* Make CUDA compile with QK_K = 64

Tests don't pass, plus we get misaligned access

* Q2_K: fixed bug in imatrix quantization for QK_K = 64

* iq1_s: turn off SIMD implementation for QK_K = 64 (it does not work)

---------

Co-authored-by: Iwan Kawrakow <iwan.kawrakow@gmail.com>

2024-02-28 10:37:02 +02:00

532 KiB

Raw Blame History

View Raw

532 KiB Raw Blame History

532 KiB

Raw Blame History