llama.cpp/ggml-quants.c at d752327c3338d5b9634121d651c0105f2c933f9b

mirror of https://github.com/ggml-org/llama.cpp.git synced 2025-11-03 09:22:01 +00:00

Files

Kawrakow cbc8343619 Make IQ1_M work for QK_K = 64 (#6327 )

* iq1_m: make it work for QK_K = 64 (WIP)

* iq1_m: make it work for QK_K = 64 (scalar and AVX2)

* iq1_m: QK_K = 64 seems to work on Metal and ARM_NEON

---------

Co-authored-by: Iwan Kawrakow <iwan.kawrakow@gmail.com>

2024-03-27 08:44:27 +01:00

513 KiB

Raw Blame History

View Raw

513 KiB Raw Blame History

513 KiB

Raw Blame History