llama.cpp/k_quants.c at ccd81a751bfd6f313d5bea7ea20cd2eee3ee53b0

mirror of https://github.com/ggml-org/llama.cpp.git synced 2025-10-30 08:42:00 +00:00

Files

katsu560 be2301bcda k_quants : add AVX support to dot functions with QK_K as 64 (#2339 )

* add AVX to ggml_vec_dot_q2_K_q8_K()

* add AVX to ggml_vec_dot_q3_K_q8_K()

* add AVX to ggml_vec_dot_q4_K_q8_K()

* add AVX to ggml_vec_dot_q5_K_q8_K()

* add AVX to ggml_vec_dot_q6_K_q8_K()

* refactor AVX code in ggml_vec_dot_q6_K_q8_K()

2023-07-25 15:13:41 +03:00

167 KiB

Raw Blame History

View Raw

167 KiB Raw Blame History

167 KiB

Raw Blame History