llama.cpp/ggml-metal.m at a0edf73bda31c7c4e649e6f07c6fd30a729929cd

mirror of https://github.com/ggml-org/llama.cpp.git synced 2025-11-19 11:57:07 +00:00

Jhen-Jie Hong c67fe68e41 metal : implement q5_0 and q5_1 kernels (#3648)

* metal : implement dequantize_q5_0

* metal : block_q_n_dot_y for block_q5_0 (broken)

* metal : revert unnecessary change

* metal : implement dequantize_q5_1

* metal : block_q_n_dot_y for q5_1 (broken)

* metal : fix block_q_n_dot_y

* minor : spaces / formatting

---------

Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>

2023-10-18 15:21:48 +03:00

82 KiB
