llama.cpp/ggml-metal.m at d0f77b1353fc820d1ff1e6b87bc6bedde315938d

mirror of https://github.com/ggml-org/llama.cpp.git synced 2025-11-17 11:37:10 +00:00

Files

Georgi Gerganov d67777c202 metal : add Q8_0 support (#2763 )

* metal : add dequantize_q8_0 kernel

* metal : add mul_mat_q8_0_f32 kernel

* metal : add Q8_0 mul_mm kernel

2023-08-24 16:19:57 +03:00

View Raw