llama.cpp/ggml-metal.m at 0248ca811e076ac0017e4cb35651ca6b57c3bfd5

mirror of https://github.com/ggml-org/llama.cpp.git synced 2025-11-15 11:17:31 +00:00


Georgi Gerganov d67777c202 metal : add Q8_0 support (#2763)

* metal : add dequantize_q8_0 kernel

* metal : add mul_mat_q8_0_f32 kernel

* metal : add Q8_0 mul_mm kernel

2023-08-24 16:19:57 +03:00
