llama.cpp/ggml-opencl.c at 56551bc11f46b2716fdf61bb48ac28414889dc0a

mirror of https://github.com/ggml-org/llama.cpp.git synced 2025-11-03 09:22:01 +00:00

Files

0cc4m 76a884920a ggml : add CLBlast q5_0, q5_1, q8_0 dequant kernels (#1225 )

* Implement q5_0, q5_1 and q8_0

* Work around q5_0 OpenCL issue

* Fix q8_0 dequant kernel

* Move cl kernels into ggml-opencl.c

* Use two memcpy calls for q5_0 buffer transfer

2023-04-30 21:34:52 +03:00

13 KiB

Raw Blame History

View Raw

13 KiB Raw Blame History

13 KiB

Raw Blame History