llama.cpp/ggml-opencl.c at 3924088512d9e12e90ed6dbf28a6c5712481d33e

mirror of https://github.com/ggml-org/llama.cpp.git synced 2025-10-30 08:42:00 +00:00

Files

0cc4m 76a884920a ggml : add CLBlast q5_0, q5_1, q8_0 dequant kernels (#1225 )

* Implement q5_0, q5_1 and q8_0

* Work around q5_0 OpenCL issue

* Fix q8_0 dequant kernel

* Move cl kernels into ggml-opencl.c

* Use two memcpy calls for q5_0 buffer transfer

2023-04-30 21:34:52 +03:00

13 KiB

Raw Blame History

View Raw

13 KiB Raw Blame History

13 KiB

Raw Blame History