llama.cpp/ggml-opencl.cpp at 0711a5f6dce7f04c2a791b14bc47f7d4cb545408

mirror of https://github.com/ggml-org/llama.cpp.git synced 2025-11-05 09:36:52 +00:00

Files

0cc4m d411968e99 opencl : support k-quants (#1836 )

* Porting q2_k kernel to OpenCL

* Set global and local sizes for kernel calls for dequantizing k-quants

* Added q6_k kernel

* Fix q4_k opencl struct order

* Replace uchar with uint8_t

* Finish dequant kernels

* Added OpenCL DMMV kernels

* Fix q2_k, improve code

* Fix q3_k

* Shorten switch statements

* Improve code formatting

---------

Co-authored-by: Concedo <39025047+LostRuins@users.noreply.github.com>

2023-06-16 21:59:49 +03:00

60 KiB

Raw Blame History

View Raw

60 KiB Raw Blame History

60 KiB

Raw Blame History