Files
llama.cpp/ggml/src/ggml-sycl/dequantize.hpp
AidanBeltonS fadde67135 Dequant improvements rebase (#8255)
* Single load for half2

* Store scales in local mem

* Vec load quantized values
2024-07-03 09:55:34 +08:00

23 KiB