llama.cpp/ggml/src/ggml-sycl/dequantize.hpp at c887d8b01726b11ea03dbcaa9d44fa74422d0076

mirror of https://github.com/ggml-org/llama.cpp.git synced 2025-11-16 11:27:03 +00:00

Files

AidanBeltonS fadde67135 Dequant improvements rebase (#8255 )

* Single load for half2

* Store scales in local mem

* Vec load quantized values

2024-07-03 09:55:34 +08:00

View Raw