Files
llama.cpp/ggml/src/ggml-cuda/quantize.cu
Johannes Gäßler 5143fa895e CUDA: fastdiv, launch bounds for mmvq + q8_1 quant (#15802)
* CUDA: fastdiv, launch bounds for mmvq + q8_1 quant
2025-09-05 16:07:02 +02:00

6.5 KiB