mirror of https://github.com/ggml-org/llama.cpp.git (synced 2025-10-31 08:51:55 +00:00)

Commit 38566680cd
* ggml : add IQ2 to test-backend-ops + refactoring ggml-ci
* cuda : update supports_op for IQ2 ggml-ci; see the sketch below
* ci : enable LLAMA_CUBLAS=1 for CUDA nodes ggml-ci
* cuda : fix out-of-bounds access in `mul_mat_vec_q` ggml-ci
* tests : avoid creating RNGs for each Q tensor ggml-ci
* tests : avoid creating RNGs for each tensor ggml-ci; see the sketch below
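The `supports_op` bullet above presumably extends the CUDA backend's type whitelist to the new IQ2 quantizations. A minimal sketch of that kind of check, assuming a hypothetical helper name (`cuda_supports_quant_type` is not the real function, and the list of cases is abbreviated; `GGML_TYPE_IQ2_XXS` and `GGML_TYPE_IQ2_XS` are the actual ggml IQ2 type enums):

```cpp
#include "ggml.h"

// Hypothetical sketch: the CUDA backend reports which quantization types its
// matrix-multiply path can handle; the IQ2 variants are added to that list.
static bool cuda_supports_quant_type(enum ggml_type type) {
    switch (type) {
        case GGML_TYPE_F32:
        case GGML_TYPE_F16:
        case GGML_TYPE_Q4_0:
        case GGML_TYPE_Q8_0:
        case GGML_TYPE_Q4_K:
        case GGML_TYPE_Q6_K:
        case GGML_TYPE_IQ2_XXS:   // newly supported IQ2 type
        case GGML_TYPE_IQ2_XS:    // newly supported IQ2 type
            return true;
        default:
            return false;
    }
}
```

The two `tests :` bullets describe reusing one random generator instead of constructing a new one per tensor. A self-contained sketch of that pattern (illustrative only, not the actual test-backend-ops code):

```cpp
#include <random>
#include <vector>

// One generator is created once and reused for every tensor's data,
// instead of seeding a fresh std::mt19937 per tensor.
static void fill_random(std::vector<float> & data, std::mt19937 & rng) {
    std::uniform_real_distribution<float> dist(-1.0f, 1.0f);
    for (float & v : data) {
        v = dist(rng);
    }
}

int main() {
    std::mt19937 rng(1234);  // single shared RNG, created once
    std::vector<std::vector<float>> tensors(8, std::vector<float>(1024));
    for (auto & t : tensors) {
        fill_random(t, rng);  // same generator reused for every tensor
    }
    return 0;
}
```

Reusing the generator keeps the per-tensor setup cost constant and makes the whole test data set reproducible from a single seed.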