llama.cpp/sgemm.cpp at db10f01310beea8a1ef7798651b9d692fd1149d0

mirror of https://github.com/ggml-org/llama.cpp.git synced 2025-10-30 08:42:00 +00:00

Files

Eve 465263d0cf sgemm : AVX Q4_0 and Q8_0 (#6891 )

* basic avx implementation

* style

* combine denibble with load

* reduce 256 to 128 (and back!) conversions

* sse load

* Update sgemm.cpp

* oops

oops

2024-05-08 17:29:23 +03:00

31 KiB

Raw Blame History

View Raw

31 KiB Raw Blame History

31 KiB

Raw Blame History