llama.cpp/examples/imatrix/imatrix.cpp at 05490fad7f7f60ff2bed9ad05cd81b44e82ccde3

mirror of https://github.com/ggml-org/llama.cpp.git synced 2025-11-01 09:01:57 +00:00

Kawrakow 726c0fa9a2 Slightly faster imatrix (#5050)

* imatrix: speedup by avoiding unnecessary allocations and copies

* imatrix: add --no-ppl option to skip PPL calculations altogether

---------

Co-authored-by: Iwan Kawrakow <iwan.kawrakow@gmail.com>

2024-01-21 08:01:20 +02:00

17 KiB
