llama.cpp/examples/eval-callback/eval-callback.cpp at 196f5083efe636ceaf247aa4dca5593c6c2b743f

mirror of https://github.com/ggml-org/llama.cpp.git synced 2025-11-21 12:16:57 +00:00

Georgi Gerganov 196f5083ef common : more accurate sampling timing (#17382)

* common : more accurate sampling timing

* eval-callback : minor fixes

* cont : add time_meas impl

* cont : fix log msg [no ci]

* cont : fix multiple definitions of time_meas

* llama-cli : exclude chat template init from time measurement

* cont : print percentage of unaccounted time

* cont : do not reset timings

2025-11-20 13:40:10 +02:00

7.1 KiB
