* batched : add bench tool
* batched : minor fix table
* batched-bench : add readme + n_kv_max is now configurable
* batched-bench : init warm-up batch
* batched-bench : pass custom set of PP, TG and PL
* batched-bench : add mmq CLI arg
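The commits above make n_kv_max configurable, allow passing custom PP (prompt processing), TG (text generation) and PL (parallel) sets, and add an mmq CLI argument. A minimal invocation sketch under those assumptions is shown below; the argument order, the model path, and the additional positional values are hypothetical placeholders, not taken from this page — consult examples/batched-bench/README.md in the repo for the authoritative usage.

```sh
# Hypothetical sketch of running the batched-bench tool.
# Assumed positional arguments (order is an assumption, not confirmed here):
#   MODEL_PATH          path to a GGUF model file (placeholder below)
#   N_KV_MAX            max KV cache size, made configurable by these commits
#   MMQ                 the new mmq CLI arg (enable mul_mat_q-style kernels)
#   PP, TG, PL          comma-separated custom sets introduced by these commits
./batched-bench ./models/model-f16.gguf 2048 1 128,256,512 128,256 1,2,4,8,16,32
```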