batched : add bench tool (#3545)

* batched : add bench tool
* batched : minor fix table
* batched-bench : add readme + n_kv_max is now configurable
* batched-bench : init warm-up batch
* batched-bench : pass custom set of PP, TG and PL
* batched-bench : add mmq CLI arg
2025-10-27 08:21:30 +00:00 · 2023-10-11 21:25:33 +03:00
parent 24ba3d829e
commit 8c70a5ff25
7 changed files with 321 additions and 3 deletions
--- a/.gitignore
+++ b/.gitignore
@@ -55,6 +55,7 @@ models-mnt
 /server
 /simple
 /batched
+/batched-bench
 /export-lora
 /finetune
 /speculative