llama.cpp/examples/server/bench/bench.py at ecef206ccb186a1cde8dd2523b1da3e12f593f9e

mirror of https://github.com/ggml-org/llama.cpp.git synced 2025-11-03 09:22:01 +00:00

Files

Pierrick Hymbert 2f0ee84b9b server: bench: minor fixes (#10765 )

* server/bench:
- support openAI streaming standard output with [DONE]\n\n
- export k6 raw results in csv
- fix too many tcp idle connection in tcp_wait
- add metric time to emit first token

* server/bench:
- fix when prometheus not started
- wait for server to be ready before starting bench

2025-01-02 18:06:12 +01:00

13 KiB

Raw Blame History

View Raw

13 KiB Raw Blame History

13 KiB

Raw Blame History