llama.cpp/examples/server/bench/script.js at 43f2b07193cbcccd266734320ea9b948f5a01926

mirror of https://github.com/ggml-org/llama.cpp.git synced 2025-10-30 08:42:00 +00:00

Files

Pierrick Hymbert 2f0ee84b9b server: bench: minor fixes (#10765 )

* server/bench:
- support openAI streaming standard output with [DONE]\n\n
- export k6 raw results in csv
- fix too many tcp idle connection in tcp_wait
- add metric time to emit first token

* server/bench:
- fix when prometheus not started
- wait for server to be ready before starting bench

2025-01-02 18:06:12 +01:00

6.2 KiB

Raw Blame History

View Raw

6.2 KiB Raw Blame History

6.2 KiB

Raw Blame History