llama.cpp/examples/server/bench/prometheus.yml at ee02ad02c56ff36a5edd22d8617ab3f9546ce7fe - llama.cpp - Gitea - Peisong Xiao

CS348Project/llama.cpp

mirror of https://github.com/ggml-org/llama.cpp.git synced 2025-11-03 09:22:01 +00:00

Files

Pierrick Hymbert a016026a3a server: continuous performance monitoring and PR comment (#6283 )

* server: bench: init

* server: bench: reduce list of GPU nodes

* server: bench: fix graph, fix output artifact

* ci: bench: add mermaid in case of image cannot be uploaded

* ci: bench: more resilient, more metrics

* ci: bench: trigger build

* ci: bench: fix duration

* ci: bench: fix typo

* ci: bench: fix mermaid values, markdown generated

* typo on the step name

Co-authored-by: Xuan Son Nguyen <thichthat@gmail.com>

* ci: bench: trailing spaces

* ci: bench: move images in a details section

* ci: bench: reduce bullet point size

---------

Co-authored-by: Xuan Son Nguyen <thichthat@gmail.com>

2024-03-27 20:26:49 +01:00

10 lines

183 B

YAML

Raw Blame History

 global:
   scrape_interval:     10s
   external_labels:
     llamacpp: 'server'
 scrape_configs:
   - job_name: 'llama.cpp server'
     static_configs:
       - targets: ['localhost:8080']