mirror of
https://github.com/ggml-org/llama.cpp.git
synced 2025-11-04 09:32:00 +00:00
* server: enrich health endpoint with available slots, return 503 if not slots are available * server: document new status no slot available in the README.md
122 KiB
122 KiB