llama.cpp/examples/server/server.cpp at 24a447e20af425fa44cf10feaa632b6bb596c80f

mirror of https://github.com/ggml-org/llama.cpp.git synced 2025-10-31 08:51:55 +00:00

Files

Justine Tunney db49ff8ed7 server : replace sleep with condition variables (#4673 )

The server currently schedules tasks using a sleep(5ms) busy loop. This
adds unnecessary latency since most sleep implementations do a round up
to the system scheduling quantum (usually 10ms). Other libc sleep impls
spin for smaller time intervals which results in the server's busy loop
consuming all available cpu. Having the explicit notify() / wait() code
also helps aid in the readability of the server code.

See mozilla-Ocho/llamafile@711344b

2023-12-29 16:24:12 +02:00

115 KiB

Raw Blame History

View Raw

115 KiB Raw Blame History

115 KiB

Raw Blame History