llama.cpp/examples/server/server.cpp at 7a3895641c80f52c56d1cab3088f9dbabcec2fe9

mirror of https://github.com/ggml-org/llama.cpp.git synced 2025-11-03 09:22:01 +00:00

Files

Tobias Lütke 7a3895641c allow server to multithread

because web browsers send a lot of garbage requests we want the server
to multithread when serving 404s for favicon's etc. To avoid blowing up
llama we just take a mutex when it's invoked.

2023-07-04 09:14:49 -04:00

42 KiB

Raw Blame History

View Raw

42 KiB Raw Blame History

42 KiB

Raw Blame History