Files
llama.cpp/tools
Georgi Gerganov 13b339bcd9 server : do not default to multiple slots with speculative decoding (#17017)
* server : do not default to multiple slots with speculative decoding

* cont : fix
2025-11-05 14:32:55 +02:00
..
2025-09-22 09:11:39 +03:00