Files
llama.cpp/common
Georgi Gerganov 13b339bcd9 server : do not default to multiple slots with speculative decoding (#17017)
* server : do not default to multiple slots with speculative decoding

* cont : fix
2025-11-05 14:32:55 +02:00
..
2025-10-27 23:54:01 +01:00
2025-05-30 16:25:45 +03:00