llama.cpp/examples/server/server.cpp at 1f922254f0c984a8fb9fbaa0c390d7ffae49aedb

mirror of https://github.com/ggml-org/llama.cpp.git synced 2025-11-05 09:36:52 +00:00

Georgi Gerganov 9ca2e67762 server : add speculative decoding support (#10455)

* server : add speculative decoding support

ggml-ci

* server : add helper function slot.can_speculate()

ggml-ci

2024-11-25 16:31:38 +02:00
