mirror of
https://github.com/ggml-org/llama.cpp.git
synced 2025-11-05 09:36:52 +00:00
* server : add speculative decoding support

  ggml-ci

* server : add helper function slot.can_speculate()

  ggml-ci
138 KiB