llama.cpp

mirror of https://github.com/ggml-org/llama.cpp.git synced 2025-10-27 08:21:30 +00:00

Files

Georgi Gerganov 11ac9800af llama : improve infill support and special token detection (#9798 )

* llama : improve infill support

ggml-ci

* llama : add more FIM token strings

ggml-ci

* server : update prompt on slot restore (#9800)

* gguf : deprecate old FIM token KVs

2024-10-12 08:21:51 +03:00

llama.h

llama : improve infill support and special token detection (#9798 )

2024-10-12 08:21:51 +03:00