* kv cache slot search improvements
* Use n_ctx in the KV cache find-slot search for consistency
* Ensure the KV cache head points to a valid slot in llama_decode_internal
* Add comments to make the slot-search logic easier to follow
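
To illustrate what "find slot" and "head points to a valid slot" mean here, below is a minimal sketch of a contiguous slot search over a ring of KV cache cells, bounded by n_ctx as the commit describes. The `kv_cell`, `kv_cache`, and `kv_find_slot` names are simplified stand-ins, not llama.cpp's actual structures (the real cells also track positions and sequence IDs); it is a sketch of the technique, not the repository's implementation.

```cpp
#include <cstdint>
#include <vector>

// Hypothetical, simplified KV cache cell: a cell is either free or used.
struct kv_cell {
    bool used = false;
};

// Hypothetical, simplified KV cache: n_ctx cells plus a head index that
// must always point at a valid (in-range) slot after a successful search.
struct kv_cache {
    uint32_t n_ctx = 0;               // search bound, per the commit
    uint32_t head  = 0;               // start of the most recent slot
    std::vector<kv_cell> cells;       // cells.size() == n_ctx
};

// Find n_tokens consecutive free cells within the first n_ctx cells,
// starting from cache.head and wrapping to 0 when the window would run
// past the end. Returns false if no such run exists (cache is full or
// too fragmented).
static bool kv_find_slot(kv_cache & cache, uint32_t n_tokens) {
    if (n_tokens > cache.n_ctx) {
        return false; // the request can never fit
    }

    uint32_t head     = cache.head;
    uint32_t n_tested = 0;

    while (true) {
        if (head + n_tokens > cache.n_ctx) {
            // not enough room before the end of the buffer: wrap around
            n_tested += cache.n_ctx - head;
            head = 0;
            continue;
        }

        bool found = true;
        for (uint32_t i = 0; i < n_tokens; ++i) {
            if (cache.cells[head + i].used) {
                // collision: resume the search just past the occupied cell
                found     = false;
                head     += i + 1;
                n_tested += i + 1;
                break;
            }
        }

        if (found) {
            break;
        }
        if (n_tested >= cache.n_ctx) {
            return false; // scanned every cell without finding a run
        }
    }

    // claim the slot and leave head on a valid starting cell
    for (uint32_t i = 0; i < n_tokens; ++i) {
        cache.cells[head + i].used = true;
    }
    cache.head = head;

    return true;
}
```

Bounding the scan by n_ctx rather than by the raw buffer size keeps the search consistent with the logical context window, and updating `head` only on success is what guarantees the head always indexes a valid slot for the caller.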