Kerfuffle 5df7d06c42 llama : allow exporting a view of the KV cache (#4180)
* Allow exporting a view of the KV cache

* Allow dumping the sequences per cell in common

* Track max contiguous cells value and position as well

* Fix max contiguous empty cells index calculation

Make dump functions deal with lengths or sequences counts > 10 better

* Fix off by one error in dump_kv_cache_view

* Add doc comments for KV cache view functions

Eliminate cell sequence struct; use llama_seq_id directly

Minor cleanups
2023-11-23 18:31:20 +02:00

llama.cpp/examples/parallel

A simplified simulation of serving incoming requests in parallel.
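
To give a rough idea of what such a simulation involves, here is a minimal sketch in plain Python. It is not the code of this example and does not use the llama.cpp API. It assumes a fixed number of slots, a queue of pending requests, and one generated token per active slot on each batched step; `Request`, `simulate`, `token_counts` and `n_slots` are hypothetical names chosen for illustration.

```python
from collections import deque
from dataclasses import dataclass
from typing import Optional


@dataclass
class Request:
    req_id: int
    n_tokens: int  # tokens this request still has to generate (hypothetical field)


def simulate(token_counts: list[int], n_slots: int) -> tuple[int, list[int]]:
    """Serve requests with at most n_slots in flight.

    Returns the number of batched steps taken and the order in which
    requests finished. Illustrative only: no model is evaluated.
    """
    if n_slots < 1:
        raise ValueError("n_slots must be at least 1")

    pending = deque(Request(i, n) for i, n in enumerate(token_counts))
    slots: list[Optional[Request]] = [None] * n_slots
    finished: list[int] = []
    steps = 0

    while pending or any(s is not None for s in slots):
        # Hand waiting requests to any free slot before the next step.
        for i in range(n_slots):
            if slots[i] is None and pending:
                slots[i] = pending.popleft()

        # One batched step: every active slot produces one token.
        steps += 1
        for i, req in enumerate(slots):
            if req is None:
                continue
            req.n_tokens -= 1
            if req.n_tokens <= 0:
                finished.append(req.req_id)
                slots[i] = None  # the slot can take a new request next step

    return steps, finished


if __name__ == "__main__":
    steps, order = simulate([3, 1, 2, 2], n_slots=2)
    print(f"steps={steps} finish_order={order}")
```

With two slots, the four requests above complete in 5 batched steps, compared with 8 steps when `n_slots` is 1, because a freed slot is refilled as soon as its request finishes.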