Mirror of https://github.com/ggml-org/llama.cpp.git (synced 2025-11-10 10:27:03 +00:00)
* bench : cache llama_context state at depth
* cont : handle failures to restore the old state
* cont : print information when the state is being reused
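
A minimal sketch of what such a cache could look like, using the public `llama_state_get_size` / `llama_state_get_data` / `llama_state_set_data` functions from `llama.h`. The `state_cache` struct, its methods, and the printed messages are illustrative assumptions, not the actual code of the bench tool; they only show the idea of snapshotting the context state once the prompt at a given depth has been processed, restoring it before later runs, and falling back to re-processing the prompt if the restore fails:

```cpp
// Hypothetical helper: cache the serialized llama_context state for one
// prompt depth and restore it instead of re-processing the prompt.
#include <cstdint>
#include <cstdio>
#include <vector>

#include "llama.h"

struct state_cache {
    std::vector<uint8_t> data; // serialized llama_context state
    int depth = -1;            // prompt depth the snapshot corresponds to

    // snapshot the current context state for the given depth
    bool save(llama_context * ctx, int cur_depth) {
        const size_t size = llama_state_get_size(ctx);
        data.resize(size);
        if (llama_state_get_data(ctx, data.data(), data.size()) != size) {
            data.clear();
            depth = -1;
            return false;
        }
        depth = cur_depth;
        return true;
    }

    // try to restore a previously saved state; on failure the caller
    // falls back to re-processing the prompt from scratch
    bool try_restore(llama_context * ctx, int cur_depth) {
        if (depth != cur_depth || data.empty()) {
            return false;
        }
        if (llama_state_set_data(ctx, data.data(), data.size()) == 0) {
            fprintf(stderr, "failed to restore cached state, reprocessing prompt\n");
            return false;
        }
        fprintf(stderr, "reusing cached llama_context state at depth %d\n", cur_depth);
        return true;
    }
};
```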