llama.cpp/llama.cpp
Georgi Gerganov d7b800b8bc llama : pad KV cache size (#4280)
* llama : pad KV cache size to 32

* metal : try to improve batched decoding
2023-12-03 10:58:16 +02:00

372 KiB