Files
llama.cpp/ggml
Georgi Gerganov b3964c1e89 metal : optimize FA vec for large sequences and BS <= 8 (#15566)
* metal : optimize FA vec for large heads and sequences

* metal : adjust small-batch mul mv kernels

ggml-ci

* batched-bench : fix total speed computation

ggml-ci

* cont : add comments

ggml-ci
2025-08-26 14:22:14 +03:00