llama.cpp/tests/test-backend-ops.cpp at 20cc625edc2264aae2779e71bef1593e6a4e8c43

mirror of https://github.com/ggml-org/llama.cpp.git synced 2025-11-09 10:17:06 +00:00

Georgi Gerganov 0a319bb75e metal : add support for non-padded FA KV (#16148)

* metal : pad K, V and Mask when needed

* cont : simplify

* cuda : add TODO about KV padding requirement

* metal : add comments

* metal : remove mask padding requirement

2025-10-07 08:23:30 +03:00

277 KiB
