llama.cpp/tests/test-backend-ops.cpp at a31cf36ad946a13b3a646bf0dadf2a481e89f944

mirror of https://github.com/ggml-org/llama.cpp.git synced 2025-10-30 08:42:00 +00:00

Georgi Gerganov 0a319bb75e metal : add support for non-padded FA KV (#16148)

* metal : pad K, V and Mask when needed

* cont : simplify

* cuda : add TODO about KV padding requirement

* metal : add comments

* metal : remove mask padding requirement

2025-10-07 08:23:30 +03:00

277 KiB
