llama.cpp/tests/test-backend-ops.cpp at 3f750f8d760ab5a61491e6a9409072dfeee4b4d7

mirror of https://github.com/ggml-org/llama.cpp.git synced 2025-11-01 09:01:57 +00:00

Files

Georgi Gerganov 0a319bb75e metal : add support for non-padded FA KV (#16148 )

* metal : pad K, V and Mask when needed

* cont : simplify

* cuda : add TODO about KV padding requirement

* metal : add comments

* metal : remove mask padding requirement

2025-10-07 08:23:30 +03:00

277 KiB

Raw Blame History

View Raw

277 KiB Raw Blame History

277 KiB

Raw Blame History