Mirror of https://github.com/ggml-org/llama.cpp.git, synced 2025-11-07 09:57:00 +00:00
* CUDA: add expert reduce kernel
* contiguity checks, better formatting, use std::vector instead of array
* use vector::empty() instead of size()

Co-authored-by: Johannes Gäßler <johannesg@5d6.de>
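For context, an "expert reduce" step in a mixture-of-experts model combines the outputs of each token's top-k selected experts into a single vector, weighted by the router scores. The sketch below is a minimal illustration of that idea, not the kernel added in this commit; the names (`moe_expert_reduce`, `expert_out`, `weights`, `n_expert_used`, `n_cols`) and the contiguous layout are assumptions made for the example.

```cuda
#include <cuda_runtime.h>

// Hypothetical sketch of a MoE expert-reduce kernel: for each token, sum the
// outputs of its n_expert_used selected experts, weighted by router scores.
// Assumes contiguous row-major layouts; does not mirror the llama.cpp code.
__global__ void moe_expert_reduce(
        const float * expert_out,   // [n_tokens, n_expert_used, n_cols]
        const float * weights,      // [n_tokens, n_expert_used]
        float       * dst,          // [n_tokens, n_cols]
        const int     n_expert_used,
        const int     n_cols) {
    const int token = blockIdx.x;   // one block per token

    for (int col = threadIdx.x; col < n_cols; col += blockDim.x) {
        float sum = 0.0f;
        for (int e = 0; e < n_expert_used; ++e) {
            const int64_t w_idx = (int64_t) token * n_expert_used + e;
            const int64_t x_idx = w_idx * n_cols + col;
            sum += weights[w_idx] * expert_out[x_idx];
        }
        dst[(int64_t) token * n_cols + col] = sum;
    }
}

// Example launch: one block per token, 256 threads striding over columns.
//   moe_expert_reduce<<<n_tokens, 256, 0, stream>>>(expert_out, weights, dst,
//                                                   n_expert_used, n_cols);
```

Fusing this weighted sum into one kernel avoids materializing the per-expert outputs through a separate scale-and-add pass, which is the usual motivation for a dedicated reduce kernel.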