llama.cpp/tests/test-backend-ops.cpp at 691698e1524f227f5ecb185536e69dc46fa9a533

mirror of https://github.com/ggml-org/llama.cpp.git synced 2025-10-31 08:51:55 +00:00

Files

Diego Devesa a5e47592b6 cuda : optimize argmax (#10441 )

* cuda : optimize argmax

* remove unused parameter

ggml-ci

* fixup : use full warps

ggml-ci

* Apply suggestions from code review

Co-authored-by: Johannes Gäßler <johannesg@5d6.de>

* fix ub

* ggml : check ne00 <= INT32_MAX in argmax and argsort

---------

Co-authored-by: Johannes Gäßler <johannesg@5d6.de>

2024-11-21 18:18:50 +01:00

143 KiB

Raw Blame History

View Raw

143 KiB Raw Blame History

143 KiB

Raw Blame History