llama.cpp/ggml-cuda.cu at 68a6b98b3c8af7e5baade3ee45fe1d2c7b9323a9

mirror of https://github.com/ggml-org/llama.cpp.git synced 2025-11-02 09:12:03 +00:00


slaren 3a9cb4ca64 cuda, metal : fix nans in soft_max (#5574)

* cuda : fix nans in soft_max

* metal : fix nans in soft_max

---------

Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>

2024-02-19 10:04:45 +02:00
