Files
llama.cpp/ggml/src/ggml-vulkan/vulkan-shaders/flash_attn.comp
Jeff Bolz 94e82c7ead vulkan: clamp matmul and FA results to the max finite value (#15652)
* vulkan: clamp matmul and FA results to the max finite value

* only clamp for fp16
2025-08-31 08:27:57 +02:00

12 KiB