Files
llama.cpp/ggml/src/ggml-vulkan/vulkan-shaders
Jeff Bolz 98197e5c98 vulkan: optimizations for deepseek prompt processing (#14555)
* vulkan: allow unclamped loads in coopmat2 mul_mat_id shader

* vulkan: increase coopmat2 mul_mat_id tile size

* vulkan: optimize mat_mul_id row_ids search to batch loads, and port to coopmat1 path

* vulkan: use smaller FA row size when head size is large. applies to both scalar and CM2 paths (CM1 isn't used due to shared memory limits)
2025-07-12 11:51:58 +02:00
..
2025-05-02 20:54:30 +03:00
2025-07-01 10:14:21 +02:00
2025-02-28 07:52:51 +01:00
2025-07-09 18:16:12 +02:00