llama.cpp/ggml/src/ggml-vulkan/ggml-vulkan.cpp at 16bc059660c1c59e566628201c0ca2c20c9f4bc3

mirror of https://github.com/ggml-org/llama.cpp.git synced 2025-11-02 09:12:03 +00:00

Jeff Bolz ba1ceb3456 vulkan: fix noncontig check for mat_mul_id splitting (#14683)

* vulkan: fix noncontig check for mat_mul_id splitting

Remove supports_op check for > 4096 (splitting fixes this)

* vulkan: fix batched matmul dequant for Q*_K

2025-07-15 21:51:09 +02:00

570 KiB
