llama.cpp/ggml-vulkan.cpp at 895407f31b358e3d9335e847d13f033491ec8a5b

mirror of https://github.com/ggml-org/llama.cpp.git synced 2025-10-30 08:42:00 +00:00

Files

Sergio López c88c74f967 vulkan: only use M-sized matmul on Apple GPUs (#5412 )

* vulkan: refactor guess_matmul_pipeline for vendor

Refactor ggml_vk_guess_matmul_pipeline to simplify adding per-vendor
conditionals.

Signed-off-by: Sergio Lopez <slp@redhat.com>

* vulkan: only use M-sized matmul on Apple GPUs

L-sized and S-sized matmuls are broken on Apple GPUs, force using
M-size with this vendor.

Signed-off-by: Sergio Lopez <slp@redhat.com>

---------

Signed-off-by: Sergio Lopez <slp@redhat.com>

2024-02-11 15:12:00 +01:00

252 KiB

Raw Blame History

View Raw

252 KiB Raw Blame History

252 KiB

Raw Blame History