vulkan : implement Stable Diffusion operators (ggml/904)

mirror of https://github.com/ggml-org/llama.cpp.git synced 2025-10-30 08:42:00 +00:00

* Fix Vulkan repeat op

* Implement Vulkan concat op

* Delete old Vulkan shader generator

* Implement Vulkan im2col op

* Implement Vulkan unary gelu_quick op

* Implement Vulkan group_norm op

* Implement Vulkan timestep_embedding op

* Implement Vulkan upscale op

* Fix Vulkan vk_context tensor extra index issue

* Fix Vulkan matmul shader parameter bug

* Properly fix Vulkan matmul shader parameter bug

* Add Vulkan ADD f16 + f32 -> f16 operator support

* Implement Vulkan tanh op

* Fix Vulkan group count too large Validation error on non-Nvidia GPUs

* Throw error when too much memory is requested

* Fix another Vulkan group count too large Validation error on non-Nvidia GPUs

* Fix matmul MMQ condition

* Implement Vulkan pad op

* Fix Vulkan crash when tensor is used multiple times in a compute graph

* Add Vulkan CONCAT f16 + f16 -> f16 op

* Add Vulkan LEAKY_RELU op
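
The two "group count too large" fixes above concern the per-dimension dispatch limit that Vulkan reports in `VkPhysicalDeviceLimits::maxComputeWorkGroupCount`; the specification only guarantees 65535 per dimension, so a large one-dimensional dispatch can pass on one vendor's driver and fail validation on another. The diff for `ggml/src/ggml-vulkan.cpp` is not shown here, so the following is only a minimal sketch of one common workaround, not the code from this commit: spread a flat workgroup count over three dimensions. The helper name `split_group_count` and its default limit are assumptions made for illustration.

```cpp
#include <algorithm>
#include <array>
#include <cstdint>
#include <stdexcept>

// Hypothetical helper, NOT taken from ggml-vulkan.cpp.
// Splits a flat workgroup count into (x, y, z) so that no dimension exceeds
// `limit`. In real code `limit` would come from
// VkPhysicalDeviceLimits::maxComputeWorkGroupCount; 65535 is the minimum
// value the Vulkan specification guarantees for each dimension.
std::array<uint32_t, 3> split_group_count(uint64_t total, uint32_t limit = 65535) {
    if (total == 0 || limit == 0) {
        throw std::invalid_argument("total and limit must be non-zero");
    }
    auto ceil_div = [](uint64_t a, uint64_t b) { return (a + b - 1) / b; };

    // Fill x first, then y, then z; each step rounds up, so the product
    // x * y * z is always >= total.
    const uint64_t x = std::min<uint64_t>(total, limit);
    const uint64_t rest = ceil_div(total, x);
    const uint64_t y = std::min<uint64_t>(rest, limit);
    const uint64_t z = ceil_div(rest, y);
    if (z > limit) {
        throw std::length_error("dispatch does not fit in three dimensions");
    }
    return {static_cast<uint32_t>(x), static_cast<uint32_t>(y), static_cast<uint32_t>(z)};
}
```

Because each step rounds up, the dispatched grid can contain more workgroups than requested. A shader used with such a split would have to rebuild the flat index from the three workgroup IDs and return early when it is out of range.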

0cc4m, 2024-08-04 17:28:08 +02:00, committed by Georgi Gerganov

parent 655858ace0

commit a3738b2fa7

28 changed files with 1032 additions and 293 deletions

840 ggml/src/ggml-vulkan.cpp (file diff suppressed because it is too large)
