CUDA graphs require parameter updates to kernels associated with GGML_OP_CPY nodes. Previously, the implementation only checked for a single CUDA kernel in such nodes, which caused a bug in cases where two such kernels exist. This fixes the issue by using a vector, allowing multiple function pointers to be stored and checked against. Fixes #7942
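
To illustrate the shape of the change, here is a minimal C++ sketch of moving from a single stored kernel function pointer to a vector of pointers that captured copy kernels are checked against. The names (`graph_state`, `record_cpy_kernel`, `is_known_cpy_kernel`) are hypothetical and do not correspond to llama.cpp's actual identifiers; this is a sketch of the technique, not the real implementation.

```cpp
#include <algorithm>
#include <vector>

struct graph_state {
    // Before the fix: a single slot, e.g. `void * cpy_fn_ptr;`, which could
    // only represent one copy kernel per graph.
    // After the fix: every distinct copy kernel seen while building the graph
    // is recorded, so any number of GGML_OP_CPY variants can be handled.
    std::vector<void *> cpy_fn_ptrs;
};

// Record the kernel function pointer used by a GGML_OP_CPY node,
// skipping duplicates.
void record_cpy_kernel(graph_state & st, void * fn_ptr) {
    if (std::find(st.cpy_fn_ptrs.begin(), st.cpy_fn_ptrs.end(), fn_ptr) ==
            st.cpy_fn_ptrs.end()) {
        st.cpy_fn_ptrs.push_back(fn_ptr);
    }
}

// When updating a captured CUDA graph, a kernel node needs its parameters
// patched only if its function pointer matches one of the recorded copy
// kernels; with a vector, all of them match, not just the first one seen.
bool is_known_cpy_kernel(const graph_state & st, void * fn_ptr) {
    return std::find(st.cpy_fn_ptrs.begin(), st.cpy_fn_ptrs.end(), fn_ptr) !=
           st.cpy_fn_ptrs.end();
}
```

The key point is that the membership test (`is_known_cpy_kernel`) degrades gracefully: with one recorded kernel it behaves like the old single-pointer comparison, and with two or more it still patches every matching node instead of silently missing the second kernel.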