llama.cpp/getrows.cuh at a8bd14d55717754a1f48313a846a2b16fa998ad2 - llama.cpp - Gitea - Peisong Xiao

CS348Project/llama.cpp

mirror of https://github.com/ggml-org/llama.cpp.git synced 2025-10-28 08:31:25 +00:00

Files

slaren ae1f211ce2 cuda : refactor into multiple files (#6269 )

2024-03-25 13:50:23 +01:00

6 lines

141 B

Plaintext

Raw Blame History

 #include "common.cuh"
 #define CUDA_GET_ROWS_BLOCK_SIZE 256
 void ggml_cuda_op_get_rows(ggml_backend_cuda_context & ctx, ggml_tensor * dst);