llama.cpp

mirror of https://github.com/ggml-org/llama.cpp.git synced 2025-10-27 08:21:30 +00:00

Files

slaren 7a11eb3a26 cuda : fix dmmv cols requirement to 2*GGML_CUDA_DMMV_X (#8800 )

* cuda : fix dmmv cols requirement to 2*GGML_CUDA_DMMV_X

* update asserts

* only use dmmv for supported types

* add test

2024-08-01 15:26:22 +02:00

2024-06-26 18:33:02 +03:00

2024-07-28 01:41:25 +02:00

2024-08-01 15:26:22 +02:00

.gitignore

2024-07-13 18:12:39 +02:00

CMakeLists.txt

cann: update cmake (#8765 )

2024-07-30 12:37:35 +02:00