llama.cpp

mirror of https://github.com/ggml-org/llama.cpp.git synced 2025-11-16 11:27:03 +00:00

Files

Johannes Gäßler 5eff6ec9b1 CUDA: MoE helper in device code, better tile sizes (#15525 )

* CUDA: MoE helper in device code, better tile sizes

* reduce superfluous CUDA blocks

2025-08-25 17:23:40 +02:00

cuda.h

2025-08-05 22:10:36 +03:00

hip.h

2025-08-25 17:23:40 +02:00

musa.h

2025-08-07 10:53:21 +02:00