llama.cpp/src/llama-graph.cpp at 2776db6c810cc08b44b68326204a6c6a228ad4ff

mirror of https://github.com/ggml-org/llama.cpp.git synced 2025-11-18 11:46:58 +00:00

Files

Aman Gupta a90eb94ca9 CUDA: fuse rope + set_rows (#16884 )

* CUDA: add fused rope

* move k forward_expand up

* create helper function instead of re-using params

* make assert statement more in line with comment

* rope_norm: coalesced writes to global mem

2025-11-13 08:50:01 +08:00

68 KiB

Raw Blame History

View Raw

68 KiB Raw Blame History

68 KiB

Raw Blame History