Files
llama.cpp/src
Aman Gupta a90eb94ca9 CUDA: fuse rope + set_rows (#16884)
* CUDA: add fused rope

* move k forward_expand up

* create helper function instead of re-using params

* make assert statement more in line with comment

* rope_norm: coalesced writes to global mem
2025-11-13 08:50:01 +08:00
..
2025-09-05 17:32:39 -06:00
2025-09-05 17:32:39 -06:00
2025-10-31 21:20:47 +01:00
2025-10-31 21:20:47 +01:00