llama.cpp/ggml-cuda.cu at b18c66ca6eee4fe0465cff5042daf05005dc9ab2

mirror of https://github.com/ggml-org/llama.cpp.git synced 2025-11-01 09:01:57 +00:00

Files

slaren 8a052c131e ggml-cuda : support stablelm rope (#4156 )

* ggml-cuda : support stablelm rope

* remove unused freq_base kernel parameter

* add n_dims parameter to llm_build_k_shift, default to n_rot via overload

* llama : fix llm_build_k_shift args

---------

Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>

2023-11-24 18:04:31 +01:00

311 KiB

Raw Blame History

View Raw

311 KiB Raw Blame History

311 KiB

Raw Blame History