llama.cpp/ggml.c at fee3c1d740c0e027c81e2f2f3fb48d619857175f

mirror of https://github.com/ggml-org/llama.cpp.git synced 2025-11-11 10:36:54 +00:00

Francis Couture-Harpin fee3c1d740 llama : allow doing the equivalent of SSM_CONV with SUM_ROWS and MUL

* ggml : allow GGML_OP_CONCAT to work on non-contiguous tensors

The implementation already supported it,
and this makes Mamba's conv step slightly faster.

2024-06-03 13:54:39 -04:00

738 KiB
