llama.cpp/ggml.c
Francis Couture-Harpin fee3c1d740 llama : allow doing the equivalent of SSM_CONV with SUM_ROWS and MUL
* ggml : allow GGML_OP_CONCAT to work on non-contiguous tensors

The implementation already supported it,
and this makes Mamba's conv step slightly faster.
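The equivalence named in the commit subject can be sketched in plain C. This is only an illustration of the arithmetic, not the ggml or llama.cpp code: the function names (`conv_direct`, `mul_rows`, `sum_rows`, `conv_mul_sum_rows`), the row-major layout, and the scratch buffer are all assumptions made for the example. It assumes a depthwise 1D convolution in which each of `d_inner` channels has its own kernel of width `d_conv`, and shows that each output column is an element-wise multiply of a sliding window with the kernel, followed by a sum over each row.

```c
#include <assert.h>
#include <stddef.h>

/* Direct depthwise 1D convolution (hypothetical reference version).
 * x: [d_inner][d_conv - 1 + n_tokens], w: [d_inner][d_conv],
 * y: [d_inner][n_tokens], all row-major. */
static void conv_direct(const float *x, const float *w, float *y,
                        size_t d_inner, size_t d_conv, size_t n_tokens) {
    assert(d_conv > 0);
    const size_t row_len = d_conv - 1 + n_tokens;
    for (size_t i = 0; i < d_inner; i++) {
        for (size_t t = 0; t < n_tokens; t++) {
            float acc = 0.0f;
            for (size_t j = 0; j < d_conv; j++) {
                acc += x[i * row_len + t + j] * w[i * d_conv + j];
            }
            y[i * n_tokens + t] = acc;
        }
    }
}

/* Element-wise product of two row-strided matrices into a dense buffer. */
static void mul_rows(const float *a, size_t a_stride,
                     const float *b, size_t b_stride,
                     float *out, size_t n_rows, size_t n_cols) {
    for (size_t i = 0; i < n_rows; i++)
        for (size_t j = 0; j < n_cols; j++)
            out[i * n_cols + j] = a[i * a_stride + j] * b[i * b_stride + j];
}

/* Sum each row of a dense matrix; results are written out_stride apart. */
static void sum_rows(const float *a, float *out, size_t out_stride,
                     size_t n_rows, size_t n_cols) {
    for (size_t i = 0; i < n_rows; i++) {
        float acc = 0.0f;
        for (size_t j = 0; j < n_cols; j++) acc += a[i * n_cols + j];
        out[i * out_stride] = acc;
    }
}

/* Same result as conv_direct, expressed as MUL followed by SUM_ROWS.
 * tmp is caller-provided scratch of d_inner * d_conv floats. */
static void conv_mul_sum_rows(const float *x, const float *w, float *y,
                              size_t d_inner, size_t d_conv, size_t n_tokens,
                              float *tmp) {
    assert(d_conv > 0);
    const size_t row_len = d_conv - 1 + n_tokens;
    for (size_t t = 0; t < n_tokens; t++) {
        /* x + t with stride row_len is a strided window of width d_conv. */
        mul_rows(x + t, row_len, w, d_conv, tmp, d_inner, d_conv);
        sum_rows(tmp, y + t, n_tokens, d_inner, d_conv);
    }
}
```

In this sketch the window `x + t` is read through a row stride rather than copied, so the multiply operates on a non-contiguous view of the input. Floating-point results of the two versions may differ in the last bit if the compiler fuses the multiply and add in `conv_direct`.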
2024-06-03 13:54:39 -04:00

738 KiB