llama.cpp/src
Francis Couture-Harpin 4b6fb6524b context : round n_tokens to next multiple of n_seqs when reserving
This fixes RWKV inference which fails when ubatch.n_seq_tokens is 0.
2025-06-11 16:19:17 -04:00
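The rounding described in the commit message can be sketched as follows. This is a minimal illustration and not the code from the commit: the helper names `round_up_to_multiple` and `tokens_per_seq` are hypothetical, and it assumes that `n_seq_tokens` is obtained by integer division of `n_tokens` by `n_seqs`, which would yield 0 whenever `n_tokens` is smaller than `n_seqs`.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical helper (not part of llama.cpp): round n_tokens up to the
// next multiple of n_seqs. Values that are already a multiple are unchanged.
static uint32_t round_up_to_multiple(uint32_t n_tokens, uint32_t n_seqs) {
    assert(n_seqs > 0); // a zero sequence count is not meaningful here
    return ((n_tokens + n_seqs - 1) / n_seqs) * n_seqs;
}

// Hypothetical helper: tokens per sequence, assuming plain integer division.
// Without rounding, this truncates to 0 when n_tokens < n_seqs.
static uint32_t tokens_per_seq(uint32_t n_tokens, uint32_t n_seqs) {
    assert(n_seqs > 0);
    return n_tokens / n_seqs;
}
```

Under these assumptions, `tokens_per_seq(1, 4)` truncates to 0, whereas rounding first with `round_up_to_multiple(1, 4)` gives 4 tokens and therefore 1 token per sequence.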