batch : auto-gen positions + verify multi-sequence input (#14177)

* batch : verify multi-sequence input batches ggml-ci * cont : auto-gen positions + verify multi-seq input ggml-ci * cont : first print debug info, then perform validation ggml-ci * cont : fix position auto-gen + add comments ggml-ci
2025-10-30 08:42:00 +00:00 · 2025-06-15 09:18:37 +03:00
parent 00ba772610
commit b9912ac570
5 changed files with 155 additions and 26 deletions
--- a/src/llama-cparams.h
+++ b/src/llama-cparams.h
@@ -4,6 +4,7 @@

 #include <cstdint>

+// TODO: rename to something shorter
 #define LLAMA_MAX_PARALLEL_SEQUENCES 64

 struct llama_cparams {