feat: Add s_off as a parameter in the args struct

This may not be necessary, but it more closely mirrors the CUDA kernel Branch: GraniteFourPerf Signed-off-by: Gabe Goodhart <ghart@us.ibm.com>
2025-11-06 09:46:50 +00:00 · 2025-07-15 17:03:38 -06:00
parent 01612b7409
commit ba74a24730
1 changed files with 1 additions and 0 deletions
--- a/ggml/src/ggml-metal/ggml-metal-impl.h
+++ b/ggml/src/ggml-metal/ggml-metal-impl.h
@@ -519,6 +519,7 @@ typedef struct {
    int64_t  n_group;
    int64_t  n_seq_tokens;
    int64_t  n_seqs;
    int64_t  s_off;
    uint64_t nb01;
    uint64_t nb02;
    uint64_t nb03;