[SYCL] Fix the sub group size of Intel (#8106)

* use warp_size macro for all sycl kernels
* fix mask of permute_sub_group_by_xor
* fix rms_norm with correct warp number
* fix rms_norm_f32/group_norm_f32
* move norm to norm.cpp file
* fix quantize bug
* fix mmvq's batch size
2025-11-10 10:27:03 +00:00 · 2024-07-02 02:16:00 +00:00
parent 5fac350b9c
commit d08c20edde
9 changed files with 587 additions and 509 deletions
--- a/ggml/src/ggml-sycl/backend.hpp
+++ b/ggml/src/ggml-sycl/backend.hpp
@@ -20,5 +20,6 @@
 #include "mmq.hpp"
 #include "mmvq.hpp"
 #include "rope.hpp"
+#include "norm.hpp"

 #endif // GGML_SYCL_BACKEND_HPP