mirror of
https://github.com/ggml-org/llama.cpp.git
synced 2025-11-12 10:47:01 +00:00
[SYCL] refactor (#6408)
* separate lower precision GEMM from the main files
* fix workgroup size hardcode
3031
ggml-sycl/mmq.cpp
Normal file
File diff suppressed because it is too large