llama.cpp/ggml/src/ggml-webgpu/ggml-webgpu.cpp at 647b960bd8017ee882d6633bc2e43e2ae82ee85c

mirror of https://github.com/ggml-org/llama.cpp.git synced 2025-11-09 10:17:06 +00:00

Reese Levine 647b960bd8 ggml webgpu: faster matrix multiplication/matrix-vector multiplication (#17031)

* Faster tensors (#8)

Add fast matrix and matrix/vector multiplication.

* Use map for shader replacements instead of pair of strings

2025-11-07 19:27:20 -08:00
