CS348Project/llama.cpp
Mirror of https://github.com/ggml-org/llama.cpp.git, synced 2025-11-16 11:27:03 +00:00
llama.cpp/ggml/src/ggml-cuda/vendors at commit 5a0e3ef6f00c658fbae53797f02d5a360ebf8fec
Latest commit: 5eff6ec9b1 by Johannes Gäßler, 2025-08-25 17:23:40 +02:00
CUDA: MoE helper in device code, better tile sizes (#15525)
* CUDA: MoE helper in device code, better tile sizes
* reduce superfluous CUDA blocks
File     Last commit                                                   Date
cuda.h   llama : add gpt-oss (#15091)                                  2025-08-05 22:10:36 +03:00
hip.h    CUDA: MoE helper in device code, better tile sizes (#15525)   2025-08-25 17:23:40 +02:00
musa.h   CUDA: GEMM for FP32/FP16/BF16 and ne11 <= 16 (#15131)         2025-08-07 10:53:21 +02:00