CS348Project / llama.cpp
Mirror of https://github.com/ggml-org/llama.cpp.git, synced 2025-10-27 08:21:30 +00:00
Files in llama.cpp / ggml at commit 5143fa895e7725c5bd2135daf7d8f793d98fa91c
Latest commit 5143fa895e by Johannes Gäßler: CUDA: fastdiv, launch bounds for mmvq + q8_1 quant (#15802), 2025-09-05 16:07:02 +02:00
Name            Last commit                                                                 Date
cmake           ggml: Skip backend library linking code when GGML_BACKEND_DL=ON (#15094)   2025-08-07 13:45:41 +02:00
include         ggml: add ops for WAN video model (cuda && cpu) (#15669)                    2025-09-04 10:38:49 +02:00
src             CUDA: fastdiv, launch bounds for mmvq + q8_1 quant (#15802)                 2025-09-05 16:07:02 +02:00
.gitignore      vulkan : cmake integration (#8119)                                          2024-07-13 18:12:39 +02:00
CMakeLists.txt  ggml-cpu : optimize RVV kernels (#15720)                                    2025-09-03 16:16:21 +08:00