llama.cpp/ggml at bfd2f21fb43525a8757a8c9e44032fd14bac222b - llama.cpp - Gitea - Peisong Xiao

CS348Project/llama.cpp

mirror of https://github.com/ggml-org/llama.cpp.git synced 2025-11-02 09:12:03 +00:00

Files

History

Francis Couture-Harpin bfd2f21fb4 bitnet : replace 1.58b with b1.58, as in the paper

2024-06-28 20:38:12 -04:00

..

llama : reorganize source code + improve CMake (#8006 )

2024-06-26 18:33:02 +03:00

ggml-quants : 1.625 bpw ternary packing for BitNet 1.58b

2024-06-27 02:06:22 -04:00

bitnet : replace 1.58b with b1.58, as in the paper

2024-06-28 20:38:12 -04:00

CMakeLists.txt

ggml : add GGML_CUDA_USE_GRAPHS option, restore GGML_CUDA_FORCE_CUBLAS (cmake) (#8140 )

2024-06-26 21:34:14 +02:00

ggml_vk_generate_shaders.py

llama : reorganize source code + improve CMake (#8006 )

2024-06-26 18:33:02 +03:00