llama.cpp

mirror of https://github.com/ggml-org/llama.cpp.git synced 2025-10-27 08:21:30 +00:00

Files

Erik Scholz a81283820a gguf: gguf_writer refactor (#15691 )

* gguf: split gguf writer into base and buf impl
* gguf: templated gguf write out
* gguf: file based writer (avoid writing everything to memory first!)
* examples(llama2c): fix log not being the same level and compiler nits

2025-09-05 11:34:28 +02:00

cmake

ggml: Skip backend library linking code when GGML_BACKEND_DL=ON (#15094 )

2025-08-07 13:45:41 +02:00

include

ggml: add ops for WAN video model (cuda && cpu) (#15669 )

2025-09-04 10:38:49 +02:00

src

gguf: gguf_writer refactor (#15691 )

2025-09-05 11:34:28 +02:00

.gitignore

vulkan : cmake integration (#8119 )

2024-07-13 18:12:39 +02:00

CMakeLists.txt

ggml-cpu : optimize RVV kernels (#15720 )

2025-09-03 16:16:21 +08:00