llama.cpp

mirror of https://github.com/ggml-org/llama.cpp.git synced 2025-10-27 08:21:30 +00:00

Files

Georgi Gerganov 6a2c6145a0 metal : extend mat-mat multiplication support (#16225)

* metal : support mul_mm with src1->type == GGML_TYPE_F16

* metal : support mul_mm_id with src1->type == GGML_TYPE_F16

[no ci]

* metal : mul_mm support ne00 % 32 != 0

* metal : support mul_mm_id with ne00 % 32 != 0

* cont : remove unnecessary unrolls

* cont : simplify data loading

* metal : optimize mul_mm when output bounds checks are not needed

2025-09-28 09:34:44 +03:00

cmake

ggml: Skip backend library linking code when GGML_BACKEND_DL=ON (#15094)

2025-08-07 13:45:41 +02:00

include

llama: print memory breakdown on exit (#15860)

2025-09-24 16:53:48 +02:00

src

metal : extend mat-mat multiplication support (#16225)

2025-09-28 09:34:44 +03:00

.gitignore

vulkan : cmake integration (#8119)

2024-07-13 18:12:39 +02:00

CMakeLists.txt

common : use cpp-httplib as a cURL alternative for downloads (#16185)

2025-09-26 14:12:19 +03:00