llama.cpp

mirror of https://github.com/ggml-org/llama.cpp.git synced 2025-11-20 12:07:33 +00:00

Files

Aaron Teo 27131e5f34 ggml-cpu: disable fp32->fp16 nnpa conversions for now

there are some conversion failures in nnpa that require the eyes of an
ibm stsm. will create a separate pr to introduce the fp32->fp16 change.

Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>

2025-06-21 16:58:43 +08:00

cmake

ggml-cpu : rework weak alias on apple targets (#14146)

2025-06-16 13:54:15 +08:00

include

ggml-cpu: add nnpa compile flag

2025-06-21 14:46:41 +08:00

src

ggml-cpu: disable fp32->fp16 nnpa conversions for now

2025-06-21 16:58:43 +08:00

.gitignore

vulkan : cmake integration (#8119)

2024-07-13 18:12:39 +02:00

CMakeLists.txt

ggml-cpu: add nnpa compile flag

2025-06-21 14:46:41 +08:00