llama.cpp

mirror of https://github.com/ggml-org/llama.cpp.git synced 2025-11-15 11:17:31 +00:00

Files

l3utterfly 13002a0896 ggml-hexagon: respect input size when getting/setting tensor data (#16836)

* respect input size when getting/setting tensor data

Allows partial repacking/copying when the requested get/set size is smaller than the actual tensor.

* Removed duplicate repack_mxfp4_mxfp4x4x2 function
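
The idea behind respecting the input size can be sketched as follows. This is a minimal illustration only, not the backend's actual code: `get_tensor_partial` and its parameters are hypothetical names, and the real change also deals with repacking quantized data, which is left out here. The sketch clamps the copy to the size the caller asked for rather than always processing the whole tensor.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <cstring>
#include <vector>

// Hypothetical helper (not part of ggml-hexagon): copy at most `size` bytes
// from a tensor buffer of `tensor_nbytes` bytes, starting at byte `offset`.
// Returns the number of bytes actually copied.
static size_t get_tensor_partial(const uint8_t * tensor_data, size_t tensor_nbytes,
                                 void * dst, size_t offset, size_t size) {
    assert(tensor_data != nullptr && dst != nullptr);
    if (offset >= tensor_nbytes) {
        return 0; // nothing to read at or past the end of the tensor
    }
    // Respect the caller's size instead of always copying the full tensor,
    // and never read beyond the end of the tensor buffer.
    const size_t n = std::min(size, tensor_nbytes - offset);
    std::memcpy(dst, tensor_data + offset, n);
    return n;
}
```

A caller that requests 4 bytes from an 8-byte tensor gets exactly 4 bytes back, and a request that runs past the end of the tensor is truncated to the bytes that remain.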

2025-10-30 21:46:31 -07:00

htp

Hexagon Op queue & dispatch optimizations (#16820)

2025-10-29 06:29:12 -07:00

CMakeLists.txt

Add experimental ggml-hexagon backend for the Hexagon NPU (#16547)

2025-10-22 13:47:09 -07:00

ggml-hexagon.cpp

ggml-hexagon: respect input size when getting/setting tensor data (#16836)

2025-10-30 21:46:31 -07:00

htp-utils.c

Add experimental ggml-hexagon backend for the Hexagon NPU (#16547)

2025-10-22 13:47:09 -07:00

htp-utils.h

Add experimental ggml-hexagon backend for the Hexagon NPU (#16547)

2025-10-22 13:47:09 -07:00