Mirror of https://github.com/ggml-org/llama.cpp.git, synced 2025-11-15 11:17:31 +00:00
BF16 requires special handling in this script because it is a 2-byte data type, while the view defaults to 1 byte. Switch to the correct view before attempting the byteswap. With this change, byteswapping models such as Meta-Llama-3-8B-Instruct-bf16-GGUF should work correctly.
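A minimal NumPy sketch of the issue (illustrative byte values, not taken from an actual model file): calling `byteswap()` on a 1-byte `uint8` view is a no-op, so BF16 data must first be reinterpreted as 2-byte elements.

```python
import numpy as np

# Raw BF16 tensor bytes as they might appear in a little-endian GGUF file.
# Each BF16 value occupies 2 bytes, but a raw buffer defaults to uint8.
raw = np.frombuffer(bytes([0x80, 0x3F, 0x00, 0x40]), dtype=np.uint8).copy()

# Wrong: byteswapping 1-byte elements is a no-op; the data is unchanged.
noop = raw.byteswap()
assert bytes(noop) == bytes([0x80, 0x3F, 0x00, 0x40])

# Right: reinterpret the buffer as 2-byte elements, then swap within each.
swapped = raw.view(np.uint16).byteswap()
assert bytes(swapped.view(np.uint8)) == bytes([0x3F, 0x80, 0x40, 0x00])
```

The same principle applies to any multi-byte type: the element width of the view, not the logical type of the tensor, determines what `byteswap()` actually swaps.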