* gguf-py: implement byteswapping for Q4_0

  This is needed to byteswap Mistral models. Also restore the original tensor shapes after byteswapping. That is not needed at the moment, but is done in case the shapes are used in the future.

* Rework the byteswapping code in gguf-py

  Move the details out of the code that byteswaps tensor blocks.
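For reference, a minimal sketch of the kind of per-block swap this implies, assuming the standard GGUF Q4_0 block layout (a 2-byte fp16 scale followed by 16 bytes of packed 4-bit quants, 32 weights per block). The function name `byteswap_q4_0_blocks` and the numpy approach are illustrative only, not the code from this change:

```python
import numpy as np

Q4_0_BLOCK_NBYTES = 18  # 2-byte fp16 scale + 16 bytes of packed 4-bit quants

def byteswap_q4_0_blocks(data: np.ndarray) -> np.ndarray:
    """Byteswap a flat uint8 buffer of Q4_0 blocks in place and return it.

    Hypothetical helper for illustration; only the fp16 scale spans multiple
    bytes, so swapping its two bytes per block is all that is required. The
    packed 4-bit quants are byte-oriented and stay as-is.
    """
    assert data.dtype == np.uint8
    assert data.size % Q4_0_BLOCK_NBYTES == 0
    blocks = data.reshape(-1, Q4_0_BLOCK_NBYTES)
    # Swap the two scale bytes of every block.
    blocks[:, [0, 1]] = blocks[:, [1, 0]]
    return data
```

The restored-shape point in the commit message is separate from this: after the flat per-block swap, the tensor would be reshaped back to its original dimensions so downstream consumers see the same layout as before.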