Mirror of https://github.com/ggml-org/llama.cpp.git
* convert : handle compressed-tensors quant method
* convert : handle int-quantized models
* convert : handle naive-quantized models
* gguf-py : __pos__ is also unary
* convert : fix flake8 lint
* convert : use F32 for dequant of pack-quantized tensors
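To make the last point concrete, below is a minimal sketch (not the actual convert_hf_to_gguf.py code) of dequantizing compressed-tensors style weights back to F32 at conversion time. The tensor names (`weight`, `weight_packed`, `weight_scale`), the 4-bit packing layout, and the signed re-centering are illustrative assumptions only, not the format's guaranteed layout; the point is that unpacking and scaling happen in F32 rather than a lower-precision intermediate.

```python
# Hedged sketch of int-quantized and pack-quantized dequantization in F32.
# Tensor names and the 4-bit packing order are assumptions for illustration.
import numpy as np


def dequant_int_quantized(weight: np.ndarray, scale: np.ndarray) -> np.ndarray:
    # int-quantized: weights stored directly as small integers (e.g. int8),
    # dequantized by a per-row (or per-group) scale, accumulated in F32
    return weight.astype(np.float32) * scale.astype(np.float32)


def dequant_pack_quantized(weight_packed: np.ndarray, scale: np.ndarray,
                           num_bits: int = 4) -> np.ndarray:
    # pack-quantized: several low-bit values packed into each int32 element;
    # unpack first, then dequantize in F32 to avoid precision loss
    vals_per_word = 32 // num_bits
    shifts = np.arange(vals_per_word, dtype=np.int32) * num_bits
    # unpack along the last axis: (..., n) -> (..., n * vals_per_word)
    unpacked = (weight_packed[..., :, None] >> shifts) & ((1 << num_bits) - 1)
    unpacked = unpacked.reshape(*weight_packed.shape[:-1], -1)
    # assumed signed layout: re-center the unsigned nibbles around zero
    unpacked = unpacked.astype(np.int32) - (1 << (num_bits - 1))
    return unpacked.astype(np.float32) * scale.astype(np.float32)


if __name__ == "__main__":
    scale = np.random.rand(4, 1).astype(np.float32)
    w_int = np.random.randint(-8, 8, size=(4, 16), dtype=np.int8)
    w_packed = np.random.randint(0, 2**31, size=(4, 2), dtype=np.int32)
    print(dequant_int_quantized(w_int, scale).dtype)    # float32
    print(dequant_pack_quantized(w_packed, scale).shape)  # (4, 16), float32
```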
473 KiB, Executable File