Files
llama.cpp/convert-hf-to-gguf.py
Francis Couture-Harpin 961e293833 convert-hf : simplify BitNet pre-quantization
This still results in the exact same tensor weights and scales,
but it reveals some weirdness in the current algorithm.
2024-06-27 02:06:28 -04:00

137 KiB
Executable File