llama.cpp/convert-hf-to-gguf.py at 961e2938333ce6e1fa723a7be09e984093950864

mirror of https://github.com/ggml-org/llama.cpp.git synced 2025-11-08 10:07:01 +00:00

Files

Francis Couture-Harpin 961e293833 convert-hf : simplify BitNet pre-quantization

This still results in the exact same tensor weights and scales,
but it reveals some weirdness in the current algorithm.

2024-06-27 02:06:28 -04:00

View Raw