llama.cpp/gguf-py/gguf/quants.py at ad76569f8e78ab6ca921bda25cef25a157361719

mirror of https://github.com/ggml-org/llama.cpp.git synced 2025-11-01 09:01:57 +00:00

Files

compilade 4134999e01 gguf-py : Numpy dequantization for most types (#8939 )

* gguf-py : Numpy dequantization for most types

* gguf-py : Numpy dequantization for grid-based i-quants

2024-08-11 14:45:41 -04:00

View Raw