llama.cpp/gguf-py/gguf/quants.py at c6d4cb46559b359d2682cf2a002e7fe01bb7a767

mirror of https://github.com/ggml-org/llama.cpp.git synced 2025-11-15 11:17:31 +00:00

compilade 4134999e01 gguf-py : Numpy dequantization for most types (#8939)

* gguf-py : Numpy dequantization for most types

* gguf-py : Numpy dequantization for grid-based i-quants

2024-08-11 14:45:41 -04:00
