llama.cpp/examples/quantize/quantize.cpp at deb7240100da99555b9ab9dc635021e591fceaf5

mirror of https://github.com/ggml-org/llama.cpp.git synced 2025-10-30 08:42:00 +00:00

Kawrakow 1d0331c12a quantize: options for output and token embedding tensors qtype (#6239)

* quantize: be able to specify the output tensor type

* quantize: be able to specify the token embedding tensor type

---------

Co-authored-by: Iwan Kawrakow <iwan.kawrakow@gmail.com>

2024-03-22 20:47:14 +02:00

14 KiB
