llama.cpp/convert_hf_to_gguf.py at 4b9f4cb0f89a88de4bdf97727d0457b0c648804c

mirror of https://github.com/ggml-org/llama.cpp.git synced 2025-11-07 09:57:00 +00:00

Gabe Goodhart 1d0125bcf1 feat: Add conversion support in GraniteHybrid for non-hybrid (all attn) (#16177)

This is a configuration of the hparams in the GraniteHybrid architecture
that devolves to the Granite (or GraniteMoe) architecture (i.e. Granite 3.x).
It may be used for some models in the Granite 4 family, with the
GraniteHybrid architecture acting as a superset arch. Rather than support
it directly in the C++ graph, we simply coerce the architecture flag back
to the correct "granite" or "granitemoe" architecture.

Branch: gabe-l-hart/GraniteNonHybridConversion

Signed-off-by: Gabe Goodhart <ghart@us.ibm.com>

Co-authored-by: Sigbjørn Skjæret <sigbjorn.skjaeret@scala.com>

2025-09-22 20:40:10 +02:00
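
The coercion described in the commit message can be sketched roughly as follows. This is a minimal illustration, not the code from convert_hf_to_gguf.py: the function name `pick_architecture` is hypothetical, and the hparams keys `layer_types` and `num_local_experts`, the layer label "attention", and the architecture string "granitehybrid" are assumptions about how such a config might look. Only the "granite" and "granitemoe" names come from the commit message.

```python
from typing import Any, Mapping


def pick_architecture(hparams: Mapping[str, Any]) -> str:
    """Hypothetical helper: choose the architecture string to write out.

    Assumed hparams keys (not confirmed by the source):
      - "layer_types": list of per-layer labels, e.g. "attention" or "mamba"
      - "num_local_experts": number of MoE experts (0 or missing for dense)
    """
    layer_types = hparams.get("layer_types") or []

    # Only coerce when every layer is explicitly an attention layer.
    all_attention = bool(layer_types) and all(t == "attention" for t in layer_types)
    if not all_attention:
        # Assumed name for the hybrid (superset) architecture.
        return "granitehybrid"

    # All-attention configs fall back to the non-hybrid architectures.
    num_experts = hparams.get("num_local_experts") or 0
    return "granitemoe" if num_experts > 0 else "granite"


if __name__ == "__main__":
    dense = {"layer_types": ["attention"] * 4}
    moe = {"layer_types": ["attention"] * 4, "num_local_experts": 8}
    hybrid = {"layer_types": ["mamba", "attention", "mamba", "mamba"]}
    for name, cfg in (("dense", dense), ("moe", moe), ("hybrid", hybrid)):
        print(name, pick_architecture(cfg))
```

Under these assumptions the decision lives entirely in the converter, so the C++ graph code for "granite" and "granitemoe" can be reused unchanged, which matches the rationale given in the commit message.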

419 KiB

Executable File
