llama.cpp/gguf-py/gguf/tensor_mapping.py at a17c4f7d75f3dfdcf2de04ef5e85c8764313d3d4

mirror of https://github.com/ggml-org/llama.cpp.git synced 2025-10-30 08:42:00 +00:00

Xuan-Son Nguyen 8f22dc0a53 model : add hunyuan moe (#14425)

* model : add hunyuan moe

* tokenizer ok

* fix tensor name

* cgraph init

* chat template

* wip

* almost working

* skip embed, fix bos

* cleanup

* yarn scaling

* cleanup

* correct rope type

* failed token fix

* ntk alpha freq_base

* tokenization working

* cleanup and pr changes

* vocab_size sanity check

* ntk alpha generic

* Update convert_hf_to_gguf.py

* Apply suggestions from code review

* fix regression

* fix style

---------

Co-authored-by: kooshi <1934337+kooshi@users.noreply.github.com>

2025-07-08 11:24:06 +03:00

57 KiB
