model: add Ernie 4.5 MoE support (#14658)

mirror of https://github.com/ggml-org/llama.cpp.git synced 2025-11-14 11:07:10 +00:00

* Add Ernie4.5 MoE

* Fix Flake errors.

* Properly encode/decode MoE layer step

* Correct tensor mappings (.weight)

* Pass and read n_ff_exp

* n_ff_shexp calculation and further minor changes

* Rope fixes.

* .gitignore fix

* Add unit32 cast for Linux builds

* Apply suggestions from code review

Co-authored-by: Sigbjørn Skjæret <sigbjorn.skjaeret@scala.com>

* Further fixes from code review

* Fix trailing whitespace

* Reenable missing experts error

* Code style from code review

Co-authored-by: Sigbjørn Skjæret <sigbjorn.skjaeret@scala.com>

* Fix non-MoE regression

Co-authored-by: Sigbjørn Skjæret <sigbjorn.skjaeret@scala.com>

---------

Co-authored-by: Sigbjørn Skjæret <sigbjorn.skjaeret@scala.com>

This commit is contained in:

Piotr Wilkin (ilintar)

2025-07-17 23:15:32 +02:00

committed by

GitHub

parent d6fb3f6b49

commit cb887f1bc1

7 changed files with 373 additions and 26 deletions

									
										1

src/llama-arch.h
									
												View File
												
				@@ -86,6 +86,7 @@ enum llm_arch {

				    LLM_ARCH_DOTS1,

				    LLM_ARCH_ARCEE,

				    LLM_ARCH_ERNIE4_5,

				    LLM_ARCH_ERNIE4_5_MOE,

				    LLM_ARCH_HUNYUAN_MOE,

				    LLM_ARCH_SMOLLM3,

				    LLM_ARCH_LFM2,

model: add Ernie 4.5 MoE support (#14658)

1 src/llama-arch.h Unescape Escape View File

1

src/llama-arch.h

View File