llama.cpp/common/common.cpp at eba92d64c3f6d86de2e6b4dd3a540d2805a22b0c

mirror of https://github.com/ggml-org/llama.cpp.git synced 2025-11-03 09:22:01 +00:00

Files

fairydreaming 8fcb563613 Load all MoE experts during warmup (#11571 )

* llama : introduce llama_set_warmup() API call that controls warmup mode; use all MoE experts during warmup

* common : use new API to enable warmup mode during model warmup

---------

Co-authored-by: Stanisław Szymczyk <sszymczy@gmail.com>

2025-03-14 13:47:05 +01:00

70 KiB

Raw Blame History

View Raw

70 KiB Raw Blame History

70 KiB

Raw Blame History