llama.cpp/common/common.cpp at 02082f1519565fc7b49de211b28bc5404a69209b

mirror of https://github.com/ggml-org/llama.cpp.git synced 2025-10-30 08:42:00 +00:00

fairydreaming 8fcb563613 Load all MoE experts during warmup (#11571)

* llama : introduce llama_set_warmup() API call that controls warmup mode; use all MoE experts during warmup

* common : use new API to enable warmup mode during model warmup

---------

Co-authored-by: Stanisław Szymczyk <sszymczy@gmail.com>

2025-03-14 13:47:05 +01:00

70 KiB
