llama.cpp/common/arg.cpp at 3cc1f1f1d24472a6558c942b1c78989ff4b0e569 - llama.cpp - Gitea - Peisong Xiao

CS348Project/llama.cpp

mirror of https://github.com/ggml-org/llama.cpp.git synced 2025-11-10 10:27:03 +00:00

David Huang 7f323a589f Add --no-op-offload to improve -ot pp perf in MoE models like llama4 400B (#13386)

2025-05-11 14:18:39 +02:00

140 KiB
