llama.cpp/examples/eval-callback/eval-callback.cpp at a8bca68f727844e7dcf24a956003b3c2039ea563

mirror of https://github.com/ggml-org/llama.cpp.git synced 2025-10-30 08:42:00 +00:00

Files

Gabe Goodhart a8bca68f72 fix: Compute the full sum in llama-eval-callback, not just the sum of printed values (#15637 )

This makes it much easier to compare between llama.cpp and transformers!

https://github.com/ggml-org/llama.cpp/issues/nemotron-nano-15409
Branch: gabe-l-hart/nvidia-nemotron-nano-15409

Signed-off-by: Gabe Goodhart <ghart@us.ibm.com>

2025-08-28 15:27:36 -05:00

6.8 KiB

Raw Blame History

View Raw

6.8 KiB Raw Blame History

6.8 KiB

Raw Blame History