llama.cpp/tools at eeee367de51fb34d46c8103fc0ae827e84d94470 - llama.cpp - Gitea - Peisong Xiao

CS348Project/llama.cpp

mirror of https://github.com/ggml-org/llama.cpp.git synced 2025-11-12 10:47:01 +00:00

Files

History

Aidan eeee367de5 server: fix correct time_ms calculation in prompt_progress (#17093 )

* fix: correct time_ms calculation in send_partial_response

The time_ms field was incorrectly calculated. The division was happening
before the subtraction leading to incorrect values.

Before: (ggml_time_us() - slot.t_start_process_prompt / 1000) After:
(ggml_time_us() - slot.t_start_process_prompt) / 1000

* docs : document time_ms field in prompt_progress

2025-11-08 15:12:11 +02:00

..

scripts : add script to bench models (#16894 )

2025-11-02 00:15:31 +02:00

cvector-generator

cmake : Do not install tools on iOS targets (#15903 )

2025-09-16 09:54:44 +07:00

cmake : Do not install tools on iOS targets (#15903 )

2025-09-16 09:54:44 +07:00

ci : use smaller model (#16168 )

2025-09-22 09:11:39 +03:00

Manually link -lbsd to resolve flock symbol on AIX (#16610 )

2025-10-23 19:37:31 +08:00

bench : cache the llama_context state at computed depth (#16944 )

2025-11-07 21:23:11 +02:00

llama-cli: prevent spurious assistant token (#16202 )

2025-09-29 10:03:12 +03:00

hparams : add n_embd_inp() to support extended embed (#16928 )

2025-11-07 19:27:58 +01:00

perplexity : show more kl-divergence data (#16321 )

2025-09-29 09:30:45 +03:00

ci : use smaller model (#16168 )

2025-09-22 09:11:39 +03:00

rpc : report actual free memory (#16616 )

2025-10-17 18:02:52 +03:00

Manually link -lbsd to resolve flock symbol on AIX (#16610 )

2025-10-23 19:37:31 +08:00

server: fix correct time_ms calculation in prompt_progress (#17093 )

2025-11-08 15:12:11 +02:00

cmake : Do not install tools on iOS targets (#15903 )

2025-09-16 09:54:44 +07:00

model : Apertus model implementation (#15852 )

2025-10-02 20:43:22 +03:00

CMakeLists.txt

mtmd : rename llava directory to mtmd (#13311 )

2025-05-05 16:02:55 +02:00