llama.cpp

mirror of https://github.com/ggml-org/llama.cpp.git synced 2025-10-27 08:21:30 +00:00

Files

Pascal 1faa13a118 webui: updated the chat service to only include max_tokens in the req… (#16489)

* webui: updated the chat service to only include max_tokens in the request payload when the setting is explicitly provided, while still mapping explicit zero or null values to the infinite-token sentinel

* chore: update webui build output
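
The payload rule described in the first bullet can be sketched roughly as follows. This is an illustration only, not the webui's actual code: `buildPayload`, `ChatSettings`, `ChatRequestPayload`, and `INFINITE_TOKENS` are hypothetical names, and the sentinel value of -1 is an assumption.

```typescript
// Hypothetical sentinel, assumed here to be -1 ("no token limit").
const INFINITE_TOKENS = -1;

// Hypothetical settings shape: undefined means the user never set a value;
// null or 0 means the user explicitly asked for no limit.
interface ChatSettings {
  maxTokens?: number | null;
}

interface ChatMessage {
  role: string;
  content: string;
}

interface ChatRequestPayload {
  messages: ChatMessage[];
  max_tokens?: number;
}

function buildPayload(
  messages: ChatMessage[],
  settings: ChatSettings
): ChatRequestPayload {
  const payload: ChatRequestPayload = { messages };

  // Only add max_tokens when the setting was explicitly provided.
  if (settings.maxTokens !== undefined) {
    payload.max_tokens =
      settings.maxTokens === null || settings.maxTokens === 0
        ? INFINITE_TOKENS // explicit zero or null maps to the sentinel
        : settings.maxTokens;
  }

  return payload;
}
```

With an unset value the `max_tokens` key is omitted entirely, so the server's own default applies. An explicit zero or null is still sent, as the sentinel.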

2025-10-09 22:54:57 +02:00

index.html.gz

webui: updated the chat service to only include max_tokens in the req… (#16489)

2025-10-09 22:54:57 +02:00

loading.html

llama : move end-user examples to tools directory (#13249)

2025-05-02 20:27:13 +02:00