mirror of https://github.com/ggml-org/llama.cpp.git
server : add documentation for parallel_tool_calls param (#15647)
Co-authored-by: Pierre F <no@p.e>
@@ -21,6 +21,8 @@ Function calling is supported for all models (see https://github.com/ggml-org/ll
 - Use `--chat-template-file` to override the template when appropriate (see examples below)
 - Generic support may consume more tokens and be less efficient than a model's native format.
 
+- Multiple/parallel tool calling is supported on some models but disabled by default, enable it by passing `"parallel_tool_calls": true` in the completion endpoint payload.
+
 <details>
 <summary>Show some common templates and which format handler they use</summary>
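For context, here is a minimal sketch of how the `parallel_tool_calls` flag documented by the added line above might be used against `llama-server`'s OpenAI-compatible `/v1/chat/completions` endpoint. The host, port, model name, and `get_weather` tool are illustrative assumptions and are not part of the commit; the server is assumed to be running locally and started with `--jinja`.

```python
# Sketch only: opt in to multiple/parallel tool calls on a locally running
# llama-server (assumed started with `--jinja` and listening on port 8080).
import json
import requests

payload = {
    "model": "local-model",  # placeholder; the server uses whichever model it was launched with
    "messages": [
        {"role": "user", "content": "What is the weather in Paris and in Tokyo?"}
    ],
    "tools": [
        {
            "type": "function",
            "function": {
                "name": "get_weather",  # hypothetical tool for illustration
                "description": "Get the current weather for a city",
                "parameters": {
                    "type": "object",
                    "properties": {"city": {"type": "string"}},
                    "required": ["city"],
                },
            },
        }
    ],
    # Disabled by default; setting it to true allows the model to return
    # several tool calls in a single response (where the model supports it).
    "parallel_tool_calls": True,
}

resp = requests.post("http://localhost:8080/v1/chat/completions", json=payload, timeout=60)
resp.raise_for_status()
message = resp.json()["choices"][0]["message"]
print(json.dumps(message.get("tool_calls", []), indent=2))
```

With `parallel_tool_calls` omitted (the default), per the documentation added here, multiple/parallel tool calling stays disabled, so the request above would be expected to yield at most one tool call per response.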