llama.cpp/common/chat.cpp at b2426e469e2fdb6c44216d56baa4cfff4f39ae00

mirror of https://github.com/ggml-org/llama.cpp.git synced 2025-11-01 09:01:57 +00:00

Piotr Wilkin (ilintar) b2426e469e chat : nemotron thinking & toolcalling support (#15676)

* feat: nemotron thinking & toolcalling support

* Trailing whitespaces

* Corrected template for Nemotron

* Template and parser fixes

* Final template and grammar changes

* Whitespace

* Always do lazy grammar processing since </think> tag will always be there.

* Allow extra content after toolcall

* Whitespace

* New tests: thinking + tools, tools + content, thinking + tools + content (new!)

* Whitespace

* Remove cURL test script

2025-09-05 01:22:22 +02:00

110 KiB
