llama.cpp

mirror of https://github.com/ggml-org/llama.cpp.git synced 2025-11-03 09:22:01 +00:00

Files

Georgi Gerganov f5a77a629b Introduce C-style API (#370 )

* Major refactoring - introduce C-style API

* Clean up

* Add <cassert>

* Add <iterator>

* Add <algorithm> ....

* Fix timing reporting and accumulation

* Measure eval time only for single-token calls

* Change llama_tokenize return meaning

2023-03-22 07:32:36 +02:00

ggml-vocab.bin

Introduce C-style API (#370 )

2023-03-22 07:32:36 +02:00