Introduce C-style API (#370)

mirror of https://github.com/ggml-org/llama.cpp.git synced 2025-10-31 08:51:55 +00:00

* Major refactoring - introduce C-style API

* Clean up

* Add <cassert>

* Add <iterator>

* Add <algorithm> ....

* Fix timing reporting and accumulation

* Measure eval time only for single-token calls

* Change llama_tokenize return meaning

This commit is contained in:

Georgi Gerganov

2023-03-22 07:32:36 +02:00

committed by

GitHub

parent da0e9fe90c

commit f5a77a629b

14 changed files with 1954 additions and 1752 deletions

1565

llama.cpp Normal file

View File

File diff suppressed because it is too large Load Diff

Introduce C-style API (#370)

1565 llama.cpp Normal file View File

1565

llama.cpp Normal file

View File