llama.cpp/convert-pth-to-ggml.py at 6b6dbc8910c6d53f4d96c46c8fcec70e2cd435d8

mirror of https://github.com/ggml-org/llama.cpp.git synced 2025-10-31 08:51:55 +00:00

Georgi Gerganov f5a77a629b Introduce C-style API (#370)

* Major refactoring - introduce C-style API

* Clean up

* Add <cassert>

* Add <iterator>

* Add <algorithm> ....

* Fix timing reporting and accumulation

* Measure eval time only for single-token calls

* Change llama_tokenize return meaning

2023-03-22 07:32:36 +02:00

5.2 KiB
