llama : sync gguf-llama.cpp with latest llama.cpp (#2608)

* llama : sync gguf-llama.cpp with latest llama.cpp

* minor : indentation + assert

* llama : refactor gguf_buffer and gguf_ctx_buffer

* llama : minor
This commit is contained in:
Georgi Gerganov
2023-08-14 16:28:44 +03:00
committed by GitHub
parent 6f64b6c0f8
commit f00780b2ee
6 changed files with 692 additions and 463 deletions

File diff suppressed because it is too large Load Diff