llama : sync gguf-llama.cpp with latest llama.cpp (#2608)

mirror of https://github.com/ggml-org/llama.cpp.git synced 2025-10-31 08:51:55 +00:00

* llama : sync gguf-llama.cpp with latest llama.cpp

* minor : indentation + assert

* llama : refactor gguf_buffer and gguf_ctx_buffer

* llama : minor

Georgi Gerganov, 2023-08-14 16:28:44 +03:00, committed by GitHub

parent 6f64b6c0f8

commit f00780b2ee

6 changed files with 692 additions and 463 deletions

989 gguf-llama.cpp

File diff suppressed because it is too large
