mirror of
https://github.com/ggml-org/llama.cpp.git
synced 2025-10-28 08:31:25 +00:00
readme : add notice about new file format
ggml-ci
This commit is contained in:
12
README.md
12
README.md
@@ -9,11 +9,17 @@
|
|||||||
|
|
||||||
Inference of [LLaMA](https://arxiv.org/abs/2302.13971) model in pure C/C++
|
Inference of [LLaMA](https://arxiv.org/abs/2302.13971) model in pure C/C++
|
||||||
|
|
||||||
### 🚧 Incoming breaking change + refactoring:
|
### Hot topics
|
||||||
|
|
||||||
See PR https://github.com/ggerganov/llama.cpp/pull/2398 for more info.
|
A new file format has been introduced: [GGUF](https://github.com/ggerganov/llama.cpp/pull/2398)
|
||||||
|
|
||||||
To devs: avoid making big changes to `llama.h` / `llama.cpp` until merged
|
Last revision compatible with the old format: [dadbed9](https://github.com/ggerganov/llama.cpp/commit/dadbed99e65252d79f81101a392d0d6497b86caa)
|
||||||
|
|
||||||
|
### Current `master` should be considered in Beta - expect some issues for a few days!
|
||||||
|
|
||||||
|
### Be prepared to re-convert and / or re-quantize your GGUF models while this notice is up!
|
||||||
|
|
||||||
|
### Issues with non-GGUF models will be considered with low priority!
|
||||||
|
|
||||||
----
|
----
|
||||||
|
|
||||||
|
|||||||
Reference in New Issue
Block a user