Merge branch 'master' into compilade/bitnet-ternary

2025-10-30 08:42:00 +00:00 · 2024-08-22 16:42:24 -04:00
parent 35cc5567c8 11b84eb457
commit cb6d9962c4
77 changed files with 4681 additions and 2212 deletions
--- a/examples/quantize/README.md
+++ b/examples/quantize/README.md
@@ -34,7 +34,7 @@ Run the quantized model:

 ```bash
 # start inference on a gguf model
-./llama-cli -m ./models/mymodel/ggml-model-Q4_K_M.gguf -n 128
+./llama-cli -m ./models/mymodel/ggml-model-Q4_K_M.gguf -cnv -p "You are a helpful assistant"
 ```

 When running the larger models, make sure you have enough disk space to store all the intermediate files.