llama : add support for GritLM (#5959)

mirror of https://github.com/ggml-org/llama.cpp.git synced 2025-10-27 08:21:30 +00:00

* add gritlm example

* gritlm results match

* tabs to spaces

* comment out debug printing

* rebase to new embed

* gritlm embeddings are back babeee

* add to gitignore

* allow to toggle embedding mode

* Clean-up GritLM sample code.

* Fix types.

* Flush stdout and output ending newline if streaming.

* mostly style fixes; correct KQ_mask comment

* add causal_attn flag to llama_cparams

* gritml : minor

* llama : minor

---------

Co-authored-by: Douglas Hanley <thesecretaryofwar@gmail.com>
Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>

Author: DAN™
Date: 2024-03-10 11:56:30 -04:00
Committed by: GitHub
Parent: 2960eae847
Commit: bcebd7dbf6

7 changed files with 267 additions and 4 deletions

.gitignore

@@ -45,6 +45,7 @@ models-mnt
 /embedding
 /gguf
 /gguf-llama-simple
+/gritlm
 /imatrix
 /infill
 /libllama.so
