llama.cpp/examples/server/utils.hpp at 3ca23481dd309bd51cc31c73a4cc34f922cc372f

mirror of https://github.com/ggml-org/llama.cpp.git synced 2025-10-31 08:51:55 +00:00

Files

Xuan Son Nguyen 99b71c068f Server: Use multi-task for embeddings endpoint (#6001 )

* use multitask for embd endpoint

* specify types

* remove redundant {"n_predict", 0}

2024-03-13 11:39:11 +01:00

View Raw