llama.cpp/examples/retrieval/retrieval.cpp at 0c41e03ceba1aafaf469476a56347419d5785376

mirror of https://github.com/ggml-org/llama.cpp.git synced 2025-11-15 11:17:31 +00:00

gtygo 4b9afbbe90 retrieval : fix memory leak in retrieval query handling (#8955)

* retrieval

* Reuse querybatch to reduce frequent memory allocation

* delete unused white space

2024-08-15 10:40:12 +03:00