llama.cpp/llama.cpp at b05102fe8cfa9893851c6bf6efd15cdc20b6afa2

mirror of https://github.com/ggml-org/llama.cpp.git synced 2025-11-05 09:36:52 +00:00

Ian Bull e1e721094d llama : fix memory leak in llama_batch_free (#5252)

llama_batch_init allocates memory for a fixed number of tokens.
However, llama_batch_free only frees memory for the number of
tokens that were actually added to the batch, so the rest is leaked.

This change-set uses a null-terminated array for the batch seq_id and
frees every element until the nullptr is reached. It also renames the
first parameter of llama_batch_init from `n_tokens` to `n_tokens_alloc`
to make clear that this value is the number of tokens allocated to the
batch, not the number of tokens in the batch.
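
The null-terminated approach described above can be sketched roughly as
follows. This is a minimal illustration, not the code from the commit:
`demo_batch`, `demo_batch_init`, `demo_batch_n_alloc` and `demo_batch_free`
are hypothetical names, the struct keeps only the fields needed to show the
idea, and allocation-failure handling is omitted.

```cpp
#include <cassert>
#include <cstdint>
#include <cstdlib>

// Hypothetical, simplified stand-in for a batch; not the real llama_batch.
struct demo_batch {
    int32_t    n_tokens; // tokens currently in use (may be < allocated)
    int32_t ** seq_id;   // one array per allocated slot, then a nullptr sentinel
};

// Allocate n_tokens_alloc slots plus one extra entry for the sentinel.
demo_batch demo_batch_init(int32_t n_tokens_alloc, int32_t n_seq_max) {
    assert(n_tokens_alloc >= 0 && n_seq_max > 0);
    demo_batch batch = { 0, nullptr };
    batch.seq_id = (int32_t **) malloc(sizeof(int32_t *) * (n_tokens_alloc + 1));
    for (int32_t i = 0; i < n_tokens_alloc; ++i) {
        batch.seq_id[i] = (int32_t *) malloc(sizeof(int32_t) * n_seq_max);
    }
    batch.seq_id[n_tokens_alloc] = nullptr; // marks the end of the allocation
    return batch;
}

// Count allocated slots by walking to the sentinel.
int32_t demo_batch_n_alloc(const demo_batch & batch) {
    int32_t n = 0;
    while (batch.seq_id != nullptr && batch.seq_id[n] != nullptr) {
        ++n;
    }
    return n;
}

// Free every slot up to the sentinel, regardless of batch.n_tokens.
void demo_batch_free(demo_batch & batch) {
    if (batch.seq_id == nullptr) {
        return;
    }
    for (int32_t i = 0; batch.seq_id[i] != nullptr; ++i) {
        free(batch.seq_id[i]);
    }
    free(batch.seq_id);
    batch.seq_id   = nullptr;
    batch.n_tokens = 0;
}
```

Because the free routine stops at the sentinel rather than at `n_tokens`,
a batch allocated for 8 tokens but holding only 3 still releases all 8
per-token arrays.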

2024-02-02 09:20:13 +02:00

451 KiB
