llama.cpp/ggml.c at 4d698495eae6912db94dcdedb0c3b01c63143646

mirror of https://github.com/ggml-org/llama.cpp.git synced 2025-11-02 09:12:03 +00:00

Files

slaren 5488fb789e ggml : allocate graphs in a context (#2392 )

* ggml : graph allocation in contexts

* allocate work buffer as a ggml_object in ggml_graph_compute_with_ctx

* llama.cpp : allocate graph in the context

* add GGML_PAD

---------

Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>

2023-07-26 15:56:53 +02:00

591 KiB

Raw Blame History

View Raw

591 KiB Raw Blame History

591 KiB

Raw Blame History