llama.cpp/llama.cpp at 9475cdb7a352c7ac3cf868affa2cb71327e7ac80

mirror of https://github.com/ggml-org/llama.cpp.git synced 2025-11-03 09:22:01 +00:00

Files

slaren 5488fb789e ggml : allocate graphs in a context (#2392 )

* ggml : graph allocation in contexts

* allocate work buffer as a ggml_object in ggml_graph_compute_with_ctx

* llama.cpp : allocate graph in the context

* add GGML_PAD

---------

Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>

2023-07-26 15:56:53 +02:00

143 KiB

Raw Blame History

View Raw

143 KiB Raw Blame History

143 KiB

Raw Blame History