llama.cpp/ggml-opencl.cpp at 74a6d922f12ccfe16b0c265f43be8978c6f25e98

mirror of https://github.com/ggml-org/llama.cpp.git synced 2025-10-30 08:42:00 +00:00

Howard Su 58970a4c39 Leverage mmap for offloading tensors to GPU (#1597)

* Rebase to latest

* Show progress

* Add assert to make sure we only allocate temp buffer for non-CPU backend tensor

Co-authored-by: Johannes Gäßler <johannesg@5d6.de>

---------

Co-authored-by: Johannes Gäßler <johannesg@5d6.de>

2023-06-12 14:44:16 +02:00

43 KiB
