llama.cpp/ggml-opencl.h at ccd81a751bfd6f313d5bea7ea20cd2eee3ee53b0

mirror of https://github.com/ggml-org/llama.cpp.git synced 2025-10-30 08:42:00 +00:00

Files

Howard Su 58970a4c39 Leverage mmap for offloading tensors to GPU (#1597 )

* Rebase to latest

* Show progress

* Add assert to make sure we only allocate temp buffer for non-CPU backend tensor

Co-authored-by: Johannes Gäßler <johannesg@5d6.de>

---------

Co-authored-by: Johannes Gäßler <johannesg@5d6.de>

2023-06-12 14:44:16 +02:00

845 B

Raw Blame History

View Raw

845 B Raw Blame History

845 B

Raw Blame History