llama.cpp/ggml-cuda.h at c9e2c26f413377b352845f442cdab976ce85a05d

mirror of https://github.com/ggml-org/llama.cpp.git synced 2025-11-04 09:32:00 +00:00

Files

slaren 50cb666b8a Improve cuBLAS performance by using a memory pool (#1094 )

* Improve cuBLAS performance by using a memory pool

* Move cuda specific definitions to ggml-cuda.h/cu

* Add CXX flags to nvcc

* Change memory pool synchronization mechanism to a spin lock
General code cleanup

2023-04-21 21:59:17 +02:00

2.0 KiB

Raw Blame History

View Raw

2.0 KiB Raw Blame History

2.0 KiB

Raw Blame History