llama.cpp/ggml-cuda.cu at 7e312f165c5047d6e16680d1eebc83055e95c313

mirror of https://github.com/ggml-org/llama.cpp.git synced 2025-11-02 09:12:03 +00:00

Files

slaren 50cb666b8a Improve cuBLAS performance by using a memory pool (#1094 )

* Improve cuBLAS performance by using a memory pool

* Move cuda specific definitions to ggml-cuda.h/cu

* Add CXX flags to nvcc

* Change memory pool synchronization mechanism to a spin lock
General code cleanup

2023-04-21 21:59:17 +02:00

6.1 KiB

Raw Blame History

View Raw

6.1 KiB Raw Blame History

6.1 KiB

Raw Blame History