llama.cpp/ggml-cuda.h at c6524f46eb93fdb949330293a8469fd70080bd5a

mirror of https://github.com/ggml-org/llama.cpp.git synced 2025-10-31 08:51:55 +00:00

Files

slaren 50cb666b8a Improve cuBLAS performance by using a memory pool (#1094 )

* Improve cuBLAS performance by using a memory pool

* Move cuda specific definitions to ggml-cuda.h/cu

* Add CXX flags to nvcc

* Change memory pool synchronization mechanism to a spin lock
General code cleanup

2023-04-21 21:59:17 +02:00

2.0 KiB

Raw Blame History

View Raw

2.0 KiB Raw Blame History

2.0 KiB

Raw Blame History