llama.cpp/ggml-cuda.h at e4cf982e0d4fcfbb4b977a52dbeacd115da10c3b

mirror of https://github.com/ggml-org/llama.cpp.git synced 2025-11-01 09:01:57 +00:00

Files

slaren 50cb666b8a Improve cuBLAS performance by using a memory pool (#1094 )

* Improve cuBLAS performance by using a memory pool

* Move cuda specific definitions to ggml-cuda.h/cu

* Add CXX flags to nvcc

* Change memory pool synchronization mechanism to a spin lock
General code cleanup

2023-04-21 21:59:17 +02:00

2.0 KiB

Raw Blame History

View Raw

2.0 KiB Raw Blame History

2.0 KiB

Raw Blame History