llama.cpp/ggml/src/ggml-cuda/ggml-cuda.cu at cae9fb4361138b937464524eed907328731b81f6

mirror of https://github.com/ggml-org/llama.cpp.git synced 2025-11-22 12:27:26 +00:00

Files

Nikita Sarychev cae9fb4361 HIP: Only call rocblas_initialize on rocblas versions with the multiple instantation bug (#11080 )

This disables the workaround on rocblas fixed versions (>=4.0.0) to eliminate the runtime cost and unnecessary VRAM allocation of loading all tensile objects.

2025-01-28 16:42:20 +01:00

134 KiB

Raw Blame History

View Raw

134 KiB Raw Blame History

134 KiB

Raw Blame History