* vulkan: Reuse conversion results in prealloc_y

  Cache the pipeline and tensor that were most recently used to fill
  prealloc_y, and skip the conversion if the current pipeline/tensor
  match.

* don't use shared pointer for prealloc_y_last_pipeline_used
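The optimization amounts to a one-entry memoization of the conversion that fills prealloc_y: remember which pipeline and source tensor last produced the buffer's contents, and skip the dispatch when the same pair comes around again. Below is a minimal sketch of that pattern; all names (`vk_pipeline`, `ggml_tensor`, `convert_to_prealloc_y`, `fill_prealloc_y`) are illustrative placeholders, not the actual llama.cpp Vulkan backend API.

```cpp
// Minimal sketch of the caching pattern described in the commit message.
// All types and functions here are placeholder stand-ins.
#include <cstdio>

struct vk_pipeline {};   // stand-in for a Vulkan compute pipeline
struct ggml_tensor {};   // stand-in for a ggml tensor

// Stand-in for the dispatch that converts `src` into the prealloc_y buffer.
static void convert_to_prealloc_y(const vk_pipeline *, const ggml_tensor *) {
    std::printf("running conversion\n");
}

struct vk_context {
    // Plain (non-owning) pointers: per the second commit, a shared pointer
    // is unnecessary here, since the cache is only compared for identity
    // and never owns the pipeline.
    const vk_pipeline * prealloc_y_last_pipeline_used = nullptr;
    const ggml_tensor * prealloc_y_last_tensor_used   = nullptr;
};

static void fill_prealloc_y(vk_context & ctx,
                            const vk_pipeline * pipeline,
                            const ggml_tensor * src) {
    // Skip the conversion when prealloc_y already holds the result for
    // this exact pipeline/tensor pair.
    if (ctx.prealloc_y_last_pipeline_used == pipeline &&
        ctx.prealloc_y_last_tensor_used == src) {
        return;
    }
    convert_to_prealloc_y(pipeline, src);
    ctx.prealloc_y_last_pipeline_used = pipeline;
    ctx.prealloc_y_last_tensor_used   = src;
}

int main() {
    vk_context ctx;
    vk_pipeline pipe;
    ggml_tensor tensor;
    fill_prealloc_y(ctx, &pipe, &tensor);  // runs the conversion
    fill_prealloc_y(ctx, &pipe, &tensor);  // cache hit: conversion skipped
}
```

Because the cache compares pointer identity only, it must be invalidated (reset to null) whenever prealloc_y is overwritten by anything else; the sketch omits that bookkeeping.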