llama.cpp/ggml/src/ggml-backend.cpp at b907255f4bd169b0dc7dca9553b4c54af5170865

mirror of https://github.com/ggml-org/llama.cpp.git synced 2025-11-03 09:22:01 +00:00

Jeff Bolz e68aa10d8f vulkan: sort graph to allow more parallel execution (#15850)

* vulkan: sort graph to allow more parallel execution

Add a backend proc that lets the backend modify the graph. The
Vulkan implementation looks at which nodes depend on each other
and greedily reorders them so that nodes that don't depend on
each other are grouped together. It only reorders the nodes; it
doesn't change the contents of any of them.
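
The greedy grouping described above can be illustrated with a minimal
sketch. This is not the Vulkan backend's code: ToyNode and
greedy_reorder are hypothetical names, the graph is a plain list of
dependency indices rather than a ggml graph, and the "wave" strategy is
just one simple way to group mutually independent nodes.

```cpp
#include <cassert>
#include <vector>

// Hypothetical toy node: it only records which other nodes it reads from.
struct ToyNode {
    std::vector<int> deps; // indices into the original node list
};

// Returns a permutation of node indices. Nodes are emitted in waves: each
// wave collects every not-yet-emitted node whose dependencies were all
// emitted in earlier waves, so nodes inside one wave never depend on each
// other. Node contents are untouched; only the order changes.
std::vector<int> greedy_reorder(const std::vector<ToyNode> & nodes) {
    const int n = static_cast<int>(nodes.size());
    std::vector<bool> emitted(n, false);
    std::vector<int>  order;
    order.reserve(nodes.size());

    while (static_cast<int>(order.size()) < n) {
        std::vector<int> wave;
        for (int i = 0; i < n; ++i) {
            if (emitted[i]) {
                continue;
            }
            bool ready = true;
            for (int d : nodes[i].deps) {
                assert(d >= 0 && d < n);
                if (!emitted[d]) {
                    ready = false;
                    break;
                }
            }
            if (ready) {
                wave.push_back(i);
            }
        }
        if (wave.empty()) {
            break; // dependency cycle: stop and return the partial order
        }
        // Mark the wave as emitted only after it is complete, so that a
        // node cannot join the same wave as one of its dependencies.
        for (int i : wave) {
            emitted[i] = true;
            order.push_back(i);
        }
    }
    return order;
}
```

For two independent chains listed as 0, 1 (reads 0), 2, 3 (reads 2), the
sketch moves nodes 0 and 2 next to each other and places nodes 1 and 3
after them, so each adjacent pair could run without a synchronization
in between.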

With #15489, this reduces the number of synchronizations needed.

* call optimize_graph per-split

2025-09-09 02:10:07 +08:00

84 KiB
