llama.cpp/src/llama-graph.cpp at 20f8e43e63033a1bf5ba936b468e70aec36f6e53

mirror of https://github.com/ggml-org/llama.cpp.git synced 2025-11-05 09:36:52 +00:00

Files

Francis Couture-Harpin 20f8e43e63 graph : add back hybrid memory graph input

But this time it contains the sub-cache graph inputs.
This *should* make it easier to handle updating the inputs
when caching the graph (eventually).

2025-07-03 17:07:46 -04:00

View Raw