llama.cpp/gguf-py/gguf/lazy.py at 6cbbd8e1dfdef8ed70b676a400441f9cf34fc10a

mirror of https://github.com/ggml-org/llama.cpp.git synced 2025-11-01 09:01:57 +00:00

Francis Couture-Harpin 6cbbd8e1df gguf-py : support lazy tensor splitting

Splitting usually involves returning tuples of tensors,
which need to be handled properly to avoid early eager evaluation.
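The idea in this commit message can be illustrated with a minimal sketch. It is not the code in `lazy.py`: `LazyValue`, `lazy_split`, and `eval_log` are hypothetical names, and plain Python lists stand in for tensors. The point it shows is that a split can return a tuple of lazy parts right away, while the underlying split runs only when one of the parts is actually needed.

```python
from __future__ import annotations

from typing import Any, Callable

# Records when real work happens, so laziness can be observed (illustrative only).
eval_log: list[str] = []


class LazyValue:
    """Hypothetical stand-in for a lazy tensor: holds a thunk instead of data."""

    def __init__(self, thunk: Callable[[], Any]) -> None:
        self._thunk = thunk
        self._value: Any = None
        self._done = False

    def evaluate(self) -> Any:
        # Run the thunk at most once and cache the result.
        if not self._done:
            self._value = self._thunk()
            self._done = True
        return self._value


def lazy_split(src: LazyValue, sections: int) -> tuple[LazyValue, ...]:
    """Split a lazy list into `sections` equal lazy chunks without evaluating it."""

    def do_split() -> tuple[list, ...]:
        eval_log.append("split")
        data = src.evaluate()
        if len(data) % sections != 0:
            raise ValueError("length is not divisible by the number of sections")
        step = len(data) // sections
        return tuple(data[i * step:(i + 1) * step] for i in range(sections))

    # One shared lazy node holds the whole tuple; each returned part only
    # indexes into it, so the tuple itself never forces evaluation.
    whole = LazyValue(do_split)
    return tuple(
        LazyValue(lambda i=i: whole.evaluate()[i]) for i in range(sections)
    )


source = LazyValue(lambda: list(range(12)))
parts = lazy_split(source, 3)   # nothing has been computed yet
middle = parts[1].evaluate()    # the split runs here, once
```

Returning a real tuple of lazy parts lets callers unpack the result as usual. Sharing one memoized node for the whole split means evaluating several parts does not repeat the work.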

2025-04-07 19:20:54 -04:00
