mirror of
https://github.com/ggml-org/llama.cpp.git
synced 2025-11-20 12:07:33 +00:00
Splitting usually involves returning tuples of tensors, which need to be handled properly to avoid early eager evaluation.
Splitting usually involves returning tuples of tensors, which need to be handled properly to avoid early eager evaluation.