mirror of
https://github.com/ggml-org/llama.cpp.git
synced 2025-10-28 08:31:25 +00:00
* ggml : move AMX to the CPU backend --------- Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>
GGUF split Example
CLI to split / merge GGUF files.
Command line options:
--split: split GGUF to multiple GGUF, default operation.--split-max-size: max size per split inMorG, f.ex.500Mor2G.--split-max-tensors: maximum tensors in each split: default(128)--merge: merge multiple GGUF to a single GGUF.