mirror of
				https://github.com/ggml-org/llama.cpp.git
				synced 2025-10-28 08:31:25 +00:00 
			
		
		
		
	 1442677f92
			
		
	
	1442677f92
	
	
	
		
			
			* common : gpt_params_parse do not print usage * common : rework usage print (wip) * common : valign * common : rework print_usage * infill : remove cfg support * common : reorder args * server : deduplicate parameters ggml-ci * common : add missing header ggml-ci * common : remote --random-prompt usages ggml-ci * examples : migrate to gpt_params ggml-ci * batched-bench : migrate to gpt_params * retrieval : migrate to gpt_params * common : change defaults for escape and n_ctx * common : remove chatml and instruct params ggml-ci * common : passkey use gpt_params
GGUF split Example
CLI to split / merge GGUF files.
Command line options:
- --split: split GGUF to multiple GGUF, default operation.
- --split-max-size: max size per split in- Mor- G, f.ex.- 500Mor- 2G.
- --split-max-tensors: maximum tensors in each split: default(128)
- --merge: merge multiple GGUF to a single GGUF.