mirror of
				https://github.com/ggml-org/llama.cpp.git
				synced 2025-10-31 08:51:55 +00:00 
			
		
		
		
	 8c70a5ff25
			
		
	
	8c70a5ff25
	
	
	
		
			
			* batched : add bench tool * batched : minor fix table * batched-bench : add readme + n_kv_max is now configurable * batched-bench : init warm-up batch * batched-bench : pass custom set of PP, TG and PL * batched-bench : add mmq CLI arg
		
			
				
	
	
	
		
			239 B
		
	
	
	
	
	
	
	
			
		
		
	
	
			239 B