Mirror of https://github.com/ggml-org/llama.cpp.git, synced 2025-10-31 08:51:55 +00:00. Commit 0cc63754b8.

It's like simple-chat, but it uses smart pointers to avoid manual memory cleanup, so there are fewer memory leaks in the code. It avoids printing multiple dots, splits the code into smaller functions, and uses no exception handling.

Signed-off-by: Eric Curtin <ecurtin@redhat.com>

# llama.cpp/example/run

This example demonstrates minimal usage of llama.cpp for running models.
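
The example is described as using smart pointers instead of manual cleanup and as avoiding exception handling. The following is a minimal sketch of that general pattern, not the example's actual source: `toy_model`, `toy_model_load`, `toy_model_free`, and `g_live_models` are hypothetical stand-ins for a C-style resource API and are not part of llama.cpp.

```cpp
#include <cassert>
#include <memory>

// Hypothetical C-style handle standing in for a library resource
// (an assumption for illustration, not a llama.cpp type).
struct toy_model {
    int n_params;
};

// Counts handles that have been loaded but not yet freed.
static int g_live_models = 0;

// Hypothetical C-style constructor: the caller owns the returned pointer.
toy_model * toy_model_load(int n_params) {
    ++g_live_models;
    return new toy_model{n_params};
}

// Hypothetical C-style destructor that must be called exactly once per handle.
void toy_model_free(toy_model * m) {
    delete m;
    --g_live_models;
}

// Deleter that forwards to the C-style free function.
struct toy_model_deleter {
    void operator()(toy_model * m) const { toy_model_free(m); }
};

// Owning smart pointer: the handle is freed when the pointer goes out of scope.
using toy_model_ptr = std::unique_ptr<toy_model, toy_model_deleter>;

// Returns the number of live handles while the smart pointer is in scope,
// or -1 if loading failed (errors are reported by return value, not by throwing).
int live_models_inside_scope() {
    toy_model_ptr model(toy_model_load(8));
    if (!model) {
        return -1;
    }
    return g_live_models;
}
```

Calling `live_models_inside_scope()` leaves no handle behind: the deleter runs on every return path, so no explicit call to `toy_model_free` is needed at the call site.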

```bash
./llama-run Meta-Llama-3.1-8B-Instruct.gguf
...
```