Johannes Gäßler
							
						 
					 | 
					
						
						
							
						
						e81b8e4b7f
					 | 
					
						
						
							
							llama: use FA + max. GPU layers by default (#15434)
						
						
						
						
						
						
						
						* llama: use max. GPU layers by default, auto -fa
* ggml-backend: abort instead of segfault 
						
						
					 | 
					
						2025-08-30 16:32:10 +02:00 | 
					
					
						
						
						
							
							
							
							
							
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Georgi Gerganov
							
						 
					 | 
					
						
						
							
						
						d2fcd91cf9
					 | 
					
						
						
							
							server : disable context shift by default (#15416)
						
						
						
						
						
						
						
						* server : disable context shift by default
ggml-ci
* server : make scopr of test parameters local 
						
						
					 | 
					
						2025-08-19 16:46:37 +03:00 | 
					
					
						
						
						
							
							
							
							
							
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Xuan-Son Nguyen
							
						 
					 | 
					
						
						
							
						
						6aa892ec2a
					 | 
					
						
						
							
							server : do not return error out of context (with ctx shift disabled) (#13577)
						
						
						
						
						
						
					 | 
					
						2025-05-16 21:50:00 +02:00 | 
					
					
						
						
						
							
							
							
							
							
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Diego Devesa
							
						 
					 | 
					
						
						
							
						
						1d36b3670b
					 | 
					
						
						
							
							llama : move end-user examples to tools directory (#13249)
						
						
						
						
						
						
						
						* llama : move end-user examples to tools directory
---------
Co-authored-by: Xuan Son Nguyen <son@huggingface.co> 
						
						
					 | 
					
						2025-05-02 20:27:13 +02:00 | 
					
					
						
						
						
							
							
							
							
							
							
							
							
						
					 |