| Lukas Straub | a9f77a8be3 | server : add openai-style logit_bias support (#14946) Signed-off-by: Lukas Straub <lukasstraub2@web.de> | 2025-07-31 14:08:23 +02:00 |  |
| Olivier Chafik | f13847cfb5 | server: fix regression on streamed non-chat completion w/ stops (#13785) * more forgiving message diffs: partial stop words aren't erased, full stops are * Add (slow) server test for completion + stream + stop | 2025-05-26 14:16:37 +01:00 |  |
| Xuan-Son Nguyen | 360a9c98e1 | server : fix cache_tokens bug with no cache_prompt (#13533) | 2025-05-14 13:35:07 +02:00 |  |
| Diego Devesa | 1d36b3670b | llama : move end-user examples to tools directory (#13249) Co-authored-by: Xuan Son Nguyen <son@huggingface.co> | 2025-05-02 20:27:13 +02:00 |  |