Georgi Gerganov 
							
						 
					 
					
						
						
							
						
						68ff663a04 
					 
					
						
						
							
							repo : update links to new url ( #11886 )  
						
						... 
						
						
						
						* repo : update links to new url
ggml-ci
* cont : more urls
ggml-ci 
						
						
					 
					
						2025-02-15 16:40:57 +02:00 
						 
				 
			
				
					
						
							
							
								Olivier Chafik 
							
						 
					 
					
						
						
							
						
						c7f460ab88 
					 
					
						
						
							
							server: fix tool-call of DeepSeek R1 Qwen, return reasoning_content (Command 7RB & DeepSeek R1) unless --reasoning-format none (#11607 )  
						
						... 
						
						
						
						* extract & return thoughts in reasoning_content field (unless --reasoning-format) for DeepSeek R1 & Command R7B
* tool-calls: add deepseek r1 template (models/templates/llama-cpp-deepseek-r1.jinja) + hackommodate broken official template
* tool-calls: accommodate variety of wrong tool call opening tags both R1 Qwen 32B and 7B distills like to spit out
* server/oai: ensure content is null when there are tool calls, and reasoning_content appears before content for readability
* tool-calls: add DeepSeek R1 Qwen distills to server/README.md & server tests
Co-authored-by: Georgi Gerganov <ggerganov@gmail.com >
---------
Co-authored-by: Georgi Gerganov <ggerganov@gmail.com > 
						
						
					 
					
						2025-02-13 10:05:16 +00:00 
						 
				 
			
				
					
						
							
							
								Georgi Gerganov 
							
						 
					 
					
						
						
							
						
						0fb77f821f 
					 
					
						
						
							
							sync : ggml  
						
						
						
						
					 
					
						2025-02-12 21:46:02 +02:00 
						 
				 
			
				
					
						
							
							
								Georgi Gerganov 
							
						 
					 
					
						
						
							
						
						8a59053f63 
					 
					
						
						
							
							sync : ggml  
						
						
						
						
					 
					
						2025-02-06 21:23:03 +02:00 
						 
				 
			
				
					
						
							
							
								Georgi Gerganov 
							
						 
					 
					
						
						
							
						
						7c9e0ca520 
					 
					
						
						
							
							sync : ggml  
						
						
						
						
					 
					
						2025-02-04 12:59:21 +02:00 
						 
				 
			
				
					
						
							
							
								Georgi Gerganov 
							
						 
					 
					
						
						
							
						
						8ec05832fa 
					 
					
						
						
							
							sync : ggml  
						
						
						
						
					 
					
						2025-02-03 14:57:08 +02:00 
						 
				 
			
				
					
						
							
							
								Olivier Chafik 
							
						 
					 
					
						
						
							
						
						8b576b6c55 
					 
					
						
						
							
							Tool call support (generic + native for Llama, Functionary, Hermes, Mistral, Firefunction, DeepSeek) w/ lazy grammars ( #9639 )  
						
						... 
						
						
						
						---------
Co-authored-by: Xuan Son Nguyen <thichthat@gmail.com >
Co-authored-by: Georgi Gerganov <ggerganov@gmail.com >
Co-authored-by: Xuan Son Nguyen <son@huggingface.co > 
						
						
					 
					
						2025-01-30 19:13:58 +00:00 
						 
				 
			
				
					
						
							
							
								Georgi Gerganov 
							
						 
					 
					
						
						
							
						
						815857791d 
					 
					
						
						
							
							sync : ggml  
						
						
						
						
					 
					
						2025-01-29 11:25:29 +02:00 
						 
				 
			
				
					
						
							
							
								Olivier Chafik 
							
						 
					 
					
						
						
							
						
						6171c9d258 
					 
					
						
						
							
							Add Jinja template support ( #11016 )  
						
						... 
						
						
						
						* Copy minja from 58f0ca6dd7https://github.com/google/minja/pull/22 )
* Apply suggestions from code review
Co-authored-by: Xuan Son Nguyen <thichthat@gmail.com >
Co-authored-by: Georgi Gerganov <ggerganov@gmail.com >
* Finish suggested renamings
* Move chat_templates inside server_context + remove mutex
* Update --chat-template-file w/ recent change to --chat-template
* Refactor chat template validation
* Guard against missing eos/bos tokens (null token otherwise throws in llama_vocab::impl::token_get_attr)
* Warn against missing eos / bos tokens when jinja template references them
* rename: common_chat_template[s]
* reinstate assert on chat_templates.template_default
* Update minja to b8437df626https://github.com/google/minja/pull/25 
* Update minja from https://github.com/google/minja/pull/27 
* rm unused optional header
---------
Co-authored-by: Xuan Son Nguyen <thichthat@gmail.com >
Co-authored-by: Georgi Gerganov <ggerganov@gmail.com > 
						
						
					 
					
						2025-01-21 13:18:51 +00:00 
						 
				 
			
				
					
						
							
							
								Georgi Gerganov 
							
						 
					 
					
						
						
							
						
						f26c874179 
					 
					
						
						
							
							scripts : restore hf.sh ( #11288 )  
						
						... 
						
						
						
						ggml-ci 
						
						
					 
					
						2025-01-18 13:18:32 +02:00 
						 
				 
			
				
					
						
							
							
								Georgi Gerganov 
							
						 
					 
					
						
						
							
						
						f11cfdfd7f 
					 
					
						
						
							
							ci : use -no-cnv in gguf-split tests ( #11254 )  
						
						... 
						
						
						
						* ci : use -no-cnv in gguf-split tests
ggml-ci
* ci : use -no-cnv in requantize tests
ggml-ci
* scripts : fix [no ci] 
						
						
					 
					
						2025-01-15 18:28:35 +02:00 
						 
				 
			
				
					
						
							
							
								Georgi Gerganov 
							
						 
					 
					
						
						
							
						
						44d1e796d0 
					 
					
						
						
							
							sync : ggml  
						
						
						
						
					 
					
						2025-01-14 10:39:42 +02:00 
						 
				 
			
				
					
						
							
							
								Georgi Gerganov 
							
						 
					 
					
						
						
							
						
						a4f3f5d8e6 
					 
					
						
						
							
							scripts : sync gguf (cont)  
						
						
						
						
					 
					
						2025-01-14 09:40:52 +02:00 
						 
				 
			
				
					
						
							
							
								Georgi Gerganov 
							
						 
					 
					
						
						
							
						
						48e1ae0e61 
					 
					
						
						
							
							scripts : sync gguf  
						
						
						
						
					 
					
						2025-01-14 09:36:58 +02:00 
						 
				 
			
				
					
						
							
							
								Georgi Gerganov 
							
						 
					 
					
						
						
							
						
						d00a80e89d 
					 
					
						
						
							
							scripts : sync opencl  
						
						
						
						
					 
					
						2025-01-14 09:19:58 +02:00 
						 
				 
			
				
					
						
							
							
								Georgi Gerganov 
							
						 
					 
					
						
						
							
						
						99a3755a3c 
					 
					
						
						
							
							sync : ggml  
						
						
						
						
					 
					
						2025-01-08 13:40:30 +02:00 
						 
				 
			
				
					
						
							
							
								Georgi Gerganov 
							
						 
					 
					
						
						
							
						
						78c6785175 
					 
					
						
						
							
							sync : ggml  
						
						
						
						
					 
					
						2025-01-04 16:09:53 +02:00 
						 
				 
			
				
					
						
							
							
								Djip007 
							
						 
					 
					
						
						
							
						
						2cd43f4900 
					 
					
						
						
							
							ggml : more perfo with llamafile tinyblas on x86_64 ( #10714 )  
						
						... 
						
						
						
						* more perfo with llamafile tinyblas on x86_64.
- add bf16 suport
- change dispache strategie (thanks:
https://github.com/ikawrakow/ik_llama.cpp/pull/71  )
- reduce memory bandwidth
simple tinyblas dispache and more cache freindly
* tinyblas dynamic dispaching
* sgemm: add M blocs.
* - git 2.47 use short id of len 9.
- show-progress is not part of GNU Wget2
* remove not stable test 
						
						
					 
					
						2024-12-24 18:54:49 +01:00 
						 
				 
			
				
					
						
							
							
								Georgi Gerganov 
							
						 
					 
					
						
						
							
						
						5437d4aaf5 
					 
					
						
						
							
							sync : ggml  
						
						
						
						
					 
					
						2024-12-17 18:36:02 +02:00 
						 
				 
			
				
					
						
							
							
								Georgi Gerganov 
							
						 
					 
					
						
						
							
						
						87cf323cef 
					 
					
						
						
							
							scripts : change build path to "build-bench" for compare-commits.sh ( #10836 )  
						
						
						
						
					 
					
						2024-12-15 18:44:47 +02:00 
						 
				 
			
				
					
						
							
							
								Georgi Gerganov 
							
						 
					 
					
						
						
							
						
						0cd182ebcc 
					 
					
						
						
							
							sync : ggml  
						
						
						
						
					 
					
						2024-12-05 13:27:42 +02:00 
						 
				 
			
				
					
						
							
							
								Diego Devesa 
							
						 
					 
					
						
						
							
						
						59f4db1088 
					 
					
						
						
							
							ggml : add predefined list of CPU backend variants to build ( #10626 )  
						
						... 
						
						
						
						* ggml : add predefined list of CPU backend variants to build
* update CPU dockerfiles 
						
						
					 
					
						2024-12-04 14:45:40 +01:00 
						 
				 
			
				
					
						
							
							
								Georgi Gerganov 
							
						 
					 
					
						
						
							
						
						1cd3df46bd 
					 
					
						
						
							
							scripts : remove amx sync  
						
						... 
						
						
						
						ggml-ci 
						
						
					 
					
						2024-12-03 20:04:49 +02:00 
						 
				 
			
				
					
						
							
							
								Georgi Gerganov 
							
						 
					 
					
						
						
							
						
						c505471857 
					 
					
						
						
							
							sync : ggml  
						
						
						
						
					 
					
						2024-12-03 20:04:49 +02:00 
						 
				 
			
				
					
						
							
							
								Georgi Gerganov 
							
						 
					 
					
						
						
							
						
						8648c52101 
					 
					
						
						
							
							make : deprecate ( #10514 )  
						
						... 
						
						
						
						* make : deprecate
ggml-ci
* ci : disable Makefile builds
ggml-ci
* docs : remove make references [no ci]
* ci : disable swift build
ggml-ci
* docs : remove obsolete make references, scripts, examples
ggml-ci
* basic fix for compare-commits.sh
* update build.md
* more build.md updates
* more build.md updates
* more build.md updates
* Update Makefile
Co-authored-by: Diego Devesa <slarengh@gmail.com >
---------
Co-authored-by: slaren <slarengh@gmail.com > 
						
						
					 
					
						2024-12-02 21:22:53 +02:00 
						 
				 
			
				
					
						
							
							
								Diego Devesa 
							
						 
					 
					
						
						
							
						
						3420909dff 
					 
					
						
						
							
							ggml : automatic selection of best CPU backend ( #10606 )  
						
						... 
						
						
						
						* ggml : automatic selection of best CPU backend
* amx : minor opt
* add GGML_AVX_VNNI to enable avx-vnni, fix checks 
						
						
					 
					
						2024-12-01 16:12:41 +01:00 
						 
				 
			
				
					
						
							
							
								Georgi Gerganov 
							
						 
					 
					
						
						
							
						
						fee824a1a1 
					 
					
						
						
							
							sync : ggml  
						
						
						
						
					 
					
						2024-11-27 11:10:42 +02:00 
						 
				 
			
				
					
						
							
							
								Georgi Gerganov 
							
						 
					 
					
						
						
							
						
						87a533be57 
					 
					
						
						
							
							sync : ggml  
						
						
						
						
					 
					
						2024-11-21 09:22:11 +02:00 
						 
				 
			
				
					
						
							
							
								Georgi Gerganov 
							
						 
					 
					
						
						
							
						
						9fe0fb0626 
					 
					
						
						
							
							sync : ggml  
						
						
						
						
					 
					
						2024-11-19 20:03:21 +02:00 
						 
				 
			
				
					
						
							
							
								Georgi Gerganov 
							
						 
					 
					
						
						
							
						
						5c9a8b22b1 
					 
					
						
						
							
							scripts : update sync  
						
						
						
						
					 
					
						2024-11-17 08:30:29 +02:00 
						 
				 
			
				
					
						
							
							
								Johannes Gäßler 
							
						 
					 
					
						
						
							
						
						4e54be0ec6 
					 
					
						
						
							
							llama/ex: remove --logdir argument ( #10339 )  
						
						
						
						
					 
					
						2024-11-16 23:00:41 +01:00 
						 
				 
			
				
					
						
							
							
								Georgi Gerganov 
							
						 
					 
					
						
						
							
						
						f245cc28d4 
					 
					
						
						
							
							scripts : fix missing key in compare-llama-bench.py ( #10332 )  
						
						
						
						
					 
					
						2024-11-16 10:32:50 +02:00 
						 
				 
			
				
					
						
							
							
								Johannes Gäßler 
							
						 
					 
					
						
						
							
						
						4047be74da 
					 
					
						
						
							
							scripts: update compare-llama-bench.py ( #10319 )  
						
						
						
						
					 
					
						2024-11-15 21:19:03 +01:00 
						 
				 
			
				
					
						
							
							
								Georgi Gerganov 
							
						 
					 
					
						
						
							
						
						cbf5541a82 
					 
					
						
						
							
							sync : ggml  
						
						
						
						
					 
					
						2024-11-15 15:44:06 +02:00 
						 
				 
			
				
					
						
							
							
								Georgi Gerganov 
							
						 
					 
					
						
						
							
						
						4802ad350b 
					 
					
						
						
							
							scripts : fix regex in sync [no ci]  
						
						
						
						
					 
					
						2024-11-15 08:38:43 +02:00 
						 
				 
			
				
					
						
							
							
								Georgi Gerganov 
							
						 
					 
					
						
						
							
						
						5ea926dad7 
					 
					
						
						
							
							sync : ggml  
						
						
						
						
					 
					
						2024-11-13 18:11:54 +02:00 
						 
				 
			
				
					
						
							
							
								Georgi Gerganov 
							
						 
					 
					
						
						
							
						
						eec4d71737 
					 
					
						
						
							
							scripts : add amx to sync-ggml.sh [no ci]  
						
						
						
						
					 
					
						2024-11-07 23:11:36 +02:00 
						 
				 
			
				
					
						
							
							
								Georgi Gerganov 
							
						 
					 
					
						
						
							
						
						3b08828674 
					 
					
						
						
							
							sync : ggml  
						
						
						
						
					 
					
						2024-11-07 23:08:24 +02:00 
						 
				 
			
				
					
						
							
							
								Georgi Gerganov 
							
						 
					 
					
						
						
							
						
						a2c6fd747c 
					 
					
						
						
							
							scripts : sync update  
						
						
						
						
					 
					
						2024-11-07 23:07:55 +02:00 
						 
				 
			
				
					
						
							
							
								Georgi Gerganov 
							
						 
					 
					
						
						
							
						
						ce027adfb3 
					 
					
						
						
							
							sync : ggml  
						
						
						
						
					 
					
						2024-11-04 10:33:37 +02:00 
						 
				 
			
				
					
						
							
							
								Georgi Gerganov 
							
						 
					 
					
						
						
							
						
						815fe72adc 
					 
					
						
						
							
							sync : ggml  
						
						
						
						
					 
					
						2024-11-01 10:28:24 +02:00 
						 
				 
			
				
					
						
							
							
								Diego Devesa 
							
						 
					 
					
						
						
							
						
						c5b0f4b5d9 
					 
					
						
						
							
							llama : refactor model loader with backend registry ( #10026 )  
						
						
						
						
					 
					
						2024-10-30 02:01:23 +01:00 
						 
				 
			
				
					
						
							
							
								Georgi Gerganov 
							
						 
					 
					
						
						
							
						
						8d8ff71536 
					 
					
						
						
							
							llama : remove Tail-Free sampling ( #10071 )  
						
						... 
						
						
						
						ggml-ci 
						
						
					 
					
						2024-10-29 10:42:05 +02:00 
						 
				 
			
				
					
						
							
							
								Georgi Gerganov 
							
						 
					 
					
						
						
							
						
						cc2983d375 
					 
					
						
						
							
							sync : ggml  
						
						
						
						
					 
					
						2024-10-26 10:34:08 +03:00 
						 
				 
			
				
					
						
							
							
								Georgi Gerganov 
							
						 
					 
					
						
						
							
						
						9e4a2563ea 
					 
					
						
						
							
							scripts : fix amx sync [no ci]  
						
						
						
						
					 
					
						2024-10-26 10:33:31 +03:00 
						 
				 
			
				
					
						
							
							
								Georgi Gerganov 
							
						 
					 
					
						
						
							
						
						190a37d797 
					 
					
						
						
							
							sync : ggml  
						
						
						
						
					 
					
						2024-10-23 17:23:55 +03:00 
						 
				 
			
				
					
						
							
							
								Georgi Gerganov 
							
						 
					 
					
						
						
							
						
						17bb928080 
					 
					
						
						
							
							readme : remove --memory-f32 references ( #9925 )  
						
						
						
						
					 
					
						2024-10-17 23:43:05 +03:00 
						 
				 
			
				
					
						
							
							
								Georgi Gerganov 
							
						 
					 
					
						
						
							
						
						0e41b300ed 
					 
					
						
						
							
							sync : ggml  
						
						
						
						
					 
					
						2024-10-16 11:28:14 +03:00 
						 
				 
			
				
					
						
							
							
								standby24x7 
							
						 
					 
					
						
						
							
						
						fa42aa6d89 
					 
					
						
						
							
							scripts : fix spelling typo in messages and comments ( #9782 )  
						
						... 
						
						
						
						Signed-off-by: Masanari Iida <standby24x7@gmail.com > 
						
						
					 
					
						2024-10-08 09:19:53 +03:00 
						 
				 
			
				
					
						
							
							
								Georgi Gerganov 
							
						 
					 
					
						
						
							
						
						b6d6c5289f 
					 
					
						
						
							
							sync : llama.cpp  
						
						
						
						
					 
					
						2024-10-06 12:53:28 +03:00