| 
							
							
								 Georgi Gerganov | afa8a9ec9b | llama : add llama_vocab, functions -> methods, naming (#11110)* llama : functions -> methods (#11110)
* llama : add struct llama_vocab to the API (#11156)
ggml-ci
* hparams : move vocab params to llama_vocab (#11159)
ggml-ci
* vocab : more pimpl (#11165)
ggml-ci
* vocab : minor tokenization optimizations (#11160)
ggml-ci
Co-authored-by: Diego Devesa <slarengh@gmail.com>
* lora : update API names (#11167)
ggml-ci
* llama : update API names to use correct prefix (#11174)
* llama : update API names to use correct prefix
ggml-ci
* cont
ggml-ci
* cont
ggml-ci
* minor [no ci]
* vocab : llama_vocab_add_[be]os -> llama_vocab_get_add_[be]os (#11174)
ggml-ci
* vocab : llama_vocab_n_vocab -> llama_vocab_n_tokens (#11174)
ggml-ci
---------
Co-authored-by: Diego Devesa <slarengh@gmail.com> | 2025-01-12 11:32:42 +02:00 |  | 
			
				
					| 
							
							
								 Georgi Gerganov | c2a16c0bdb | server : fix free of spec context and batch (#10651) ggml-ci | 2024-12-07 11:52:44 +02:00 |  | 
			
				
					| 
							
							
								 Georgi Gerganov | 9fd8c2687f | server : add more information about error (#10455) | 2024-11-25 22:28:59 +02:00 |  | 
			
				
					| 
							
							
								 Georgi Gerganov | d9d54e498d | speculative : refactor and add a simpler example (#10362) * speculative : refactor and add a simpler example
ggml-ci
* speculative : clean-up and add comments and TODOs [no ci]
* speculative : manage context in common_speculative
ggml-ci
* speculative : simplify
ggml-ci
* speculative : simplify (cont)
ggml-ci
* speculative : add --draft-min CLI arg
* speculative : minor fixup
* make : build fixes
* speculative : do not redraft previous drafts
ggml-ci
* speculative : fix the draft sampling
ggml-ci
* speculative : fix compile warning
* common : refactor args
ggml-ci
* common : change defaults [no ci]
* common : final touches
ggml-ci | 2024-11-25 09:58:41 +02:00 |  |