mirror of
				https://github.com/ggml-org/llama.cpp.git
				synced 2025-10-31 08:51:55 +00:00 
			
		
		
		
	 807b0c49ff
			
		
	
	807b0c49ff
	
	
	
		
			
			* llama : add inference support and model types for T5 and FLAN-T5 model families * llama : add new API functions to support encoder-decoder models: llama_encode(), llama_model_has_encoder(), llama_model_decoder_start_token() * common, llama-cli, llama-batched : add support for encoder-decoder models * convert-hf : handle shared token embeddings tensors in T5Model * convert-hf : add support for SentencePiece BPE tokenizer in T5Model (for Pile-T5 models) * convert-hf : add MT5ForConditionalGeneration and UMT5ForConditionalGeneration to architectures supported by T5Model * convert : add t5 tokenizer tests, use "slow" HF tokenizer for t5 --------- Co-authored-by: Stanisław Szymczyk <sszymczy@gmail.com> Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>
		
			
				
	
	
		
			113 lines
		
	
	
		
			1.9 KiB
		
	
	
	
		
			Plaintext
		
	
	
	
	
	
			
		
		
	
	
			113 lines
		
	
	
		
			1.9 KiB
		
	
	
	
		
			Plaintext
		
	
	
	
	
	
| ied 4 ½ months
 | ||
| __ggml_vocab_test__
 | ||
| Führer
 | ||
| __ggml_vocab_test__
 | ||
| 
 | ||
| __ggml_vocab_test__
 | ||
|  
 | ||
| __ggml_vocab_test__
 | ||
|   
 | ||
| __ggml_vocab_test__
 | ||
|    
 | ||
| __ggml_vocab_test__
 | ||
| 	
 | ||
| __ggml_vocab_test__
 | ||
| 
 | ||
| 
 | ||
| __ggml_vocab_test__
 | ||
| 
 | ||
| 
 | ||
| 
 | ||
| __ggml_vocab_test__
 | ||
| 
 | ||
| 
 | ||
| 
 | ||
| 
 | ||
| __ggml_vocab_test__
 | ||
| 	
 | ||
| 
 | ||
| __ggml_vocab_test__
 | ||
| Hello world
 | ||
| __ggml_vocab_test__
 | ||
|  Hello world
 | ||
| __ggml_vocab_test__
 | ||
| Hello World
 | ||
| __ggml_vocab_test__
 | ||
|  Hello World
 | ||
| __ggml_vocab_test__
 | ||
|  Hello World!
 | ||
| __ggml_vocab_test__
 | ||
| Hello, world!
 | ||
| __ggml_vocab_test__
 | ||
|  Hello, world!
 | ||
| __ggml_vocab_test__
 | ||
|  this is 🦙.cpp
 | ||
| __ggml_vocab_test__
 | ||
| w048 7tuijk dsdfhu
 | ||
| __ggml_vocab_test__
 | ||
| нещо на Български
 | ||
| __ggml_vocab_test__
 | ||
| កាន់តែពិសេសអាចខលចេញ
 | ||
| __ggml_vocab_test__
 | ||
| 🚀 (normal) 😶🌫️ (multiple emojis concatenated) ✅ (only emoji that has its own token)
 | ||
| __ggml_vocab_test__
 | ||
| Hello
 | ||
| __ggml_vocab_test__
 | ||
|  Hello
 | ||
| __ggml_vocab_test__
 | ||
|   Hello
 | ||
| __ggml_vocab_test__
 | ||
|    Hello
 | ||
| __ggml_vocab_test__
 | ||
|     Hello
 | ||
| __ggml_vocab_test__
 | ||
|     Hello
 | ||
|     Hello
 | ||
| __ggml_vocab_test__
 | ||
|  (
 | ||
| __ggml_vocab_test__
 | ||
| 
 | ||
|  =
 | ||
| __ggml_vocab_test__
 | ||
| ' era
 | ||
| __ggml_vocab_test__
 | ||
| Hello, y'all! How are you 😁 ?我想在apple工作1314151天~
 | ||
| __ggml_vocab_test__
 | ||
| !!!!!!
 | ||
| __ggml_vocab_test__
 | ||
| 3
 | ||
| __ggml_vocab_test__
 | ||
| 33
 | ||
| __ggml_vocab_test__
 | ||
| 333
 | ||
| __ggml_vocab_test__
 | ||
| 3333
 | ||
| __ggml_vocab_test__
 | ||
| 33333
 | ||
| __ggml_vocab_test__
 | ||
| 333333
 | ||
| __ggml_vocab_test__
 | ||
| 3333333
 | ||
| __ggml_vocab_test__
 | ||
| 33333333
 | ||
| __ggml_vocab_test__
 | ||
| 333333333
 | ||
| __ggml_vocab_test__
 | ||
| Cửa Việt
 | ||
| __ggml_vocab_test__
 | ||
|  discards
 | ||
| __ggml_vocab_test__
 | ||
| 
 | ||
|  
 | ||
| 
 | ||
|  
 | ||
| 
 | ||
| 
 | ||
|  	 		 	
 | ||
|   
 | ||
|    
 | ||
|     
 | ||
|      
 | ||
| 🚀 (normal) 😶🌫️ (multiple emojis concatenated) ✅ 🦙🦙 3 33 333 3333 33333 333333 3333333 33333333 3.3 3..3 3...3 កាន់តែពិសេសអាច😁 ?我想在apple工作1314151天~ ------======= нещо на Български ''''''```````""""......!!!!!!?????? I've been 'told he's there, 'RE you sure? 'M not sure I'll make it, 'D you like some tea? We'Ve a'lL
 | ||
| __ggml_vocab_test__
 |