mirror of
				https://github.com/ggml-org/llama.cpp.git
				synced 2025-11-03 09:22:01 +00:00 
			
		
		
		
	* llama, main : constrain sampling to grammar * allow loading grammar from file * fix whitespace errors * handle & print parser errors * add comments to grammar syntax and allow newlines where unambiguous * add missing include * support alternates in root rule * fix bugs with empty token and EOS * adjust JSON grammar * remove swp file * rewrite ternary expressions Co-authored-by: Henri Vasserman <henv@hot.ee> * use struct for grammar elements and add Unicode support * add unicode escapes * add inverse char ranges * only sample full tokens (no peeking or truncation) * llama : minor style changes blindly applied in online editor - hopefully I didn't break something * update help text * add warning message if EOS is disabled --------- Co-authored-by: Henri Vasserman <henv@hot.ee> Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>
		
			
				
	
	
		
			30 lines
		
	
	
		
			665 B
		
	
	
	
		
			Plaintext
		
	
	
	
	
	
			
		
		
	
	
			30 lines
		
	
	
		
			665 B
		
	
	
	
		
			Plaintext
		
	
	
	
	
	
# Grammar for subset of JSON - doesn't support full string or number syntax
 | 
						|
 | 
						|
root  ::= object
 | 
						|
value ::= object | array | string | number | boolean | "null"
 | 
						|
 | 
						|
object ::=
 | 
						|
  "{" ws (
 | 
						|
            string ":" ws value
 | 
						|
    ("," ws string ":" ws value)*
 | 
						|
  )? "}"
 | 
						|
 | 
						|
array  ::=
 | 
						|
  "[" ws (
 | 
						|
            value
 | 
						|
    ("," ws value)*
 | 
						|
  )? "]"
 | 
						|
 | 
						|
string  ::=
 | 
						|
  "\"" (
 | 
						|
    [^"\\] |
 | 
						|
    "\\" (["\\/bfnrt] | "u" [0-9a-fA-F] [0-9a-fA-F] [0-9a-fA-F] [0-9a-fA-F]) # escapes
 | 
						|
  )* "\"" ws
 | 
						|
 | 
						|
# Only plain integers currently
 | 
						|
number  ::= "-"? [0-9]+ ws
 | 
						|
boolean ::= ("true" | "false") ws
 | 
						|
 | 
						|
# Optional space: by convention, applied in this grammar after literal chars when allowed
 | 
						|
ws ::= ([ \t\n] ws)?
 |