Wang Ran (汪然) 
							
						 
					 
					
						
						
							
						
						98036d5670 
					 
					
						
						
							
							fix typo of README.md ( #10605 )  
						
						
						
						
					 
					
						2024-12-04 02:22:50 +01:00 
						 
				 
			
				
					
						
							
							
								Georgi Gerganov 
							
						 
					 
					
						
						
							
						
						5e1ed95583 
					 
					
						
						
							
							grammars : add English-only grammar ( #10612 )  
						
						
						
						
					 
					
						2024-12-01 21:37:54 +02:00 
						 
				 
			
				
					
						
							
							
								Xuan Son Nguyen 
							
						 
					 
					
						
						
							
						
						a71d81cf8c 
					 
					
						
						
							
							server : revamp chat UI with vuejs and daisyui ( #10175 )  
						
						... 
						
						
						
						* server : simple chat UI with vuejs and daisyui
* move old files to legacy folder
* embed deps into binary
* basic markdown support
* add conversation history, save to localStorage
* fix bg-base classes
* save theme preferences
* fix tests
* regenerate, edit, copy buttons
* small fixes
* docs: how to use legacy ui
* better error handling
* make CORS preflight more explicit
* add GET method for CORS
* fix tests
* clean up a bit
* better auto scroll
* small fixes
* use collapse-arrow
* fix closeAndSaveConfigDialog
* small fix
* remove console.log
* fix style for <pre> element
* lighter bubble color (less distract when reading) 
						
						
					 
					
						2024-11-07 17:31:10 -04:00 
						 
				 
			
				
					
						
							
							
								Vinesh Janarthanan 
							
						 
					 
					
						
						
							
						
						8a308354f6 
					 
					
						
						
							
							server : match OAI structured output response ( #9527 )  
						
						
						
						
					 
					
						2024-09-18 09:50:34 +03:00 
						 
				 
			
				
					
						
							
							
								Pascal Patry 
							
						 
					 
					
						
						
							
						
						9379d3cc17 
					 
					
						
						
							
							readme : rename result_format to response_format ( #9300 )  
						
						
						
						
					 
					
						2024-09-04 09:45:40 +03:00 
						 
				 
			
				
					
						
							
							
								daghanerdonmez 
							
						 
					 
					
						
						
							
						
						9beb2dda03 
					 
					
						
						
							
							readme : fix typo [no ci] ( #8389 )  
						
						... 
						
						
						
						Bakus-Naur --> Backus-Naur 
						
						
					 
					
						2024-07-09 09:16:00 +03:00 
						 
				 
			
				
					
						
							
							
								Olivier Chafik 
							
						 
					 
					
						
						
							
						
						139cc621e9 
					 
					
						
						
							
							json: restore default additionalProperties to false, fix some pattern escapes (#8180 )  
						
						... 
						
						
						
						* json: expand ESCAPED_IN_REGEXPS_BUT_NOT_IN_LITERALS charset
* json: revert default of additionalProperties to false
* Update README.md 
						
						
					 
					
						2024-06-28 09:26:45 +01:00 
						 
				 
			
				
					
						
							
							
								Olivier Chafik 
							
						 
					 
					
						
						
							
						
						cb0b06a8a6 
					 
					
						
						
							
							json: update grammars/README w/ examples & note about additionalProperties (#8132 )  
						
						... 
						
						
						
						* json: update grammars/README
* mention broken prefixItems
* add mention to llama-gbnf-validator
* json: explicit type: object for nested items object in cli example 
						
						
					 
					
						2024-06-27 22:08:42 +01:00 
						 
				 
			
				
					
						
							
							
								Olivier Chafik 
							
						 
					 
					
						
						
							
						
						1c641e6aac 
					 
					
						
						
							
							build: rename main → llama-cli, server → llama-server, llava-cli → llama-llava-cli, etc... (#7809 )  
						
						... 
						
						
						
						* `main`/`server`: rename to `llama` / `llama-server` for consistency w/ homebrew
* server: update refs -> llama-server
gitignore llama-server
* server: simplify nix package
* main: update refs -> llama
fix examples/main ref
* main/server: fix targets
* update more names
* Update build.yml
* rm accidentally checked in bins
* update straggling refs
* Update .gitignore
* Update server-llm.sh
* main: target name -> llama-cli
* Prefix all example bins w/ llama-
* fix main refs
* rename {main->llama}-cmake-pkg binary
* prefix more cmake targets w/ llama-
* add/fix gbnf-validator subfolder to cmake
* sort cmake example subdirs
* rm bin files
* fix llama-lookup-* Makefile rules
* gitignore /llama-*
* rename Dockerfiles
* rename llama|main -> llama-cli; consistent RPM bin prefixes
* fix some missing -cli suffixes
* rename dockerfile w/ llama-cli
* rename(make): llama-baby-llama
* update dockerfile refs
* more llama-cli(.exe)
* fix test-eval-callback
* rename: llama-cli-cmake-pkg(.exe)
* address gbnf-validator unused fread warning (switched to C++ / ifstream)
* add two missing llama- prefixes
* Updating docs for eval-callback binary to use new `llama-` prefix.
* Updating a few lingering doc references for rename of main to llama-cli
* Updating `run-with-preset.py` to use new binary names.
Updating docs around `perplexity` binary rename.
* Updating documentation references for lookup-merge and export-lora
* Updating two small `main` references missed earlier in the finetune docs.
* Update apps.nix
* update grammar/README.md w/ new llama-* names
* update llama-rpc-server bin name + doc
* Revert "update llama-rpc-server bin name + doc"
This reverts commit e474ef1df4hanclinto@gmail.com > 
						
						
					 
					
						2024-06-13 00:41:52 +01:00 
						 
				 
			
				
					
						
							
							
								Olivier Chafik 
							
						 
					 
					
						
						
							
						
						b61eb9644d 
					 
					
						
						
							
							json: refine constraint for whitespace to avoid runaways yet allow pretty print ( #7866 )  
						
						
						
						
					 
					
						2024-06-11 02:22:57 +01:00 
						 
				 
			
				
					
						
							
							
								Olivier Chafik 
							
						 
					 
					
						
						
							
						
						396b18dfec 
					 
					
						
						
							
							json: document schema conversion in GBNF readme, align manual grammar examples & converters (#7841 )  
						
						... 
						
						
						
						* json: fix char pattern in grammar converters
* json: prevent number precision & whitespace runaways in example grammars
* json: add doc to grammar readme 
						
						
					 
					
						2024-06-11 01:00:30 +01:00 
						 
				 
			
				
					
						
							
							
								Olivier Chafik 
							
						 
					 
					
						
						
							
						
						55b2d0849d 
					 
					
						
						
							
							grammars: x{min,max} repetition operator ( #6640 )  
						
						... 
						
						
						
						* grammars: x{min,max} repetition operator + tweak +/*/? to avoid duplication of original over alternates
* grammars: handle `x{n}` and fix `x{n,n}`
* grammars: document new repetition operators
* grammars: uniform use of int for min & max
* grammars: refactor parser test
* grammar: parsing tests w/ natural pretty print of updated expectations
* grammars: much prettier print of expectations (+ TEST_GRAMMAR_PARSER_PRINT_ALL=1 to force all)
* grammars: improve test pretty print again
* grammars: pretty print rules and chars
* grammars: fix copy rule skipping
* grammars: disallow `a{,}` (not allowed in regexps)
* Update common/grammar-parser.cpp
Co-authored-by: Clint Herron <hanclinto@gmail.com >
* grammars: fix copy rule skipping (again) & display of expectations
* grammars: more test cases
* grammars: update reps parsing to bring ? / * / + closer to before
* json: use new GBNF repetitions{m,n} syntax
* grammars: update performance gotchas w/ repetition advice
* Update examples/json_schema_to_grammar.py
Co-authored-by: Clint Herron <hanclinto@gmail.com >
* Update examples/server/public/json-schema-to-grammar.mjs
Co-authored-by: Clint Herron <hanclinto@gmail.com >
* grammars: comment on rule repetitions
* grammars: ensure unambiguous number alternatives
* grammar: nit typo switched error msgs
* grammar: nit numbering in comment
* json: update numeric rule to be unambiguous
* Apply suggestions from code review
Co-authored-by: Clint Herron <hanclinto@gmail.com >
* Update examples/server/public/json-schema-to-grammar.mjs
Co-authored-by: Clint Herron <hanclinto@gmail.com >
* json: fix integral-part
* grammar: add repetition tests
---------
Co-authored-by: Clint Herron <hanclinto@gmail.com > 
						
						
					 
					
						2024-06-06 10:07:06 +01:00 
						 
				 
			
				
					
						
							
							
								omahs 
							
						 
					 
					
						
						
							
						
						04976db7a8 
					 
					
						
						
							
							docs: fix typos ( #7124 )  
						
						... 
						
						
						
						* fix typo
* fix typos
* fix typo
* fix typos
* fix typo
* fix typos 
						
						
					 
					
						2024-05-07 18:20:33 +03:00 
						 
				 
			
				
					
						
							
							
								Olivier Chafik 
							
						 
					 
					
						
						
							
						
						ab9a3240a9 
					 
					
						
						
							
							JSON schema conversion:  ⚡ ️ faster repetitions, min/maxLength for strings, cap number length ( #6555 )  
						
						... 
						
						
						
						* json: rename python schema converter to make import easier
* server: skip null json_schema / grammar fields
* json: deps management for primitive rules (+ allow null values)
* json: optimize repetitions for minItems/maxItems and regexps: `a{,3}` goes from `"a"? "a"? "a"?` (explosive combos) to `(a (a (a)?)?)?`
* grammars: add troubleshooting section to readme
* json: cap length of numbers to 15 digits before/after decimal point
(avoids infinite gen, e.g. "one third" -> `0.333333333333...`)
* json: unify all repetition code (w/ or w/o sep)
* json: support string minLength/maxLength
* server+json: update server/README w/ result_format
* nits
* json: fix type error w/ python 3.8
* json: fix server/README (json_schema in /completion vs. result_format in /v1/chat/completions)
* json: simplify DOT `{"type": "string", "pattern": "^.$"}`
* json: remove recursion in opt_repetitions (avoids Python stack overflow)
* json: rm dead code
* json: rm useless assert & ggml.h import 
						
						
					 
					
						2024-04-12 19:43:38 +01:00 
						 
				 
			
				
					
						
							
							
								ExtReMLapin 
							
						 
					 
					
						
						
							
						
						3de31677d3 
					 
					
						
						
							
							grammars : blacklists character control set ( #5888 )  
						
						... 
						
						
						
						* Prevent control characters from being served in json string
* Prevent control characters from being served in json string (array) 
						
						
					 
					
						2024-03-05 18:33:08 +02:00 
						 
				 
			
				
					
						
							
							
								Georgi Gerganov 
							
						 
					 
					
						
						
							
						
						82cb31eb93 
					 
					
						
						
							
							Revert "grammars : don't allow to output unescaped new line in string ( #5885 )"  
						
						... 
						
						
						
						This reverts commit b1a4e994fd 
						
						
					 
					
						2024-03-05 15:56:24 +02:00 
						 
				 
			
				
					
						
							
							
								ExtReMLapin 
							
						 
					 
					
						
						
							
						
						b1a4e994fd 
					 
					
						
						
							
							grammars : don't allow to output unescaped new line in string ( #5885 )  
						
						... 
						
						
						
						* Don't allow grammar json array to output unescaped new line in string
* Don't allow new line in json object string 
						
						
					 
					
						2024-03-05 15:44:29 +02:00 
						 
				 
			
				
					
						
							
							
								Richard Kiss 
							
						 
					 
					
						
						
							
						
						532dd74e38 
					 
					
						
						
							
							Fix some documentation typos/grammar mistakes ( #4032 )  
						
						... 
						
						
						
						* typos
* Update examples/parallel/README.md
Co-authored-by: Kerfuffle <44031344+KerfuffleV2@users.noreply.github.com >
---------
Co-authored-by: Kerfuffle <44031344+KerfuffleV2@users.noreply.github.com > 
						
						
					 
					
						2023-11-11 23:04:58 -07:00 
						 
				 
			
				
					
						
							
							
								Georgi Gerganov 
							
						 
					 
					
						
						
							
						
						921772104b 
					 
					
						
						
							
							speculative : add grammar support ( #2991 )  
						
						... 
						
						
						
						* speculative : add grammar support
* grammars : add json_arr.gbnf
* grammar : add comments to new grammar file
* grammar : remove one nested level
* common : warm-up with 2 tokens - seems to work better
* speculative : print draft token pieces
* speculative : reuse grammar parser + better logs and comments
* speculative : avoid grammar_mem
* make : fix speculative build 
						
						
					 
					
						2023-09-05 08:46:17 +03:00 
						 
				 
			
				
					
						
							
							
								Ben Siraphob 
							
						 
					 
					
						
						
							
						
						d8d6977f48 
					 
					
						
						
							
							examples : add C grammar ( #2357 )  
						
						
						
						
					 
					
						2023-09-01 16:32:14 +03:00 
						 
				 
			
				
					
						
							
							
								Evan Jones 
							
						 
					 
					
						
						
							
						
						f5fe98d11b 
					 
					
						
						
							
							docs : add grammar docs ( #2701 )  
						
						... 
						
						
						
						* docs : add grammar docs
* tweaks to grammar guide
* rework GBNF example to be a commented grammar 
						
						
					 
					
						2023-08-22 21:01:57 -04:00 
						 
				 
			
				
					
						
							
							
								Evan Jones 
							
						 
					 
					
						
						
							
						
						8183159cf3 
					 
					
						
						
							
							examples : generate JSON according to schema ( #1887 )  
						
						... 
						
						
						
						* examples : add JSON schema grammars
* complete JSON grammar
* ensure primitive types can be used as root of schema
* support integer type and adjust usage text 
						
						
					 
					
						2023-08-02 22:05:44 -04:00 
						 
				 
			
				
					
						
							
							
								Evan Jones 
							
						 
					 
					
						
						
							
						
						84e09a7d8b 
					 
					
						
						
							
							llama : add grammar-based sampling ( #1773 )  
						
						... 
						
						
						
						* llama, main : constrain sampling to grammar
* allow loading grammar from file
* fix whitespace errors
* handle & print parser errors
* add comments to grammar syntax and allow newlines where unambiguous
* add missing include
* support alternates in root rule
* fix bugs with empty token and EOS
* adjust JSON grammar
* remove swp file
* rewrite ternary expressions
Co-authored-by: Henri Vasserman <henv@hot.ee >
* use struct for grammar elements and add Unicode support
* add unicode escapes
* add inverse char ranges
* only sample full tokens (no peeking or truncation)
* llama : minor style changes
blindly applied in online editor - hopefully I didn't break something
* update help text
* add warning message if EOS is disabled
---------
Co-authored-by: Henri Vasserman <henv@hot.ee >
Co-authored-by: Georgi Gerganov <ggerganov@gmail.com > 
						
						
					 
					
						2023-07-23 23:58:10 -04:00