Ali Tariq 
							
						 
					 
					
						
						
							
						
						c2ab6fe661 
					 
					
						
						
							
							ci : Cloud-V for RISC-V builds ( #3160 )  
						
						... 
						
						
						
						* Added Cloud-V File
* Replaced Makefile with original one
---------
Co-authored-by: moiz.hussain <moiz.hussain@10xengineers.ai > 
						
						
					 
					
						2023-09-15 11:06:56 +03:00 
						 
				 
			
				
					
						
							
							
								hongbo.mo 
							
						 
					 
					
						
						
							
						
						a21baeb122 
					 
					
						
						
							
							docker : add git to full-cuda.Dockerfile main-cuda.Dockerfile ( #3044 )  
						
						
						
						
					 
					
						2023-09-08 13:57:55 +03:00 
						 
				 
			
				
					
						
							
							
								Henri Vasserman 
							
						 
					 
					
						
						
							
						
						71d6975559 
					 
					
						
						
							
							[Docker] fix tools.sh argument passing. ( #2884 )  
						
						... 
						
						
						
						* [Docker] fix tools.sh argument passing.
This should allow passing multiple arguments to containers with
the full image that are using the tools.sh frontend.
Fix from https://github.com/ggerganov/llama.cpp/issues/2535#issuecomment-1697091734  
						
						
					 
					
						2023-08-30 19:14:53 +03:00 
						 
				 
			
				
					
						
							
							
								JohnnyB 
							
						 
					 
					
						
						
							
						
						3e8ff47af6 
					 
					
						
						
							
							devops : added systemd units and set versioning to use date. ( #2835 )  
						
						... 
						
						
						
						* Corrections and systemd units
* Missing dependency clblast 
						
						
					 
					
						2023-08-28 09:31:24 +03:00 
						 
				 
			
				
					
						
							
							
								Henri Vasserman 
							
						 
					 
					
						
						
							
						
						6bbc598a63 
					 
					
						
						
							
							ROCm Port ( #1087 )  
						
						... 
						
						
						
						* use hipblas based on cublas
* Update Makefile for the Cuda kernels
* Expand arch list and make it overrideable
* Fix multi GPU on multiple amd architectures with rocblas_initialize() (#5 )
* add hipBLAS to README
* new build arg LLAMA_CUDA_MMQ_Y
* fix half2 decomposition
* Add intrinsics polyfills for AMD
* AMD assembly optimized __dp4a
* Allow overriding CC_TURING
* use "ROCm" instead of "CUDA"
* ignore all build dirs
* Add Dockerfiles
* fix llama-bench
* fix -nommq help for non CUDA/HIP
---------
Co-authored-by: YellowRoseCx <80486540+YellowRoseCx@users.noreply.github.com >
Co-authored-by: ardfork <134447697+ardfork@users.noreply.github.com >
Co-authored-by: funnbot <22226942+funnbot@users.noreply.github.com >
Co-authored-by: Engininja2 <139037756+Engininja2@users.noreply.github.com >
Co-authored-by: Kerfuffle <44031344+KerfuffleV2@users.noreply.github.com >
Co-authored-by: jammm <2500920+jammm@users.noreply.github.com >
Co-authored-by: jdecourval <7315817+jdecourval@users.noreply.github.com > 
						
						
					 
					
						2023-08-25 12:09:42 +03:00 
						 
				 
			
				
					
						
							
							
								JohnnyB 
							
						 
					 
					
						
						
							
						
						f19dca04ea 
					 
					
						
						
							
							devops : RPM Specs ( #2723 )  
						
						... 
						
						
						
						* Create llama-cpp.srpm
* Rename llama-cpp.srpm to llama-cpp.srpm.spec
Correcting extension.
* Tested spec success.
* Update llama-cpp.srpm.spec
* Create lamma-cpp-cublas.srpm.spec
* Create lamma-cpp-clblast.srpm.spec
* Update lamma-cpp-cublas.srpm.spec
Added BuildRequires
* Moved to devops dir 
						
						
					 
					
						2023-08-23 17:28:22 +03:00 
						 
				 
			
				
					
						
							
							
								Bodo Graumann 
							
						 
					 
					
						
						
							
						
						b782422a3e 
					 
					
						
						
							
							devops : add missing quotes to bash script ( #2193 )  
						
						... 
						
						
						
						This prevents accidentally expanding arguments that contain spaces. 
						
						
					 
					
						2023-07-13 16:49:14 +03:00 
						 
				 
			
				
					
						
							
							
								Jinwoo Jeong 
							
						 
					 
					
						
						
							
						
						3ec7e596b2 
					 
					
						
						
							
							docker : add '--server' option ( #2174 )  
						
						
						
						
					 
					
						2023-07-11 19:12:35 +03:00 
						 
				 
			
				
					
						
							
							
								dylan 
							
						 
					 
					
						
						
							
						
						84525e7962 
					 
					
						
						
							
							docker : add support for CUDA in docker ( #1461 )  
						
						... 
						
						
						
						Co-authored-by: canardleteer <eris.has.a.dad+github@gmail.com >
Co-authored-by: Georgi Gerganov <ggerganov@gmail.com > 
						
						
					 
					
						2023-07-07 21:25:25 +03:00 
						 
				 
			
				
					
						
							
							
								qingfengfenga 
							
						 
					 
					
						
						
							
						
						8fc8179919 
					 
					
						
						
							
							Add llama.cpp docker support for non-latin languages ( #1673 )  
						
						... 
						
						
						
						* Modify Dockerfile default character set to improve compatibility (#1673 ) 
						
						
					 
					
						2023-06-08 00:58:53 -07:00 
						 
				 
			
				
					
						
							
							
								Jiří Podivín 
							
						 
					 
					
						
						
							
						
						b5c85468a3 
					 
					
						
						
							
							Docker: change to calling convert.py ( #1641 )  
						
						... 
						
						
						
						Deprecation disclaimer was added to convert-pth-to-ggml.py 
						
						
					 
					
						2023-06-03 15:11:53 +03:00 
						 
				 
			
				
					
						
							
							
								Jiří Podivín 
							
						 
					 
					
						
						
							
						
						0e730dd23b 
					 
					
						
						
							
							Adding git in container package dependencies ( #1621 )  
						
						... 
						
						
						
						Git added to build packages for version information in docker image
Signed-off-by: Jiri Podivin <jpodivin@gmail.com > 
						
						
					 
					
						2023-05-28 21:45:50 -07:00 
						 
				 
			
				
					
						
							
							
								Pavol Rusnak 
							
						 
					 
					
						
						
							
						
						859fee6dfb 
					 
					
						
						
							
							quantize : use map to assign quantization type from string ( #1191 )  
						
						... 
						
						
						
						instead of `int` (while `int` option still being supported)
This allows the following usage:
`./quantize ggml-model-f16.bin ggml-model-q4_0.bin q4_0`
instead of:
`./quantize ggml-model-f16.bin ggml-model-q4_0.bin 2` 
						
						
					 
					
						2023-04-26 18:43:27 +02:00 
						 
				 
			
				
					
						
							
							
								Pavol Rusnak 
							
						 
					 
					
						
						
							
						
						a32f7acc9f 
					 
					
						
						
							
							py : cleanup dependencies ( #962 )  
						
						... 
						
						
						
						after #545  we do not need torch, tqdm and requests in the dependencies 
						
						
					 
					
						2023-04-14 15:37:11 +02:00 
						 
				 
			
				
					
						
							
							
								Pavol Rusnak 
							
						 
					 
					
						
						
							
						
						8b679987cd 
					 
					
						
						
							
							Fix whitespace, add .editorconfig, add GitHub workflow ( #883 )  
						
						
						
						
					 
					
						2023-04-11 19:45:44 +00:00 
						 
				 
			
				
					
						
							
							
								bsilvereagle 
							
						 
					 
					
						
						
							
						
						a0c0516416 
					 
					
						
						
							
							Remove torch GPU dependencies from the Docker.full image ( #665 )  
						
						... 
						
						
						
						By using `pip install torch --index-url https://download.pytorch.org/whl/cpu `
instead of `pip install torch` we can specify we want to install a CPU-only version
of PyTorch without any GPU dependencies. This reduces the size of the Docker image
from 7.32 GB to 1.62 GB 
						
						
					 
					
						2023-04-03 00:13:03 +02:00 
						 
				 
			
				
					
						
							
							
								Georgi Gerganov 
							
						 
					 
					
						
						
							
						
						4cc053b6d5 
					 
					
						
						
							
							Remove oboslete command from Docker script  
						
						
						
						
					 
					
						2023-03-23 22:39:44 +02:00 
						 
				 
			
				
					
						
							
							
								Stephan Walter 
							
						 
					 
					
						
						
							
						
						5cb63e2493 
					 
					
						
						
							
							Add tqdm to Python requirements ( #293 )  
						
						... 
						
						
						
						* Add tqdm to Python requirements
* Remove torchvision torchaudio, add requests 
						
						
					 
					
						2023-03-20 09:24:11 +01:00 
						 
				 
			
				
					
						
							
							
								Stephan Walter 
							
						 
					 
					
						
						
							
						
						367946c668 
					 
					
						
						
							
							Don't tell users to use a bad number of threads ( #243 )  
						
						... 
						
						
						
						The readme tells people to use the command line option "-t 8", causing 8
threads to be started. On systems with fewer than 8 cores, this causes a
significant slowdown. Remove the option from the example command lines
and use /proc/cpuinfo on Linux to determine a sensible default. 
						
						
					 
					
						2023-03-17 19:47:35 +02:00 
						 
				 
			
				
					
						
							
							
								Bernat Vadell 
							
						 
					 
					
						
						
							
						
						2af23d3043 
					 
					
						
						
							
							🚀  Dockerize llamacpp ( #132 )  
						
						... 
						
						
						
						* feat: dockerize llamacpp
* feat: split build & runtime stages
* split dockerfile into main & tools
* add quantize into tool docker image
* Update .devops/tools.sh
Co-authored-by: Georgi Gerganov <ggerganov@gmail.com >
* add docker action pipeline
* change CI to publish at github docker registry
* fix name runs-on macOS-latest is macos-latest (lowercase)
* include docker versioned images
* fix github action docker
* fix docker.yml
* feat: include all-in-one command tool & update readme.md
---------
Co-authored-by: Georgi Gerganov <ggerganov@gmail.com > 
						
						
					 
					
						2023-03-17 10:47:06 +01:00