llama.cpp

mirror of https://github.com/ggml-org/llama.cpp.git synced 2025-11-02 09:12:03 +00:00

Author	SHA1	Message	Date
Aaron Teo	a1912c7fa9	devops: fix copying process Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>	2025-09-09 18:07:59 +08:00
Aaron Teo	03e642a9d1	devops: attempt at making it cache the build Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>	2025-09-09 18:05:43 +08:00
Aaron Teo	0084c88929	devops: attempt at fixing missing dir Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>	2025-09-09 17:52:43 +08:00
Aaron Teo	73679520ce	devops: formalise llama.cpp loc Signed-off-by: Aaron Teo <aaron.teo1@ibm.com> (cherry picked from commit `0a7664af84`) Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>	2025-09-09 17:51:20 +08:00
Aaron Teo	bff187d717	Revert "devops: formalise llama.cpp loc" This reverts commit `0a7664af84`. Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>	2025-09-09 17:47:02 +08:00
Aaron Teo	0a7664af84	devops: formalise llama.cpp loc Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>	2025-09-09 17:40:27 +08:00
Aaron Teo	244d6cf56f	devops: update debian target Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>	2025-09-09 17:29:00 +08:00
Aaron Teo	17a9985086	devops: fix missing shared libraries in base Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>	2025-09-09 17:24:23 +08:00
Aaron Teo	489e0ab54f	devops: fix typos Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>	2025-09-09 17:19:30 +08:00
Aaron Teo	a0b22c8a29	devops: add cli target Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>	2025-09-09 17:14:33 +08:00
Aaron Teo	f6baab6be8	devops: finalise hardened server stage Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>	2025-09-09 16:59:53 +08:00
Aaron Teo	10714efb6d	devops: move libggml-cpu and blas into bin Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>	2025-09-09 16:54:06 +08:00
Aaron Teo	ab79c0bb80	devops: remove move shared objects Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>	2025-09-09 16:45:17 +08:00
Aaron Teo	944ef7f0bc	devops: fix missing ggml shared object failure to load model Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>	2025-09-09 16:38:05 +08:00
Aaron Teo	b23e72e1d0	devops: attempt at fixing model loading failure Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>	2025-09-09 16:19:35 +08:00
Aaron Teo	451aceb9a0	devops: fix unknown model loading failures Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>	2025-09-09 16:16:49 +08:00
Aaron Teo	c3ab7855fd	devops: fix permission issue Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>	2025-09-09 15:43:59 +08:00
Aaron Teo	7027c14d3c	devops: fix missing stage ref Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>	2025-09-09 15:35:29 +08:00
Aaron Teo	74767bbc16	devops: add collector stage Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>	2025-09-09 15:34:47 +08:00
Aaron Teo	3a09c656a7	devops: fix shared libs Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>	2025-09-09 15:25:01 +08:00
Aaron Teo	28b41f73ed	devops: use correct libs path Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>	2025-09-09 02:59:06 +08:00
Aaron Teo	2ff6694a0f	devops: fix shared libs in distroless Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>	2025-09-08 18:31:58 +08:00
Aaron Teo	a070157511	devops: remove apt commands from distroless Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>	2025-09-08 18:16:32 +08:00
Aaron Teo	23d34f9a98	devops: remove apt clean steps as distroless misses it Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>	2025-09-08 17:57:48 +08:00
Aaron Teo	e172b00445	devops: add server build step Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>	2025-09-08 17:50:10 +08:00
Aaron Teo	e53e1c450c	devops: copy more tools Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>	2025-09-08 15:36:41 +08:00
Aaron Teo	ce7bd1955d	devops: rework s390x docker Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>	2025-09-08 15:19:41 +08:00
Aaron Teo	955c426620	devops: move s390x docker into cpu docker Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>	2025-09-08 14:56:07 +08:00
Aaron Teo	75846921d8	devops: add missing ninja Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>	2025-09-08 14:03:38 +08:00
Aaron Teo	bdcbcaeead	devops: add s390x dockerfile Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>	2025-09-08 13:59:54 +08:00
R0CKSTAR	b55f06e1aa	vulkan.Dockerfile: install vulkan SDK using tarball (#15282 ) Signed-off-by: Xiaodong Ye <xiaodong.ye@mthreads.com>	2025-08-23 08:58:57 +02:00
Dobri Danchev	618575c582	Fix broken build: require updated pip to support --break-system-packages (#15357 ) * Revert "devops : fix compile bug when the BASE_CUDA_DEV_CONTAINER is based on Ubuntu 24.04 (#15005)" This reverts commit `e4e915912c`. * devops: Allow pip to modify externally-managed python environment (system installation) - Updated pip install commands to include the --break-system-packages flag, ensuring compatibility when working with system-managed Python environments (PEP 668). - Note: The --break-system-packages option was introduced in 2023. Ensure pip is updated to a recent version before using this flag. fixes [#15004](https://github.com/danchev/llama.cpp/issues/15004)	2025-08-18 12:50:48 +02:00
simevo	e4e915912c	devops : fix compile bug when the BASE_CUDA_DEV_CONTAINER is based on Ubuntu 24.04 (#15005 ) fixes #15004 Co-authored-by: Paolo Greppi <paolo.greppi@libpf.com>	2025-08-14 18:45:27 +03:00
Christian Kastner	646944cfa8	docker : Enable GGML_CPU_ALL_VARIANTS for ARM (#15267 )	2025-08-14 16:22:58 +02:00
Ali Tariq	648ebcdb73	ci : Added CI with RISC-V RVV1.0 Hardware (#14439 ) * Changed the CI file to hw * Changed the CI file to hw * Added to sudoers for apt * Removed the clone command and used checkout * Added libcurl * Added gcc-14 * Checking gcc --version * added gcc-14 symlink * added CC and C++ variables * Added the gguf weight * Changed the weights path * Added system specification * Removed white spaces * ci: Replace Jenkins riscv native build Cloud-V pipeline with GitHub Actions workflow Removed the legacy .devops/cloud-v-pipeline Jenkins CI configuration and introduced .github/workflows/build-riscv-native.yml for native RISC-V builds using GitHub Actions. * removed trailing whitespaces --------- Co-authored-by: Akif Ejaz <akifejaz40@gmail.com>	2025-08-13 13:14:44 +03:00
diannao	2860d479b4	docker : add cann build pipline (#14591 ) * docker: add cann build pipline * docker: add cann build pipline * docker: fix cann devops * cann : fix multi card hccl * Update ggml/src/ggml-cann/ggml-cann.cpp Co-authored-by: Xuan-Son Nguyen <thichthat@gmail.com> * Update ggml-cann.cpp --------- Co-authored-by: Georgi Gerganov <ggerganov@gmail.com> Co-authored-by: Xuan-Son Nguyen <thichthat@gmail.com>	2025-08-01 10:02:34 +08:00
deepsek	66906cd82a	HIP: Enable Matrix cores for MMQ Kernels, Enable stream-K for CDNA 3 (#14624 ) This commit adds support for MFMA instructions to MMQ. CDNA1/GFX908 CDNA2/GFX90a and CDNA3/GFX942 are supported by the MFMA-enabled code path added by this commit. The code path and stream-k is only enabled on CDNA3 for now as it fails to outperform blas in all cases on the other devices. Blas is currently only consistently outperformed on CDNA3 due to issues in the amd-provided blas libraries. This commit also improves the awareness of MMQ towards different warp sizes and as a side effect improves the performance of all quant formats besides q4_0 and q4_1, which regress slightly, on GCN gpus.	2025-07-27 00:28:14 +02:00
R0CKSTAR	3f4fc97f1d	musa: upgrade musa sdk to rc4.2.0 (#14498 ) * musa: apply mublas API changes Signed-off-by: Xiaodong Ye <xiaodong.ye@mthreads.com> * musa: update musa version to 4.2.0 Signed-off-by: Xiaodong Ye <xiaodong.ye@mthreads.com> * musa: restore MUSA graph settings in CMakeLists.txt Signed-off-by: Xiaodong Ye <xiaodong.ye@mthreads.com> * musa: disable mudnnMemcpyAsync by default Signed-off-by: Xiaodong Ye <xiaodong.ye@mthreads.com> * musa: switch back to non-mudnn images Signed-off-by: Xiaodong Ye <xiaodong.ye@mthreads.com> * minor changes Signed-off-by: Xiaodong Ye <xiaodong.ye@mthreads.com> * musa: restore rc in docker image tag Signed-off-by: Xiaodong Ye <xiaodong.ye@mthreads.com> --------- Signed-off-by: Xiaodong Ye <xiaodong.ye@mthreads.com>	2025-07-24 20:05:37 +01:00
Wroclaw	760b4484e3	nix : use optionalAttrs for env mkDerivation attrset argument (#14726 )	2025-07-17 15:18:16 -07:00
Vedran Miletić	e9b6350e61	scripts : make the shell scripts cross-platform (#14341 )	2025-06-30 10:17:18 +02:00
Svetlozar Georgiev	40643edb86	sycl: fix docker image (#14144 )	2025-06-13 18:32:56 +02:00
R0CKSTAR	33983057d0	musa: Upgrade MUSA SDK version to rc4.0.1 and use mudnn::Unary::IDENTITY op to accelerate D2D memory copy (#13647 ) * musa: fix build warning (unused parameter) Signed-off-by: Xiaodong Ye <xiaodong.ye@mthreads.com> * musa: upgrade MUSA SDK version to rc4.0.1 Signed-off-by: Xiaodong Ye <xiaodong.ye@mthreads.com> * musa: use mudnn::Unary::IDENTITY op to accelerate D2D memory copy Signed-off-by: Xiaodong Ye <xiaodong.ye@mthreads.com> * Update ggml/src/ggml-cuda/cpy.cu Co-authored-by: Johannes Gäßler <johannesg@5d6.de> * musa: remove MUDNN_CHECK_GEN and use CUDA_CHECK_GEN instead in MUDNN_CHECK Signed-off-by: Xiaodong Ye <xiaodong.ye@mthreads.com> --------- Signed-off-by: Xiaodong Ye <xiaodong.ye@mthreads.com> Co-authored-by: Johannes Gäßler <johannesg@5d6.de>	2025-05-21 09:58:49 +08:00
Alberto Cabrera Pérez	f71f40a284	ci : upgraded oneAPI version in SYCL workflows and dockerfile (#13532 )	2025-05-19 11:46:09 +01:00
Xuan-Son Nguyen	da84c04d8f	docker : do not build tests (#13204 ) * docker : do not build tests * include "ggml-cpu.h"	2025-04-30 10:44:07 +02:00
Rudi Servo	b0091ecc1e	docker : added all CPU to GPU images (#12749 )	2025-04-10 01:17:12 +02:00
Chenguang Li	6e1c4cebdb	CANN: Support Opt CONV_TRANSPOSE_1D and ELU (#12786 ) * [CANN] Support ELU and CONV_TRANSPOSE_1D * [CANN]Modification review comments * [CANN]Modification review comments * [CANN]name adjustment * [CANN]remove lambda used in template * [CANN]Use std::func instead of template * [CANN]Modify the code according to the review comments --------- Signed-off-by: noemotiovon <noemotiovon@gmail.com>	2025-04-09 14:04:14 +08:00
Xuan-Son Nguyen	bd3f59f812	cmake : enable curl by default (#12761 ) * cmake : enable curl by default * no curl if no examples * fix build * fix build-linux-cross * add windows-setup-curl * fix * shell * fix path * fix windows-latest-cmake* * run: include_directories * LLAMA_RUN_EXTRA_LIBS * sycl: no llama_curl * no test-arg-parser on windows * clarification * try riscv64 / arm64 * windows: include libcurl inside release binary * add msg * fix mac / ios / android build * will this fix xcode? * try clearing the cache * add bunch of licenses * revert clear cache * fix xcode * fix xcode (2) * fix typo	2025-04-07 13:35:19 +02:00
Georgi Gerganov	68ff663a04	repo : update links to new url (#11886 ) * repo : update links to new url ggml-ci * cont : more urls ggml-ci	2025-02-15 16:40:57 +02:00
Georgi Gerganov	dbc2ec59b5	docker : drop to CUDA 12.4 (#11869 ) * docker : drop to CUDA 12.4 * docker : update readme [no ci]	2025-02-14 14:48:40 +02:00
R0CKSTAR	bd6e55bfd3	musa: bump MUSA SDK version to rc3.1.1 (#11822 ) * musa: Update MUSA SDK version to rc3.1.1 Signed-off-by: Xiaodong Ye <xiaodong.ye@mthreads.com> * musa: Remove workaround in PR #10042 Signed-off-by: Xiaodong Ye <xiaodong.ye@mthreads.com> --------- Signed-off-by: Xiaodong Ye <xiaodong.ye@mthreads.com>	2025-02-13 13:28:18 +01:00

1 2 3 4

152 Commits