Aaron Teo
ab79c0bb80
devops: remove move shared objects
Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>
2025-09-09 16:45:17 +08:00
Aaron Teo
944ef7f0bc
devops: fix missing ggml shared object
failure to load model
Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>
2025-09-09 16:38:05 +08:00
Aaron Teo
b23e72e1d0
devops: attempt at fixing model loading failure
Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>
2025-09-09 16:19:35 +08:00
Aaron Teo
451aceb9a0
devops: fix unknown model loading failures
Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>
2025-09-09 16:16:49 +08:00
Aaron Teo
c3ab7855fd
devops: fix permission issue
Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>
2025-09-09 15:43:59 +08:00
Aaron Teo
7027c14d3c
devops: fix missing stage ref
Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>
2025-09-09 15:35:29 +08:00
Aaron Teo
74767bbc16
devops: add collector stage
Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>
2025-09-09 15:34:47 +08:00
Aaron Teo
3a09c656a7
devops: fix shared libs
Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>
2025-09-09 15:25:01 +08:00
Aaron Teo
28b41f73ed
devops: use correct libs path
Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>
2025-09-09 02:59:06 +08:00
Aaron Teo
2ff6694a0f
devops: fix shared libs in distroless
Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>
2025-09-08 18:31:58 +08:00
Aaron Teo
a070157511
devops: remove apt commands from distroless
Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>
2025-09-08 18:16:32 +08:00
Aaron Teo
23d34f9a98
devops: remove apt clean steps as distroless misses it
Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>
2025-09-08 17:57:48 +08:00
Aaron Teo
e172b00445
devops: add server build step
Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>
2025-09-08 17:50:10 +08:00
Aaron Teo
e53e1c450c
devops: copy more tools
Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>
2025-09-08 15:36:41 +08:00
Aaron Teo
ce7bd1955d
devops: rework s390x docker
Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>
2025-09-08 15:19:41 +08:00
Aaron Teo
955c426620
devops: move s390x docker into cpu docker
Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>
2025-09-08 14:56:07 +08:00
Aaron Teo
75846921d8
devops: add missing ninja
Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>
2025-09-08 14:03:38 +08:00
Aaron Teo
bdcbcaeead
devops: add s390x dockerfile
Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>
2025-09-08 13:59:54 +08:00
R0CKSTAR
b55f06e1aa
vulkan.Dockerfile: install vulkan SDK using tarball (#15282)
Signed-off-by: Xiaodong Ye <xiaodong.ye@mthreads.com>
2025-08-23 08:58:57 +02:00
Dobri Danchev
618575c582
Fix broken build: require updated pip to support --break-system-packages (#15357)
* Revert "devops : fix compile bug when the BASE_CUDA_DEV_CONTAINER is based on Ubuntu 24.04 (#15005)"
This reverts commit e4e915912c.
* devops: Allow pip to modify externally-managed python environment (system installation)
- Updated pip install commands to include the --break-system-packages
flag, ensuring compatibility when working with system-managed Python
environments (PEP 668).
- Note: The --break-system-packages option was introduced in 2023.
Ensure pip is updated to a recent version before using this flag.
fixes [#15004](https://github.com/danchev/llama.cpp/issues/15004)
2025-08-18 12:50:48 +02:00
simevo
e4e915912c
devops : fix compile bug when the BASE_CUDA_DEV_CONTAINER is based on Ubuntu 24.04 (#15005)
fixes #15004
Co-authored-by: Paolo Greppi <paolo.greppi@libpf.com>
2025-08-14 18:45:27 +03:00
Christian Kastner
646944cfa8
docker : Enable GGML_CPU_ALL_VARIANTS for ARM (#15267)
2025-08-14 16:22:58 +02:00
Ali Tariq
648ebcdb73
ci : Added CI with RISC-V RVV1.0 Hardware (#14439)
* Changed the CI file to hw
* Changed the CI file to hw
* Added to sudoers for apt
* Removed the clone command and used checkout
* Added libcurl
* Added gcc-14
* Checking gcc --version
* added gcc-14 symlink
* added CC and C++ variables
* Added the gguf weight
* Changed the weights path
* Added system specification
* Removed white spaces
* ci: Replace Jenkins riscv native build Cloud-V pipeline with GitHub Actions workflow
Removed the legacy .devops/cloud-v-pipeline Jenkins CI configuration and introduced .github/workflows/build-riscv-native.yml for native RISC-V builds using GitHub Actions.
* removed trailing whitespaces
---------
Co-authored-by: Akif Ejaz <akifejaz40@gmail.com>
2025-08-13 13:14:44 +03:00
diannao
2860d479b4
docker : add cann build pipline (#14591)
* docker: add cann build pipline
* docker: add cann build pipline
* docker: fix cann devops
* cann : fix multi card hccl
* Update ggml/src/ggml-cann/ggml-cann.cpp
Co-authored-by: Xuan-Son Nguyen <thichthat@gmail.com>
* Update ggml-cann.cpp
---------
Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>
Co-authored-by: Xuan-Son Nguyen <thichthat@gmail.com>
2025-08-01 10:02:34 +08:00
deepsek
66906cd82a
HIP: Enable Matrix cores for MMQ Kernels, Enable stream-K for CDNA 3 (#14624)
This commit adds support for MFMA instructions to MMQ. CDNA1/GFX908, CDNA2/GFX90a, and CDNA3/GFX942 are supported by the MFMA-enabled code path added by this commit. The code path and stream-k are enabled only on CDNA3 for now, as they fail to outperform BLAS in all cases on the other devices.
BLAS is currently only consistently outperformed on CDNA3 due to issues in the AMD-provided BLAS libraries.
This commit also makes MMQ more aware of different warp sizes and, as a side effect, improves the performance of all quant formats besides q4_0 and q4_1, which regress slightly, on GCN GPUs.
2025-07-27 00:28:14 +02:00
R0CKSTAR
3f4fc97f1d
musa: upgrade musa sdk to rc4.2.0 (#14498)
* musa: apply mublas API changes
Signed-off-by: Xiaodong Ye <xiaodong.ye@mthreads.com>
* musa: update musa version to 4.2.0
Signed-off-by: Xiaodong Ye <xiaodong.ye@mthreads.com>
* musa: restore MUSA graph settings in CMakeLists.txt
Signed-off-by: Xiaodong Ye <xiaodong.ye@mthreads.com>
* musa: disable mudnnMemcpyAsync by default
Signed-off-by: Xiaodong Ye <xiaodong.ye@mthreads.com>
* musa: switch back to non-mudnn images
Signed-off-by: Xiaodong Ye <xiaodong.ye@mthreads.com>
* minor changes
Signed-off-by: Xiaodong Ye <xiaodong.ye@mthreads.com>
* musa: restore rc in docker image tag
Signed-off-by: Xiaodong Ye <xiaodong.ye@mthreads.com>
---------
Signed-off-by: Xiaodong Ye <xiaodong.ye@mthreads.com>
2025-07-24 20:05:37 +01:00
Wroclaw
760b4484e3
nix : use optionalAttrs for env mkDerivation attrset argument (#14726)
2025-07-17 15:18:16 -07:00
Vedran Miletić
e9b6350e61
scripts : make the shell scripts cross-platform (#14341)
2025-06-30 10:17:18 +02:00
Svetlozar Georgiev
40643edb86
sycl: fix docker image (#14144)
2025-06-13 18:32:56 +02:00
R0CKSTAR
33983057d0
musa: Upgrade MUSA SDK version to rc4.0.1 and use mudnn::Unary::IDENTITY op to accelerate D2D memory copy (#13647)
* musa: fix build warning (unused parameter)
Signed-off-by: Xiaodong Ye <xiaodong.ye@mthreads.com>
* musa: upgrade MUSA SDK version to rc4.0.1
Signed-off-by: Xiaodong Ye <xiaodong.ye@mthreads.com>
* musa: use mudnn::Unary::IDENTITY op to accelerate D2D memory copy
Signed-off-by: Xiaodong Ye <xiaodong.ye@mthreads.com>
* Update ggml/src/ggml-cuda/cpy.cu
Co-authored-by: Johannes Gäßler <johannesg@5d6.de>
* musa: remove MUDNN_CHECK_GEN and use CUDA_CHECK_GEN instead in MUDNN_CHECK
Signed-off-by: Xiaodong Ye <xiaodong.ye@mthreads.com>
---------
Signed-off-by: Xiaodong Ye <xiaodong.ye@mthreads.com>
Co-authored-by: Johannes Gäßler <johannesg@5d6.de>
2025-05-21 09:58:49 +08:00
Alberto Cabrera Pérez
f71f40a284
ci : upgraded oneAPI version in SYCL workflows and dockerfile (#13532)
2025-05-19 11:46:09 +01:00
Xuan-Son Nguyen
da84c04d8f
docker : do not build tests (#13204)
* docker : do not build tests
* include "ggml-cpu.h"
2025-04-30 10:44:07 +02:00
Rudi Servo
b0091ecc1e
docker : added all CPU to GPU images (#12749)
2025-04-10 01:17:12 +02:00
Chenguang Li
6e1c4cebdb
CANN: Support Opt CONV_TRANSPOSE_1D and ELU (#12786)
* [CANN] Support ELU and CONV_TRANSPOSE_1D
* [CANN]Modification review comments
* [CANN]Modification review comments
* [CANN]name adjustment
* [CANN]remove lambda used in template
* [CANN]Use std::func instead of template
* [CANN]Modify the code according to the review comments
---------
Signed-off-by: noemotiovon <noemotiovon@gmail.com>
2025-04-09 14:04:14 +08:00
Xuan-Son Nguyen
bd3f59f812
cmake : enable curl by default (#12761)
* cmake : enable curl by default
* no curl if no examples
* fix build
* fix build-linux-cross
* add windows-setup-curl
* fix
* shell
* fix path
* fix windows-latest-cmake*
* run: include_directories
* LLAMA_RUN_EXTRA_LIBS
* sycl: no llama_curl
* no test-arg-parser on windows
* clarification
* try riscv64 / arm64
* windows: include libcurl inside release binary
* add msg
* fix mac / ios / android build
* will this fix xcode?
* try clearing the cache
* add bunch of licenses
* revert clear cache
* fix xcode
* fix xcode (2)
* fix typo
2025-04-07 13:35:19 +02:00
Georgi Gerganov
68ff663a04
repo : update links to new url (#11886)
* repo : update links to new url
ggml-ci
* cont : more urls
ggml-ci
2025-02-15 16:40:57 +02:00
Georgi Gerganov
dbc2ec59b5
docker : drop to CUDA 12.4 (#11869)
* docker : drop to CUDA 12.4
* docker : update readme [no ci]
2025-02-14 14:48:40 +02:00
R0CKSTAR
bd6e55bfd3
musa: bump MUSA SDK version to rc3.1.1 (#11822)
* musa: Update MUSA SDK version to rc3.1.1
Signed-off-by: Xiaodong Ye <xiaodong.ye@mthreads.com>
* musa: Remove workaround in PR #10042
Signed-off-by: Xiaodong Ye <xiaodong.ye@mthreads.com>
---------
Signed-off-by: Xiaodong Ye <xiaodong.ye@mthreads.com>
2025-02-13 13:28:18 +01:00
Xuan-Son Nguyen
d0c08040b6
ci : fix build CPU arm64 (#11472)
* ci : fix build CPU arm64
* failed, trying ubuntu 22
* vulkan: ubuntu 24
* vulkan : jammy --> noble
2025-01-29 00:02:56 +01:00
Nuno
d7d1eccacc
docker: allow installing pip packages system-wide (#11437)
Signed-off-by: rare-magma <rare-magma@posteo.eu>
2025-01-28 14:17:25 +00:00
Nuno
f643120bad
docker: add perplexity and bench commands to full image (#11438)
Signed-off-by: rare-magma <rare-magma@posteo.eu>
2025-01-28 10:42:32 +00:00
Xuan Son Nguyen
caf773f249
docker : fix ARM build and Vulkan build (#11434)
* ci : do not fail-fast for docker
* build arm64/amd64 separatedly
* fix pip
* no fast fail
* vulkan: try jammy
2025-01-26 22:45:32 +01:00
Nuno
6f53d8a6b4
docker: add missing vulkan library to base layer and update to 24.04 (#11422)
Signed-off-by: rare-magma <rare-magma@posteo.eu>
2025-01-26 18:22:43 +01:00
Diego Devesa
6e264a905b
docker : add GGML_CPU_ARM_ARCH arg to select ARM architecture to build for (#11419)
2025-01-25 17:22:41 +01:00
Diego Devesa
20a758155b
docker : fix CPU ARM build (#11403)
* docker : fix CPU ARM build
* add CURL to other builds
2025-01-25 15:22:29 +01:00
Rudi Servo
7c0e285858
devops : add docker-multi-stage builds (#10832)
2024-12-22 23:22:58 +01:00
Evgeny Kurnevsky
e52aba537a
nix: allow to override rocm gpu targets (#10794)
This allows to reduce compile time when you are building for a single GPU.
2024-12-14 10:17:36 -08:00
Corentin REGAL
11e07fd63b
fix: graceful shutdown for Docker images (#10815)
2024-12-13 18:23:50 +01:00
Diego Devesa
59f4db1088
ggml : add predefined list of CPU backend variants to build (#10626)
* ggml : add predefined list of CPU backend variants to build
* update CPU dockerfiles
2024-12-04 14:45:40 +01:00
Diego Devesa
3420909dff
ggml : automatic selection of best CPU backend (#10606)
* ggml : automatic selection of best CPU backend
* amx : minor opt
* add GGML_AVX_VNNI to enable avx-vnni, fix checks
2024-12-01 16:12:41 +01:00