Aaron Teo
1b23fec005
ggml-cpu: remove mistaken fallback macro
...
fallback logic was already implemented but i was too sleepy to realise
Signed-off-by: Aaron Teo <aaron.teo1@ibm.com >
2025-06-25 01:31:53 +08:00
Aaron Teo
a02b360f2c
ggml-cpu: rename all fp16<->fp32 macros to prefix with ggml_cpu
...
ref: https://github.com/ggml-org/llama.cpp/pull/14317#discussion_r2164449406
Signed-off-by: Aaron Teo <aaron.teo1@ibm.com >
2025-06-25 01:07:58 +08:00
Aaron Teo
e4a7f84d37
ggml-cpu: move nnpa together with other fp16<->fp32 simd
...
Signed-off-by: Aaron Teo <aaron.teo1@ibm.com >
2025-06-24 22:31:39 +08:00
Aaron Teo
e4666f93d3
ggml-cpu: attempt at fixing loongarch failing build
...
Signed-off-by: Aaron Teo <aaron.teo1@ibm.com >
2025-06-24 22:29:08 +08:00
Aaron Teo
17b032fab8
ggml: refactor fp16<->fp32 simd to ggml-cpu
...
Signed-off-by: Aaron Teo <aaron.teo1@ibm.com >
2025-06-24 20:42:15 +08:00
Aaron Teo
8a5e011cb5
Revert "ggml: refactor fp32->fp16 and fp16->fp32 simd to ggml-cpu"
...
This reverts commit bd288e8fa5 .
Signed-off-by: Aaron Teo <aaron.teo1@ibm.com >
2025-06-24 15:54:33 +08:00
Aaron Teo
e73413bb98
Revert "ggml-cpu: fix duplicate func names during compile"
...
This reverts commit fbb733451f .
Signed-off-by: Aaron Teo <aaron.teo1@ibm.com >
2025-06-24 15:54:18 +08:00
Aaron Teo
fbb733451f
ggml-cpu: fix duplicate func names during compile
...
Signed-off-by: Aaron Teo <aaron.teo1@ibm.com >
2025-06-24 15:18:20 +08:00
Aaron Teo
bd288e8fa5
ggml: refactor fp32->fp16 and fp16->fp32 simd to ggml-cpu
...
Signed-off-by: Aaron Teo <aaron.teo1@ibm.com >
2025-06-24 15:05:46 +08:00
Aaron Teo
5834dee1fc
ggml-cpu: move nnpa fp16->fp32 and fp32->fp16 to simd-mappings
...
Signed-off-by: Aaron Teo <aaron.teo1@ibm.com >
2025-06-23 17:52:28 +08:00
Aaron Teo
489cdf44bf
ggml-cpu: clarify naming of dlf16
...
Signed-off-by: Aaron Teo <aaron.teo1@ibm.com >
2025-06-21 23:34:42 +08:00
Aaron Teo
8ef51b9055
ggml-cpu: bring back fp32->fp16 store nnpa
...
Signed-off-by: Aaron Teo <aaron.teo1@ibm.com >
2025-06-21 17:49:36 +08:00
Aaron Teo
27131e5f34
ggml-cpu: disable fp32->fp16 nnpa conversions for now
...
there are some conversion failures in nnpa that requires the eyes of an
ibm stsm. will create a separate pr to introduce the fp32->fp16 change.
Signed-off-by: Aaron Teo <aaron.teo1@ibm.com >
2025-06-21 16:58:43 +08:00
Aaron Teo
9330454cb8
ggml-cpu: remove sigint from fp16 store
...
for some reason, the function is not getting a hit when debugged with
gdb. we will need to investigate further
Signed-off-by: Aaron Teo <aaron.teo1@ibm.com >
2025-06-21 15:06:31 +08:00
Aaron Teo
575ea9f6c6
ggml-cpu: fp16 load ensured to hit
...
Signed-off-by: Aaron Teo <aaron.teo1@ibm.com >
2025-06-21 15:00:46 +08:00
Aaron Teo
8f3a5af6c0
ggml-cpu: ensure fp16 and fp32 load and stores are called
...
Signed-off-by: Aaron Teo <aaron.teo1@ibm.com >
2025-06-21 14:57:25 +08:00
Aaron Teo
94f10ca189
ggml-cpu: fix float placeholder
...
Signed-off-by: Aaron Teo <aaron.teo1@ibm.com >
2025-06-21 14:53:15 +08:00
Aaron Teo
d9cc63a94a
ggml-cpu: fix print vs printf
...
Signed-off-by: Aaron Teo <aaron.teo1@ibm.com >
2025-06-21 14:51:38 +08:00
Aaron Teo
48b820d05f
ggml-cpu: add debugging prints to see if dlf16 is correct
...
Signed-off-by: Aaron Teo <aaron.teo1@ibm.com >
2025-06-21 14:50:33 +08:00
Aaron Teo
ffe296457e
ggml-cpu: better variable names
...
Signed-off-by: Aaron Teo <aaron.teo1@ibm.com >
(cherry picked from commit 2f58bbcbb8 )
2025-06-21 14:47:46 +08:00
Aaron Teo
ebf9f34a38
ggml-cpu: add fp32->fp16
...
Signed-off-by: Aaron Teo <aaron.teo1@ibm.com >
(cherry picked from commit 0ff0d65162 )
2025-06-21 14:47:23 +08:00
Aaron Teo
45a4cf651c
ggml-cpu: add fp16->fp32 nnpa first
...
Signed-off-by: Aaron Teo <aaron.teo1@ibm.com >
(cherry picked from commit 8d4a7987f9 )
2025-06-21 14:47:12 +08:00
Aaron Teo
50d2227953
ggml-cpu: reduce asm calls for hsum ( #14037 )
...
Signed-off-by: Aaron Teo <aaron.teo1@ibm.com >
2025-06-18 18:10:08 +01:00
Vineel Abhinav
1b8fb8152d
ggml: aarch64: Implement SVE F32 kernels for vector functions ( #13843 )
...
* F32-Mamba-SVE
* F32-Mamba-SVE
* Resolve test errors-1
* Resolve test errors-2
* F32-vec-SVE
* F32-vec-SVE
* F32-vec-SVE
2025-05-29 09:01:33 +03:00
shalinib-ibm
416313773b
ggml : fix ppc64le build ( #13176 )
...
Build fails with compilation error on power pc.
This patch fixes the same.
Tested with unit tests run via
--build <build_dir> && cd <build_dir> && make test
Signed-off-by: Shalini Salomi Bodapati <Shalini.Salomi.Bodapati@ibm.com >
2025-04-30 13:17:08 +02:00
Aaron Teo
0fed24c347
ggml: fix compilation error s390x ( #12848 )
...
* ggml: fixes #12846 compilation error
Signed-off-by: Aaron Teo <aaron.teo1@ibm.com >
Co-authored-by: Aleksei Nikiforov <aleksei.nikiforov@ibm.com >
* ggml: add documentation for code change
Signed-off-by: Aaron Teo <aaron.teo1@ibm.com >
Co-authored-by: Aleksei Nikiforov <aleksei.nikiforov@ibm.com >
* ggml: refactor to type-cast and update documentation
Signed-off-by: Aaron Teo <aaron.teo1@ibm.com >
Co-authored-by: Aleksei Nikiforov <aleksei.nikiforov@ibm.com >
* ggml: update documentation to provide full issue link
Signed-off-by: Aaron Teo <aaron.teo1@ibm.com >
Co-authored-by: Aleksei Nikiforov <aleksei.nikiforov@ibm.com >
---------
Co-authored-by: Aleksei Nikiforov <aleksei.nikiforov@ibm.com >
2025-04-11 08:20:07 +03:00
Prajwal B Mehendarkar
11d07e1e69
Fixes #12823 ( #12830 )
...
* Including limits file on AIX
* Fixes #12823
2025-04-10 01:18:01 +02:00
Georgi Gerganov
ff067dbcb9
ggml : simplify Arm fp16 CPU logic (ggml/1177)
...
* ggml : simlpify Arm fp16 CPU logic
ggml-ci
* cont : bring back CUDA/MUSA checks
ggml-ci
2025-04-07 18:44:17 +03:00
cmdr2
995083e4ed
cpu: move all the operators into a separate c++ file (except mul_mat) (ggml/1167)
...
* cpu: refactor SIMD mappings and vectorized op functions into separate files
* Fix warning for ggml_float to float
* Fix warnings
* cpu: move all the operations (except mul_mat) to a separate c++ file
* fix whitespace
* Update ggml/src/ggml-cpu/vec.h
Co-authored-by: Diego Devesa <slarengh@gmail.com >
* Fix PR comments - use GGML_UNUSED, use cassert in ops.cpp
* Reverse the order of import for ops.h and vec.h, to match what was present in ggml-cpu.c previously
---------
Co-authored-by: Diego Devesa <slarengh@gmail.com >
2025-04-07 18:44:17 +03:00