Aaron Teo
|
ab60ae6ca2
|
ggml-zdnn: add zdnn_init call for static libs
Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>
|
2025-07-29 00:55:44 +08:00 |
|
Aaron Teo
|
0ae2d30302
|
ggml-zdnn: add nnpa installed detection
Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>
|
2025-07-29 00:39:55 +08:00 |
|
Aaron Teo
|
a9438925f2
|
ggml-zdnn: add parmblkformat detections
Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>
|
2025-07-29 00:36:55 +08:00 |
|
Aaron Teo
|
1c6ca76c2e
|
ggml-zdnn: remove free_buffer debug info
Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>
|
2025-07-29 00:27:16 +08:00 |
|
Aaron Teo
|
1a0520a540
|
ggml-zdnn: add logging to debug free buffer
Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>
|
2025-07-29 00:12:18 +08:00 |
|
Aaron Teo
|
2872276d8a
|
ggml-zdnn: fix invalid ztensor buffer release
Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>
|
2025-07-29 00:09:00 +08:00 |
|
Aaron Teo
|
2cfa118fa9
|
ggml-zdnn: fix missing load tensor
Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>
|
2025-07-28 23:42:24 +08:00 |
|
Aaron Teo
|
fc9260deab
|
ggml-zdnn: attempt to fix sigsegv
Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>
|
2025-07-28 23:37:50 +08:00 |
|
Aaron Teo
|
e0549c2925
|
ggml-zdnn: fix missing vector import in header
Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>
|
2025-07-28 23:33:37 +08:00 |
|
Aaron Teo
|
f99b274cac
|
ggml-zdnn: fix missing vector import
Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>
|
2025-07-28 23:30:48 +08:00 |
|
Aaron Teo
|
0905168388
|
ggml-zdnn: rewrite into mre
Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>
|
2025-07-28 23:26:15 +08:00 |
|
Aaron Teo
|
03ec5d3ed3
|
ggml-zdnn: bring back working matmul
Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>
|
2025-07-28 18:14:44 +08:00 |
|
Aaron Teo
|
4cc62cb693
|
ggml-zdnn: move bias data to local also
Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>
|
2025-07-28 18:10:14 +08:00 |
|
Aaron Teo
|
6f42570194
|
ggml-zdnn: move everything back to local declaration
Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>
|
2025-07-28 18:08:47 +08:00 |
|
Aaron Teo
|
eefa943b0a
|
ggml-zdnn: fix sigsegv
Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>
|
2025-07-28 18:03:17 +08:00 |
|
Aaron Teo
|
fc692ed498
|
ggml-zdnn: figure out why sigtrap is happening
Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>
|
2025-07-28 18:00:28 +08:00 |
|
Aaron Teo
|
08de84ef85
|
ggml-zdnn: bugfix transform ztensor vs origtensor
Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>
|
2025-07-28 16:57:57 +08:00 |
|
Aaron Teo
|
032dce5a6a
|
ggml-zdnn: fix sequencing of transforms
Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>
|
2025-07-28 16:46:17 +08:00 |
|
Aaron Teo
|
cf0e190c40
|
ggml-zdnn: add more safeguards in matmul
Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>
|
2025-07-28 16:44:39 +08:00 |
|
Aaron Teo
|
f239bbb02d
|
ggml-zdnn: move weights transform into mulmat
Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>
|
2025-07-28 16:38:44 +08:00 |
|
Aaron Teo
|
092fa3a328
|
ggml-zdnn: activate bias transform in matmul
Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>
|
2025-07-28 16:27:35 +08:00 |
|
Aaron Teo
|
f7e8d6f2b2
|
ggml-zdnn: add logger to check if mat mul ops go through set_tensor
Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>
|
2025-07-28 16:17:12 +08:00 |
|
Aaron Teo
|
6d71749c26
|
ggml-zdnn: add more debug info for extra buffer transform
Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>
|
2025-07-28 16:10:07 +08:00 |
|
Aaron Teo
|
4b2f1cb1b8
|
ggml-zdnn: add bias data transform
Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>
|
2025-07-28 16:05:53 +08:00 |
|
Aaron Teo
|
f800c80281
|
ggml-zdnn: add bias ztensor and data free
Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>
|
2025-07-28 15:59:52 +08:00 |
|
Aaron Teo
|
bee7dd3020
|
ggml-zdnn: tighten memory usage, change string allocation
Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>
|
2025-07-28 15:55:42 +08:00 |
|
Aaron Teo
|
aef93b3908
|
ggml-zdnn: add bias init_tensor
Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>
|
2025-07-28 15:41:56 +08:00 |
|
Aaron Teo
|
f263f5d9ae
|
ggml-zdnn: fix missing data transform call
Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>
|
2025-07-24 18:30:10 +08:00 |
|
Aaron Teo
|
1c75ed63e5
|
ggml-zdnn: fix compiler error missing type
Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>
|
2025-07-24 18:22:34 +08:00 |
|
Aaron Teo
|
a1d8568c14
|
ggml-zdnn: impl matmul
Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>
|
2025-07-24 18:13:07 +08:00 |
|
Aaron Teo
|
59e9805ab0
|
ggml-zdnn: code clean up
Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>
|
2025-07-24 16:26:29 +08:00 |
|
Aaron Teo
|
c1653ab639
|
ggml-zdnn: fix incorrect ztensor shape, reduce memory padding
Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>
|
2025-07-24 16:22:06 +08:00 |
|
Aaron Teo
|
828519659b
|
ggml-zdnn: update supports_op matmul matrix
Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>
|
2025-07-24 16:11:37 +08:00 |
|
Aaron Teo
|
18658b8607
|
ggml-zdnn: impl init_tensor
Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>
|
2025-07-24 12:02:20 +08:00 |
|
Aaron Teo
|
da2e0e70ba
|
ggml-zdnn: switch buffers back and set to arbitrary number
Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>
|
2025-07-24 02:31:22 +08:00 |
|
Aaron Teo
|
63fbc45ed6
|
ggml-zdnn: switch to std vector instead of array
Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>
|
2025-07-24 01:09:01 +08:00 |
|
Aaron Teo
|
b7f4b6fde3
|
ggml-zdnn: rework init_tensor to create new buffers
Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>
|
2025-07-24 01:03:53 +08:00 |
|
Aaron Teo
|
ee0ed78d54
|
ggml-zdnn: add check for view tensors to prevent init_tensor
Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>
|
2025-07-24 00:56:32 +08:00 |
|
Aaron Teo
|
13c64448bd
|
ggml-zdnn: assign tensor->extra to buffer
Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>
|
2025-07-24 00:48:32 +08:00 |
|
Aaron Teo
|
13c05872f2
|
ggml-zdnn: implement at least 1 op to test
Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>
|
2025-07-24 00:44:05 +08:00 |
|
Aaron Teo
|
9e84742e72
|
ggml-zdnn: test ztensor finding in init_tensor
Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>
|
2025-07-24 00:40:22 +08:00 |
|
Aaron Teo
|
af9f4f0039
|
ggml-zdnn: fix compiler warnings and bugfixes
Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>
|
2025-07-24 00:25:41 +08:00 |
|
Aaron Teo
|
ae2f656d7e
|
ggml-zdnn: bugfix new impl
Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>
|
2025-07-24 00:18:53 +08:00 |
|
Aaron Teo
|
7c6395f826
|
ggml-zdnn: rewrite the backend implementation
Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>
|
2025-07-24 00:14:45 +08:00 |
|
Aaron Teo
|
04ddb2ac95
|
ggml-zdnn: update op out_prod to use tensor->extra
Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>
|
2025-07-23 19:51:37 +08:00 |
|
Aaron Teo
|
77a753297b
|
ggml-zdnn: support op out_prod
Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>
|
2025-07-23 19:28:51 +08:00 |
|
Aaron Teo
|
11d58d29de
|
ggml-zdnn: add comments to prevent accidentally deleting lines
Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>
|
2025-07-23 14:54:44 +08:00 |
|
Aaron Teo
|
529bdb9fbd
|
ggml-zdnn: last working matmul version
Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>
|
2025-07-22 00:29:47 +08:00 |
|
Aaron Teo
|
60b9874dea
|
ggml-zdnn: update set_tensor logging to check only for matmul
Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>
|
2025-07-21 21:11:39 +08:00 |
|
Aaron Teo
|
b9756b6dd4
|
ggml-zdnn: add more loggers
Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>
|
2025-07-21 21:09:21 +08:00 |
|