Francis Couture-Harpin
|
e33de128c7
|
common : move string_remove_suffix from quantize and imatrix
Co-authored-by: Sigbjørn Skjæret <sigbjorn.skjaeret@scala.com>
|
2025-06-23 16:24:06 -04:00 |
|
Francis Couture-Harpin
|
118d52fefc
|
Merge branch 'master' into compilade/imatrix-batched-chunks
|
2025-06-23 12:54:56 -04:00 |
|
Francis Couture-Harpin
|
0e79355075
|
quantize : fix dataset name loading from gguf imatrix
|
2025-06-23 12:43:25 -04:00 |
|
Francis Couture-Harpin
|
43cd2b3eb5
|
imatrix : support 3d tensors with MUL_MAT
|
2025-06-23 12:20:55 -04:00 |
|
Ed Addario
|
fa4a9f2a1c
|
quantize : handle user-defined pruning of whole layers (blocks) (#13037)
|
2025-06-22 23:16:26 +02:00 |
|
Francis Couture-Harpin
|
2c0945027a
|
Merge branch 'master' into compilade/imatrix-batched-chunks
|
2025-06-18 16:32:35 -04:00 |
|
Ed Addario
|
e5c834f718
|
quantize : improve tensor-type pattern matching (#13033)
|
2025-05-13 19:12:31 +02:00 |
|
Diego Devesa
|
1d36b3670b
|
llama : move end-user examples to tools directory (#13249)
Co-authored-by: Xuan Son Nguyen <son@huggingface.co>
|
2025-05-02 20:27:13 +02:00 |
|