Commit Graph

8 Commits

Author SHA1 Message Date
Francis Couture-Harpin
e33de128c7 common : move string_remove_suffix from quantize and imatrix
Co-authored-by: Sigbjørn Skjæret <sigbjorn.skjaeret@scala.com>
2025-06-23 16:24:06 -04:00
Francis Couture-Harpin
118d52fefc Merge branch 'master' into compilade/imatrix-batched-chunks 2025-06-23 12:54:56 -04:00
Francis Couture-Harpin
0e79355075 quantize : fix dataset name loading from gguf imatrix 2025-06-23 12:43:25 -04:00
Francis Couture-Harpin
43cd2b3eb5 imatrix : support 3d tensors with MUL_MAT 2025-06-23 12:20:55 -04:00
Ed Addario
fa4a9f2a1c quantize : handle user-defined pruning of whole layers (blocks) (#13037) 2025-06-22 23:16:26 +02:00
Francis Couture-Harpin
2c0945027a Merge branch 'master' into compilade/imatrix-batched-chunks 2025-06-18 16:32:35 -04:00
Ed Addario
e5c834f718 quantize : improve tensor-type pattern matching (#13033) 2025-05-13 19:12:31 +02:00
Diego Devesa
1d36b3670b llama : move end-user examples to tools directory (#13249)
* llama : move end-user examples to tools directory

---------

Co-authored-by: Xuan Son Nguyen <son@huggingface.co>
2025-05-02 20:27:13 +02:00