Default Branch

3479efd112 · CANN: Improve device ID handling and aclnnArange checks (#16752) · Updated 2025-10-28 02:54:53 +00:00

Branches

9127800d83 · wip · Updated 2024-08-16 23:51:06 +00:00    CS348Project

3283
2

62d7b6c87f · cuda : re-add q4_0 · Updated 2024-08-14 10:37:03 +00:00    CS348Project

3279
3

93ec58b932 · server : fix typo in comment · Updated 2024-08-14 02:12:26 +00:00    CS348Project

3281
4

faaac59d16 · llama : support NUL bytes in tokens · Updated 2024-08-12 01:00:03 +00:00    CS348Project

3292
1

73bc9350cd · gguf-py : Numpy dequantization for grid-based i-quants · Updated 2024-08-10 03:47:31 +00:00    CS348Project

3312
2

9329953a61 · llama : avoid double tensor copy when saving session to buffer · Updated 2024-08-07 20:03:34 +00:00    CS348Project

3320
2

7764ab911d · update guide · Updated 2024-08-07 14:01:02 +00:00    CS348Project

3321
1

cad8abb49b · add tool to allow plotting tensor allocation maps within buffers · Updated 2024-08-06 20:09:51 +00:00    CS348Project

3329
1

6e299132e7 · clip : style changes · Updated 2024-08-06 08:44:29 +00:00    CS348Project

3653
56

16dab13bde · correct cmd name · Updated 2024-08-05 16:15:33 +00:00    CS348Project

3338
1

bddcc5f985 · llama : better replace_all · Updated 2024-08-04 10:42:08 +00:00    CS348Project

3354
1

229c35cb59 · gguf-py : remove LlamaFileTypeMap · Updated 2024-08-04 01:22:37 +00:00    CS348Project

3357
5

eab4a88210 · Using dp4a ptx intrinsics for an improved Mul8MAT perf [By Alcpz] · Updated 2024-07-29 15:52:29 +00:00    CS348Project

3375
1

9cddd9aeec · llama : cast seq_id in comparison with unsigned n_seq_max · Updated 2024-07-27 19:50:23 +00:00    CS348Project

3413
7

9aeb0e1f75 · sycl add conv support · Updated 2024-07-25 12:15:02 +00:00    CS348Project

3402
1

5934580905 · ggml : add and use ggml_cpu_has_llamafile() · Updated 2024-07-24 08:31:41 +00:00    CS348Project

3413
1

fe28a7b9d8 · llama : clean-up · Updated 2024-07-23 05:38:50 +00:00    CS348Project

3421
11

57349e1db3 · llama : allow overrides for tokenizer flags · Updated 2024-07-21 11:42:15 +00:00    CS348Project

3431
1

1932a1b871 · gguf-py : do not use title case for naming convention · Updated 2024-07-20 20:55:06 +00:00    CS348Project

3439
5

c8ee1bccdd · Fix Vulkan matmul tests compile errors · Updated 2024-07-20 06:01:18 +00:00    CS348Project

3439
1