Default Branch

945501f5ea · llama: fix leaked buffers for mmap + split files (#16765) · Updated 2025-10-27 08:17:31 +00:00

Branches

b83cae088c · speculative : add infill mode · Updated 2024-11-26 09:14:17 +00:00    CS348Project

2679
1

4ff0831ce6 · metal : use F16 math in mul_mat kernels · Updated 2024-11-25 13:15:26 +00:00    CS348Project

2692
1

f7b0233eca · wip · Updated 2024-11-16 08:33:55 +00:00    CS348Project

2754
1

5e6dad9322 · speculative : experimenting with Qwen2.5 · Updated 2024-11-14 09:31:31 +00:00    CS348Project

2776
2

33bdee667e · speculative : fix out-of-bounds access · Updated 2024-11-14 09:23:45 +00:00    CS348Project

2776
1

8c1b186cb5 · metal : minor Q4_0 optimization · Updated 2024-11-12 13:30:51 +00:00    CS348Project

2786
21

3d1fe1bb4d · metal : int -> short, style · Updated 2024-11-09 08:32:16 +00:00    CS348Project

2797
2

bd1198a67a · metal : fix build and some more comments · Updated 2024-11-09 08:09:50 +00:00    CS348Project

2797
1

a2385da59c · make : clean-up [no ci] · Updated 2024-11-08 11:46:20 +00:00    CS348Project

2804
9

94accca4c2 · vec move mask to shmem · Updated 2024-11-07 18:58:10 +00:00    CS348Project

2814
19

c5d8bb5a81 · leave only basic functions for SYCL CI · Updated 2024-11-06 07:47:50 +00:00    CS348Project

2879
2

4fc8673d09 · llama-bench : skip repeated values in consecutive lines · Updated 2024-11-02 14:37:33 +00:00    CS348Project

2839
1

20e12112fd · llama : suggest reduce ctx size when kv init fails · Updated 2024-11-01 23:55:19 +00:00    CS348Project

2842
2

afc4a7de65 · llama : enable flash attn automatically when supported · Updated 2024-10-30 22:30:06 +00:00    CS348Project

2859
1

8233009d4d · Support SYCL device register · Updated 2024-10-20 02:06:51 +00:00    CS348Project

2940
1

bc82fc2ed8 · llama-bench : add time-to-first-byte stat · Updated 2024-10-18 13:40:02 +00:00    CS348Project

2911
1

2d3fc54ac6 · add amx kernel for gemm · Updated 2024-10-18 03:35:49 +00:00    CS348Project

2921
1

630bce5a7f · ggml : fix possible buffer use after free in sched reserve · Updated 2024-10-17 22:21:54 +00:00    CS348Project

2919
1

17b3a3e8cc · llama : minor llama_grammar refactoring · Updated 2024-10-17 09:23:27 +00:00    CS348Project

2947
4

a34fc0dd86 · ci : reduce severity of unused Pyright ignore comments · Updated 2024-09-30 17:59:40 +00:00    CS348Project

3002
1