Default Branch

945501f5ea · llama: fix leaked buffers for mmap + split files (#16765) · Updated 2025-10-27 08:17:31 +00:00

Branches

6b9554a740 · metal : print more GPU info + disable mul_mm for MTLGPUFamiliy < Apple7 · Updated 2023-10-08 06:55:13 +00:00    CS348Project

5436
5

ba44776dc2 · bump version · Updated 2023-10-07 18:47:48 +00:00    CS348Project

5435
6

5ab6c2132a · server-parallel : add "--reverse-prompt" + compiler warning fixes · Updated 2023-10-06 11:32:19 +00:00    CS348Project

5448
4

5418932b71 · llama : fix comments for llama_kv_cache API · Updated 2023-10-03 18:01:52 +00:00    CS348Project

5473
5

c5650ed470 · server : avoid context swaps by shifting the KV cache · Updated 2023-09-28 16:03:36 +00:00    CS348Project

5497
57

72e7ef4e53 · simple : fixes · Updated 2023-09-26 21:19:36 +00:00    CS348Project

5523
48

784d14ed31 · llama : store non-RoPEd K cache (WIP) · Updated 2023-09-17 20:43:07 +00:00    CS348Project

5535
5

92a4f86879 · llama : make starcoder graph build more consistent with others · Updated 2023-09-15 14:57:10 +00:00    CS348Project

5545
20

e7e7b11455 · llama : remove experimental stuff · Updated 2023-09-14 19:52:01 +00:00    CS348Project

5557
3

2f689dee06 · metal : minor · Updated 2023-09-07 12:33:21 +00:00    CS348Project

5590
5

30ac7a4117 · gitignore : metal · Updated 2023-09-04 19:23:16 +00:00    CS348Project

5602
12

f3a84b2e0d · llama : better express the KV cache dependencies in the graph · Updated 2023-09-04 18:44:48 +00:00    CS348Project

5602
5

c79d130f74 · make : fix speculative build · Updated 2023-09-04 12:50:04 +00:00    CS348Project

5603
9

847896aba7 · speculative : add --draft CLI arg · Updated 2023-09-03 10:51:07 +00:00    CS348Project

5609
3

8c2b881281 · cuda : poc for norm quants (only -b 1 works) · Updated 2023-08-30 18:42:28 +00:00    CS348Project

5650
3

b4e70822f6 · metal : add poc for normalized Q4_0 and Q4_1 · Updated 2023-08-30 15:47:16 +00:00    CS348Project

5650
7

488e03200e · Merge branch 'master' into gguf-publish-ci · Updated 2023-08-30 08:34:55 +00:00    CS348Project

5655
4

33a5517d87 · llama.cpp : print gguf version · Updated 2023-08-26 21:56:48 +00:00    CS348Project

5697
10

d34472c124 · Fix HellaSwag · Updated 2023-08-26 07:55:39 +00:00    CS348Project

5710
1

0248ca811e · gguf : add notes for tests · Updated 2023-08-25 06:08:05 +00:00    CS348Project

5722
10