Default Branch

945501f5ea · llama: fix leaked buffers for mmap + split files (#16765) · Updated 2025-10-27 08:17:31 +00:00

Branches

ed58975f51 · server : improve infill stop criteria · Updated 2025-03-12 13:28:48 +00:00    CS348Project

1907
1

87dae2fd15 · Vulkan: Print coopmat shapes, then exit · Updated 2025-03-09 10:53:55 +00:00    CS348Project

1921
1

25840747e6 · Vulkan: Add device architecture enum and logic to recognize AMD generations · Updated 2025-03-08 08:04:45 +00:00    CS348Project

2101
2

c75753a01b · server : infill gen ends on new line · Updated 2025-03-07 15:19:55 +00:00    CS348Project

1924
1

aefa65e442 · ci : fix save-load test invokations · Updated 2025-03-07 10:17:33 +00:00    CS348Project

1929
1

aae2903e0b · clang-tidy : disable bugprone-branch-clone · Updated 2025-03-07 09:36:55 +00:00    CS348Project

1930
1

624f7bd03b · graph : add comments · Updated 2025-02-28 19:13:08 +00:00    CS348Project

1994
95

0f2bf55502 · speculative : do not discard the last drafted token · Updated 2025-02-19 07:21:39 +00:00    CS348Project

2037
2

8654805027 · docker : publish to both ggerganov and ggml-org · Updated 2025-02-15 14:18:04 +00:00    CS348Project

2078
1

f30aca84b2 · Revert "HIP: Switch to std::vector in rocblas version check (#11820)" · Updated 2025-02-12 18:22:04 +00:00    CS348Project

2082
1

d86e23101e · server : minor log updates · Updated 2025-02-08 14:23:37 +00:00    CS348Project

2108
1

3b6a0a817a · llama : add log about loading model tensors · Updated 2025-02-06 07:24:07 +00:00    CS348Project

2129
1

947158ee52 · Specify podman works in Container documentation · Updated 2025-02-05 13:47:21 +00:00    CS348Project

2134
1

de9d2c6f09 · test [pack] · Updated 2025-01-24 22:24:31 +00:00    CS348Project

2231
3

969b264657 · Revert "TMP : push artifacts" · Updated 2025-01-24 15:58:09 +00:00    CS348Project

2237
15

ff4cb6ef4c · release : pack /lib and /include in the packages · Updated 2025-01-24 11:28:37 +00:00    CS348Project

2237
1

c9e7cbb08b · safer jinja llama_chat_templates struct · Updated 2025-01-20 15:58:29 +00:00    CS348Project

2276
34

90a0349349 · recommended way to check if the version is 0.3, as requested by ngxson · Updated 2025-01-19 13:43:59 +00:00    CS348Project

2274
2

ba421dd04e · gguf-test: tensor data comparison · Updated 2025-01-18 08:49:47 +00:00    CS348Project

2276
7

492eaad571 · ci : change python3 -> python · Updated 2025-01-15 14:18:56 +00:00    CS348Project

2290
1