Georgi Gerganov
|
6378112cb5
|
graph : remove the build_kv_... API from llama_graph_i
ggml-ci
|
2025-02-23 19:39:22 +02:00 |
|
Georgi Gerganov
|
372fa3a894
|
cont : enc should work now, next is dec
ggml-ci
|
2025-02-23 12:20:23 +02:00 |
|
Georgi Gerganov
|
f5e80208c5
|
wip enc-dec
|
2025-02-21 19:17:47 +02:00 |
|
Georgi Gerganov
|
548c230dff
|
graph : remove worst_case from the API
ggml-ci
|
2025-02-21 13:29:25 +02:00 |
|
Georgi Gerganov
|
b1554be1d7
|
context : add cache-less llama_context
ggml-ci
|
2025-02-20 18:30:04 +02:00 |
|
Georgi Gerganov
|
f95b04a21c
|
model : fix order kvq -> qkv
ggml-ci
|
2025-02-19 18:52:20 +02:00 |
|
Georgi Gerganov
|
2eacb4c1bf
|
graph : simplify attention api
ggml-ci
|
2025-02-19 18:43:49 +02:00 |
|
Georgi Gerganov
|
e17e4b72d1
|
context : add llama_context_recurrent
ggml-ci
|
2025-02-19 16:07:27 +02:00 |
|
Georgi Gerganov
|
5f11a5502a
|
kv-cache : remove llama_kv_cache_i
|
2025-02-19 14:36:27 +02:00 |
|
Georgi Gerganov
|
f5cedbcaaa
|
kv-cache : prepare for abstraction
ggml-ci
|
2025-02-18 21:28:58 +02:00 |
|
Georgi Gerganov
|
172f61690c
|
cont : return important tensors
ggml-ci
|
2025-02-18 13:48:43 +02:00 |
|
Georgi Gerganov
|
c23590319a
|
graph : add llama_graph_result
ggml-ci
|
2025-02-18 13:48:21 +02:00 |
|
Georgi Gerganov
|
1d801d27b9
|
graph : update attn/kv_self names
|
2025-02-14 17:22:55 +02:00 |
|
Georgi Gerganov
|
fbe6a07256
|
context : rename to llama_context_kv_self
|
2025-02-12 17:16:44 +02:00 |
|
Georgi Gerganov
|
6ee86e5e0f
|
graph : restore ubatch in build_cb
ggml-ci
|
2025-02-12 16:29:15 +02:00 |
|
Georgi Gerganov
|
f63aeecce6
|
llama : models now build their graphs using llama_graph_i
ggml-ci
|
2025-02-12 15:08:40 +02:00 |
|
Georgi Gerganov
|
e633dc171a
|
context : introduce llama_graph_i
ggml-ci
|
2025-02-12 13:49:44 +02:00 |
|