llama : refactor tensor offloading as callback · 1e9c5443c2 - llama.cpp - Gitea - Peisong Xiao

CS348Project/llama.cpp

mirror of https://github.com/ggml-org/llama.cpp.git synced 2025-11-01 09:01:57 +00:00



Georgi Gerganov

2023-10-29 12:35:07 +02:00

parent da936188d8

commit 1e9c5443c2

1 changed file with 704 additions and 726 deletions


llama.cpp

File diff suppressed because it is too large.