llama : refactor tensor offloading as callback

This commit is contained in:
Georgi Gerganov
2023-10-29 12:35:07 +02:00
parent da936188d8
commit 1e9c5443c2

1242
llama.cpp

File diff suppressed because it is too large Load Diff