llama.cpp/ggml-rpc.cpp at bfaa676b0841617d4ef3596e63aca6be1a8eb1b5

mirror of https://github.com/ggml-org/llama.cpp.git synced 2025-10-30 08:42:00 +00:00

Files

Radoslav Gerganov bde7cd3cd9 llama : offload to RPC in addition to other backends (#7640 )

* llama : offload to RPC in addition to other backends

* - fix copy_tensor being called on the src buffer instead of the dst buffer

- always initialize views in the view_src buffer

- add RPC backend to Makefile build

- add endpoint to all RPC object names

* add rpc-server to Makefile

* Update llama.cpp

Co-authored-by: slaren <slarengh@gmail.com>

---------

Co-authored-by: slaren <slarengh@gmail.com>

2024-06-03 20:03:26 +03:00

43 KiB

Raw Blame History

View Raw

43 KiB Raw Blame History

43 KiB

Raw Blame History