llama.cpp/ggml.c at e1886cf4fe0d0f31661dda52a4a9f34bd9b9009a

mirror of https://github.com/ggml-org/llama.cpp.git synced 2025-11-01 09:01:57 +00:00

Files

Georgi Gerganov ce2c7d72e2 metal : handle buffers larger than device's maxBufferLength (#1826 )

* metal : handle buffers larger than device's maxBufferLength

* metal : print more verbose device info + handle errors

* metal : fix prints for overlapping views

* metal : minimize view overlap to try to utilize device memory better

2023-06-18 09:09:47 +03:00

575 KiB

Raw Blame History

View Raw

575 KiB Raw Blame History

575 KiB

Raw Blame History