Mirror of https://github.com/ggml-org/llama.cpp.git, synced 2025-11-01 09:01:57 +00:00
* metal : fix memory leak
* metal : fix encoders memory leak
* metal : clean up more memory resources
* metal : fix more leaks
* metal : reuse dispatch queue + autoreleasepool (see the sketch below)
* metal : reuse array for command buffers and encoders
* ggml : assert for odd number of blocks on ARM (15M tinyllama is an example)
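A minimal sketch of the "reuse dispatch queue + autoreleasepool" idea listed above, written as standalone Swift/Metal rather than the project's Objective-C backend (the names and loop below are illustrative assumptions, not llama.cpp code): the command queue is created once and reused, while each iteration's transient command buffer and encoder are created inside an autoreleasepool so they are released promptly instead of accumulating across graph evaluations.

```swift
import Foundation
import Metal

// Sketch only: one long-lived command queue, per-iteration autoreleasepool.
guard let device = MTLCreateSystemDefaultDevice(),
      let queue = device.makeCommandQueue() else {
    fatalError("Metal device unavailable")
}

for _ in 0..<16 {
    autoreleasepool {
        // Command buffers and encoders are transient, autoreleased objects;
        // creating and draining them inside the pool keeps peak memory flat
        // instead of leaking one set per compute call.
        guard let cmdBuf = queue.makeCommandBuffer(),
              let encoder = cmdBuf.makeComputeCommandEncoder() else { return }
        // ... encode compute kernels here ...
        encoder.endEncoding()
        cmdBuf.commit()
        cmdBuf.waitUntilCompleted()
    }
}
```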