llama.cpp/ggml/include/ggml-cann.h at gg/speculative-experiments

mirror of https://github.com/ggml-org/llama.cpp.git synced 2025-11-02 09:12:03 +00:00

Files

leo-pony 6b8447352d [CANN] Adapt to dynamically loadable backends mechanism (#9970 )

* [CANN] Adapt to dynamically loadable backends mechanism

* Fix the Bug: inference running result is garbled in debug running model for LM models who's type is Q4_0 class

* Handle the review comments of this pull request

2024-10-22 16:16:01 +08:00

4.4 KiB

Raw Permalink Blame History

View Raw

4.4 KiB Raw Permalink Blame History

4.4 KiB

Raw Permalink Blame History