LLM inference in C/C++. This is a mirror of https://github.com/ggml-org/llama.cpp. All credit to the upstream authors.
Updated 2025-10-27 08:17:31 +00:00