LLM inference in C/C++. This is a mirror of https://github.com/ggml-org/llama.cpp. All credit to the upstream authors.
Updated 2025-10-26 06:34:35 +00:00