added README with instructions on setting up the backend

2025-10-20 20:15:26 -04:00
parent b0c6cfbf62
commit c22496493b
1 changed files with 26 additions and 0 deletions
--- a/backend/README.md
+++ b/backend/README.md
@@ -0,0 +1,26 @@
+# MIND Backend
+
+## Setup
+The steps below set up the backend for local testing: the `go`
+orchestration layer on `localhost:8080` and a `llama.cpp` inference
+server on `localhost:8081`.
+### Building `llama.cpp`
+In `$REPO/third_party/llama.cpp` run `make` to build.
+
+### Running `llama.cpp`
+#### Getting a `GGUF` format model
+Run `./backend/get-qwen3-1.7b.sh` to download the Qwen 3 1.7B model
+from Hugging Face.
+#### Running the inference server
+Run `./llama-server -m <path-to-gguf-model> --port 8081` to start the
+inference server at `localhost:8081`.
+
+### Running the backend layer
+Run `go run main.go`. This starts the backend layer at
+`localhost:8080`.
+
+## A simple CLI client
+The client lives at `backend/cli.py` and connects to the backend
+layer at `localhost:8080`.
+
+Use the `\help` command to list the available operations.
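
If the CLI client cannot connect, it may help to confirm that both servers from the setup steps are actually listening. The following is a minimal sketch, not part of the commit: the ports come from the README, while the names `SERVICES`, `port_open`, and `check_services` are hypothetical. It only tests whether a TCP connection succeeds and says nothing about whether the services behave correctly.

```python
import socket

# Ports taken from the README: Go backend on 8080, llama.cpp server on 8081.
# The dictionary itself is an illustrative assumption, not project code.
SERVICES = {"backend layer": 8080, "llama.cpp inference server": 8081}


def port_open(host: str, port: int, timeout: float = 1.0) -> bool:
    """Return True if a TCP connection to host:port succeeds within timeout."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers connection refused, timeouts, and name-resolution failures.
        return False


def check_services(host: str = "localhost") -> dict:
    """Map each service name to whether its port accepts connections."""
    return {name: port_open(host, port) for name, port in SERVICES.items()}


if __name__ == "__main__":
    for name, up in check_services().items():
        print(f"{name}: {'listening' if up else 'not reachable'}")
```

A service reported as not reachable usually means the corresponding step (`llama-server` or `go run main.go`) has not been started yet or is bound to a different port.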