Add option to use existing llama.cpp build

This commit is contained in:
Oracle 2026-06-02 16:39:02 +02:00
parent 80a91e278e
commit 1f4e9e119c
Signed by: Oracle
SSH key fingerprint: SHA256:x4/RtnjUyuHkdvmwNDsWSfcfF1V5PNr3OpriZqOvCX8
2 changed files with 69 additions and 48 deletions

View file

@ -38,10 +38,12 @@ run-pipeline.sh → Run finetune → merge/convert → run in sequenc
`setup.sh` will:
1. Create a Python virtual environment and install Python dependencies
2. Clone [llama.cpp](https://github.com/ggml-org/llama.cpp)
3. Build llama.cpp with your selected GPU backend
2. Clone [llama.cpp](https://github.com/ggml-org/llama.cpp) or symlink an existing build
3. Build llama.cpp with your selected GPU backend (skip if using existing)
4. Install llama-cpp-python bindings with matching backend flags
**Using an existing llama.cpp build:** Choose option 2 and provide the absolute path to your existing build. Setup will create a symlink at `./llama.cpp`.
### Backend Selection
| Choice | Backend | Requirements |