mirror of
https://github.com/MODSetter/SurfSense.git
synced 2026-04-25 00:36:31 +02:00
227 lines
8.8 KiB
Text
227 lines
8.8 KiB
Text
---
|
|
title: Docker Installation
|
|
description: Setting up SurfSense using Docker
|
|
---
|
|
|
|
This guide explains how to run SurfSense using Docker, with options ranging from a single-command install to a fully manual setup.
|
|
|
|
## Quick Start
|
|
|
|
### Option 1 — Install Script (recommended)
|
|
|
|
Downloads the compose files, generates a `SECRET_KEY`, and starts all services automatically:
|
|
|
|
```bash
|
|
curl -fsSL https://raw.githubusercontent.com/MODSetter/SurfSense/main/docker/scripts/install.sh | bash
|
|
```
|
|
|
|
This creates a `./surfsense/` directory with `docker-compose.yml` and `.env`, then runs `docker compose up -d`.
|
|
|
|
### Option 2 — Manual Docker Compose
|
|
|
|
```bash
|
|
git clone https://github.com/MODSetter/SurfSense.git
|
|
cd SurfSense/docker
|
|
cp .env.example .env
|
|
# Edit .env — at minimum set SECRET_KEY
|
|
docker compose up -d
|
|
```
|
|
|
|
After starting, access SurfSense at:
|
|
|
|
- **Frontend**: [http://localhost:3000](http://localhost:3000)
|
|
- **Backend API**: [http://localhost:8000](http://localhost:8000)
|
|
- **API Docs**: [http://localhost:8000/docs](http://localhost:8000/docs)
|
|
- **Electric SQL**: [http://localhost:5133](http://localhost:5133)
|
|
|
|
---
|
|
|
|
## Configuration
|
|
|
|
All configuration lives in a single `docker/.env` file (or `surfsense/.env` if you used the install script). Copy `.env.example` to `.env` and edit the values you need.
|
|
|
|
### Required
|
|
|
|
| Variable | Description |
|
|
|----------|-------------|
|
|
| `SECRET_KEY` | JWT secret key. Generate with: `openssl rand -base64 32`. Auto-generated by the install script. |
|
|
|
|
### Core Settings
|
|
|
|
| Variable | Description | Default |
|
|
|----------|-------------|---------|
|
|
| `SURFSENSE_VERSION` | Image tag to deploy. Pin to a version (e.g. `0.0.13.1`) or use `latest` | `latest` |
|
|
| `AUTH_TYPE` | Authentication method: `LOCAL` (email/password) or `GOOGLE` (OAuth) | `LOCAL` |
|
|
| `ETL_SERVICE` | Document parsing: `DOCLING` (local), `UNSTRUCTURED`, or `LLAMACLOUD` | `DOCLING` |
|
|
| `EMBEDDING_MODEL` | Embedding model for vector search | `sentence-transformers/all-MiniLM-L6-v2` |
|
|
| `TTS_SERVICE` | Text-to-speech provider for podcasts | `local/kokoro` |
|
|
| `STT_SERVICE` | Speech-to-text provider for audio files | `local/base` |
|
|
| `REGISTRATION_ENABLED` | Allow new user registrations | `TRUE` |
|
|
|
|
### Ports
|
|
|
|
| Variable | Description | Default |
|
|
|----------|-------------|---------|
|
|
| `FRONTEND_PORT` | Frontend service port | `3000` |
|
|
| `BACKEND_PORT` | Backend API service port | `8000` |
|
|
| `ELECTRIC_PORT` | Electric SQL service port | `5133` |
|
|
|
|
### Custom Domain / Reverse Proxy
|
|
|
|
Only set these if serving SurfSense on a real domain via a reverse proxy (Caddy, Nginx, Cloudflare Tunnel, etc.). Leave commented out for standard localhost deployments.
|
|
|
|
| Variable | Description |
|
|
|----------|-------------|
|
|
| `NEXT_FRONTEND_URL` | Public frontend URL (e.g. `https://app.yourdomain.com`) |
|
|
| `BACKEND_URL` | Public backend URL for OAuth callbacks (e.g. `https://api.yourdomain.com`) |
|
|
| `NEXT_PUBLIC_FASTAPI_BACKEND_URL` | Backend URL used by the frontend (e.g. `https://api.yourdomain.com`) |
|
|
| `NEXT_PUBLIC_ELECTRIC_URL` | Electric SQL URL used by the frontend (e.g. `https://electric.yourdomain.com`) |
|
|
|
|
### Database
|
|
|
|
Defaults work out of the box. Change for security in production.
|
|
|
|
| Variable | Description | Default |
|
|
|----------|-------------|---------|
|
|
| `DB_USER` | PostgreSQL username | `surfsense` |
|
|
| `DB_PASSWORD` | PostgreSQL password | `surfsense` |
|
|
| `DB_NAME` | PostgreSQL database name | `surfsense` |
|
|
| `DB_HOST` | PostgreSQL host | `db` |
|
|
| `DB_PORT` | PostgreSQL port | `5432` |
|
|
| `DB_SSLMODE` | SSL mode: `disable`, `require`, `verify-ca`, `verify-full` | `disable` |
|
|
| `DATABASE_URL` | Full connection URL override. Use for managed databases (RDS, Supabase, etc.) | *(built from above)* |
|
|
|
|
### Electric SQL
|
|
|
|
| Variable | Description | Default |
|
|
|----------|-------------|---------|
|
|
| `ELECTRIC_DB_USER` | Replication user for Electric SQL | `electric` |
|
|
| `ELECTRIC_DB_PASSWORD` | Replication password for Electric SQL | `electric_password` |
|
|
| `ELECTRIC_DATABASE_URL` | Full connection URL override for Electric. Set to `host.docker.internal` when pointing at a local Postgres instance | *(built from above)* |
|
|
|
|
### Authentication
|
|
|
|
| Variable | Description |
|
|
|----------|-------------|
|
|
| `GOOGLE_OAUTH_CLIENT_ID` | Google OAuth client ID (required if `AUTH_TYPE=GOOGLE`) |
|
|
| `GOOGLE_OAUTH_CLIENT_SECRET` | Google OAuth client secret (required if `AUTH_TYPE=GOOGLE`) |
|
|
|
|
Create credentials at the [Google Cloud Console](https://console.cloud.google.com/apis/credentials).
|
|
|
|
### External API Keys
|
|
|
|
| Variable | Description |
|
|
|----------|-------------|
|
|
| `FIRECRAWL_API_KEY` | Firecrawl API key for web crawling |
|
|
| `UNSTRUCTURED_API_KEY` | Unstructured.io API key (required if `ETL_SERVICE=UNSTRUCTURED`) |
|
|
| `LLAMA_CLOUD_API_KEY` | LlamaCloud API key (required if `ETL_SERVICE=LLAMACLOUD`) |
|
|
|
|
### Connector OAuth Keys
|
|
|
|
Uncomment the connectors you want to use. Redirect URIs follow the pattern `http://localhost:8000/api/v1/auth/<connector>/connector/callback`.
|
|
|
|
| Connector | Variables |
|
|
|-----------|-----------|
|
|
| Google Drive / Gmail / Calendar | `GOOGLE_DRIVE_REDIRECT_URI`, `GOOGLE_GMAIL_REDIRECT_URI`, `GOOGLE_CALENDAR_REDIRECT_URI` |
|
|
| Notion | `NOTION_CLIENT_ID`, `NOTION_CLIENT_SECRET`, `NOTION_REDIRECT_URI` |
|
|
| Slack | `SLACK_CLIENT_ID`, `SLACK_CLIENT_SECRET`, `SLACK_REDIRECT_URI` |
|
|
| Discord | `DISCORD_CLIENT_ID`, `DISCORD_CLIENT_SECRET`, `DISCORD_BOT_TOKEN`, `DISCORD_REDIRECT_URI` |
|
|
| Jira & Confluence | `ATLASSIAN_CLIENT_ID`, `ATLASSIAN_CLIENT_SECRET`, `JIRA_REDIRECT_URI`, `CONFLUENCE_REDIRECT_URI` |
|
|
| Linear | `LINEAR_CLIENT_ID`, `LINEAR_CLIENT_SECRET`, `LINEAR_REDIRECT_URI` |
|
|
| ClickUp | `CLICKUP_CLIENT_ID`, `CLICKUP_CLIENT_SECRET`, `CLICKUP_REDIRECT_URI` |
|
|
| Airtable | `AIRTABLE_CLIENT_ID`, `AIRTABLE_CLIENT_SECRET`, `AIRTABLE_REDIRECT_URI` |
|
|
| Microsoft Teams | `TEAMS_CLIENT_ID`, `TEAMS_CLIENT_SECRET`, `TEAMS_REDIRECT_URI` |
|
|
|
|
For Airtable, create an OAuth integration at the [Airtable Developer Hub](https://airtable.com/create/oauth).
|
|
|
|
### Observability (optional)
|
|
|
|
| Variable | Description |
|
|
|----------|-------------|
|
|
| `LANGSMITH_TRACING` | Enable LangSmith tracing (`true` / `false`) |
|
|
| `LANGSMITH_ENDPOINT` | LangSmith API endpoint |
|
|
| `LANGSMITH_API_KEY` | LangSmith API key |
|
|
| `LANGSMITH_PROJECT` | LangSmith project name |
|
|
|
|
### Advanced (optional)
|
|
|
|
| Variable | Description | Default |
|
|
|----------|-------------|---------|
|
|
| `SCHEDULE_CHECKER_INTERVAL` | How often to check for scheduled connector tasks (e.g. `5m`, `1h`) | `5m` |
|
|
| `RERANKERS_ENABLED` | Enable document reranking for improved search | `FALSE` |
|
|
| `RERANKERS_MODEL_NAME` | Reranker model name (e.g. `ms-marco-MiniLM-L-12-v2`) | |
|
|
| `RERANKERS_MODEL_TYPE` | Reranker model type (e.g. `flashrank`) | |
|
|
| `PAGES_LIMIT` | Max pages per user for ETL services | unlimited |
|
|
|
|
---
|
|
|
|
## Docker Services
|
|
|
|
| Service | Description |
|
|
|---------|-------------|
|
|
| `db` | PostgreSQL with pgvector extension |
|
|
| `redis` | Message broker for Celery |
|
|
| `backend` | FastAPI application server |
|
|
| `celery_worker` | Background task processing (document indexing, etc.) |
|
|
| `celery_beat` | Periodic task scheduler (connector sync) |
|
|
| `electric` | Electric SQL — real-time sync for the frontend |
|
|
| `frontend` | Next.js web application |
|
|
|
|
All services start automatically with `docker compose up -d`.
|
|
|
|
---
|
|
|
|
## Updating
|
|
|
|
**Option 1 — Watchtower (recommended):**
|
|
|
|
```bash
|
|
docker run --rm -v /var/run/docker.sock:/var/run/docker.sock \
|
|
nickfedor/watchtower --run-once \
|
|
--label-filter "com.docker.compose.project=surfsense"
|
|
```
|
|
|
|
<Callout type="warn">
|
|
Use `nickfedor/watchtower`. The original `containrrr/watchtower` is no longer maintained and may fail with newer Docker versions.
|
|
</Callout>
|
|
|
|
**Option 2 — Manual:**
|
|
|
|
```bash
|
|
cd surfsense # or SurfSense/docker if you cloned manually
|
|
docker compose pull && docker compose up -d
|
|
```
|
|
|
|
Database migrations are applied automatically on every startup.
|
|
|
|
---
|
|
|
|
## Useful Commands
|
|
|
|
```bash
|
|
# View logs (all services)
|
|
docker compose logs -f
|
|
|
|
# View logs for a specific service
|
|
docker compose logs -f backend
|
|
docker compose logs -f electric
|
|
|
|
# Stop all services
|
|
docker compose down
|
|
|
|
# Restart a specific service
|
|
docker compose restart backend
|
|
|
|
# Stop and remove all containers + volumes (destructive!)
|
|
docker compose down -v
|
|
```
|
|
|
|
---
|
|
|
|
## Troubleshooting
|
|
|
|
- **Ports already in use** — Change the relevant `*_PORT` variable in `.env` and restart.
|
|
- **Permission errors on Linux** — You may need to prefix `docker` commands with `sudo`.
|
|
- **Electric SQL not connecting** — Check `docker compose logs electric`. If it shows `domain does not exist: db`, ensure `ELECTRIC_DATABASE_URL` is not set to a stale value in `.env`.
|
|
- **Real-time updates not working in browser** — Open DevTools → Console and look for `[Electric]` errors. Check that `NEXT_PUBLIC_ELECTRIC_URL` matches the running Electric SQL address.
|
|
- **Line ending issues on Windows** — Run `git config --global core.autocrlf true` before cloning.
|