mirror of
https://github.com/0xMassi/webclaw.git
synced 2026-06-08 22:25:12 +02:00
Initial release: webclaw v0.1.0 — web content extraction for LLMs
CLI + MCP server for extracting clean, structured content from any URL. 6 Rust crates, 10 MCP tools, TLS fingerprinting, 5 output formats. MIT Licensed | https://webclaw.io
This commit is contained in:
commit
c99ec684fa
79 changed files with 24074 additions and 0 deletions
43
env.example
Normal file
43
env.example
Normal file
|
|
@ -0,0 +1,43 @@
|
|||
# ============================================
|
||||
# Webclaw Configuration
|
||||
# Copy to .env and fill in your values
|
||||
# ============================================
|
||||
|
||||
# --- LLM Providers ---
|
||||
|
||||
# Ollama (local, default provider)
|
||||
OLLAMA_HOST=http://localhost:11434
|
||||
OLLAMA_MODEL=qwen3:8b
|
||||
|
||||
# OpenAI (optional cloud fallback)
|
||||
# OPENAI_API_KEY — set your OpenAI key
|
||||
# OPENAI_BASE_URL — defaults to https://api.openai.com/v1
|
||||
# OPENAI_MODEL — defaults to gpt-4o-mini
|
||||
|
||||
# Anthropic (optional cloud fallback)
|
||||
# ANTHROPIC_API_KEY — set your Anthropic key
|
||||
# ANTHROPIC_MODEL — defaults to claude-sonnet-4-20250514
|
||||
|
||||
# --- Proxy ---
|
||||
|
||||
# Single proxy
|
||||
# WEBCLAW_PROXY=http://user:pass@host:port
|
||||
|
||||
# Proxy file (one per line: host:port:user:pass)
|
||||
# WEBCLAW_PROXY_FILE=/path/to/proxies.txt
|
||||
|
||||
# --- Server (webclaw-server only) ---
|
||||
# WEBCLAW_PORT=3000
|
||||
# WEBCLAW_HOST=0.0.0.0
|
||||
# WEBCLAW_AUTH_KEY=your-auth-key
|
||||
# WEBCLAW_MAX_CONCURRENCY=50
|
||||
# WEBCLAW_JOB_TTL_SECS=3600
|
||||
# WEBCLAW_MAX_JOBS=100
|
||||
|
||||
# --- CLI LLM overrides ---
|
||||
# WEBCLAW_LLM_PROVIDER=ollama
|
||||
# WEBCLAW_LLM_MODEL=qwen3:8b
|
||||
# WEBCLAW_LLM_BASE_URL=http://localhost:11434
|
||||
|
||||
# --- Logging ---
|
||||
# WEBCLAW_LOG=info
|
||||
Loading…
Add table
Add a link
Reference in a new issue