2026-04-02 11:33:43 +02:00
< p align = "center" >
< a href = "https://webclaw.io" >
< picture >
< source media = "(prefers-color-scheme: dark)" srcset = "https://raw.githubusercontent.com/0xMassi/webclaw/main/.github/banner.png" / >
< img src = "https://raw.githubusercontent.com/0xMassi/webclaw/main/.github/banner.png" alt = "webclaw" width = "700" / >
< / picture >
< / a >
< / p >
2026-03-23 18:31:11 +01:00
2026-04-02 11:33:43 +02:00
< h3 align = "center" >
One command to give your AI agent reliable web access.< br / >
< sub > No headless browser. No Puppeteer. No 403s.< / sub >
< / h3 >
2026-03-23 18:31:11 +01:00
2026-04-02 11:33:43 +02:00
< p align = "center" >
< a href = "https://www.npmjs.com/package/create-webclaw" > < img src = "https://img.shields.io/npm/dt/create-webclaw?style=for-the-badge&logo=npm&logoColor=white&label=Installs&color=CB3837" alt = "npm installs" / > < / a >
< a href = "https://github.com/0xMassi/webclaw" > < img src = "https://img.shields.io/github/stars/0xMassi/webclaw?style=for-the-badge&logo=github&logoColor=white&label=Stars&color=181717" alt = "Stars" / > < / a >
< a href = "https://github.com/0xMassi/webclaw/blob/main/LICENSE" > < img src = "https://img.shields.io/badge/License-AGPL--3.0-10B981?style=for-the-badge" alt = "License" / > < / a >
< / p >
---
## Quick Start
2026-03-23 18:31:11 +01:00
```bash
npx create-webclaw
```
2026-04-02 11:33:43 +02:00
That's it. Auto-detects your AI tools, downloads the MCP server, configures everything.
Works with **Claude Desktop** , **Claude Code** , **Cursor** , **Windsurf** , **VS Code** , **OpenCode** , **Codex CLI** , and **Antigravity** .
---
## The Problem
Your AI agent calls `fetch()` and gets a 403. Cloudflare, Akamai, and every major CDN fingerprint the TLS handshake and block non-browser clients before the request hits the server.
When it does work, you get 100KB+ of raw HTML — navigation, ads, cookie banners, scripts. Your agent burns 4,000+ tokens parsing noise.
## The Fix
webclaw impersonates Chrome 146 at the TLS protocol level. Perfect JA4 fingerprint. Perfect HTTP/2 Akamai hash. 99% bypass rate on 102 tested sites.
Then it extracts just the content — clean markdown, 67% fewer tokens.
```
Raw HTML webclaw
┌──────────────────────────────────┐ ┌──────────────────────────────────┐
│ < div class = "ad-wrapper" > │ │ # Breaking: AI Breakthrough │
│ < nav class = "global-nav" > │ │ │
│ < script > w i n d o w . _ _ N E X T _ D A T A _ _ │ │ R e s e a r c h e r s a c h i e v e d 9 4 % │
│ ={...8KB of JSON...}< / script > │ │ accuracy on cross-domain │
│ < div class = "social-share" > │ │ reasoning benchmarks. │
│ <!-- 142,847 characters --> │ │ │
│ │ │ ## Key Findings │
│ 4,820 tokens │ │ 1,590 tokens │
└──────────────────────────────────┘ └──────────────────────────────────┘
```
---
## What It Does
```bash
npx create-webclaw
```
1. Detects installed AI tools (Claude, Cursor, Windsurf, VS Code, OpenCode, Codex, Antigravity)
2. Downloads the `webclaw-mcp` binary for your platform (macOS arm64/x86, Linux x86/arm64)
3. Asks for your API key (optional — **works locally without one** )
4. Writes the MCP config for each detected tool
## 10 MCP Tools
After setup, your AI agent has access to:
2026-03-23 18:31:11 +01:00
2026-04-02 11:33:43 +02:00
| Tool | What it does | API key needed? |
|------|-------------|-----------------|
| **scrape** | Extract content from any URL | No |
| **crawl** | Recursively crawl a website | No |
| **search** | Web search + parallel scrape | Yes (Serper) |
| **map** | Discover URLs from sitemaps | No |
| **batch** | Extract multiple URLs in parallel | No |
| **extract** | LLM-powered structured extraction | Yes |
| **summarize** | Content summarization | Yes |
| **diff** | Track content changes | No |
| **brand** | Extract brand identity | No |
| **research** | Deep multi-page research | Yes |
2026-03-23 18:31:11 +01:00
2026-04-02 11:33:43 +02:00
**8 of 10 tools work fully offline.** No API key, no cloud, no tracking.
## Supported Tools
2026-03-23 18:31:11 +01:00
| Tool | Config location |
|------|----------------|
| Claude Desktop | `~/Library/Application Support/Claude/claude_desktop_config.json` |
| Claude Code | `~/.claude.json` |
| Cursor | `.cursor/mcp.json` |
| Windsurf | `~/.codeium/windsurf/mcp_config.json` |
| VS Code (Continue) | `~/.continue/config.json` |
2026-04-02 11:33:43 +02:00
| OpenCode | `~/.opencode/config.json` |
| Codex CLI | `~/.codex/config.json` |
| Antigravity | `~/.antigravity/mcp.json` |
2026-03-23 18:31:11 +01:00
2026-04-02 11:33:43 +02:00
## Sites That Work
2026-03-23 18:31:11 +01:00
2026-04-02 11:33:43 +02:00
webclaw gets through where default `fetch()` gets blocked:
Nike, Cloudflare, Bloomberg, Zillow, Indeed, Viagogo, Fansale, Wikipedia, Stripe, and 93 more. Tested on 102 sites with **99% success rate** .
## Alternative Install Methods
### Homebrew
```bash
brew tap 0xMassi/webclaw & & brew install webclaw
```
### Docker
```bash
docker run --rm ghcr.io/0xmassi/webclaw https://example.com
```
### Cargo
```bash
cargo install --git https://github.com/0xMassi/webclaw.git webclaw-cli
```
### Prebuilt Binaries
Download from [GitHub Releases ](https://github.com/0xMassi/webclaw/releases ) for macOS (arm64, x86_64) and Linux (x86_64, aarch64).
---
## Links
- [Website ](https://webclaw.io )
- [Documentation ](https://webclaw.io/docs )
- [GitHub ](https://github.com/0xMassi/webclaw )
- [TLS Library ](https://github.com/0xMassi/webclaw-tls )
- [Discord ](https://discord.gg/KDfd48EpnW )
- [Status ](https://status.webclaw.io )
## License
2026-03-23 18:31:11 +01:00
2026-04-02 11:33:43 +02:00
AGPL-3.0