webclaw/crates
Valerio 8d29382b25 feat: extract __NEXT_DATA__ into structured_data
Next.js pages embed server-rendered data in <script id="__NEXT_DATA__">.
That payload is now extracted as structured JSON (pageProps) into the
structured_data field.

Tested on 45 sites — 13 return rich structured data including prices,
product info, and page state not visible in the DOM.
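
As a rough illustration of the extraction step described above, the sketch below pulls the raw JSON text out of the `__NEXT_DATA__` script tag using only the standard library. It is not the code from webclaw-core, which is not shown here. The function name `extract_next_data` is hypothetical, and the sketch assumes a lowercase `<script>` tag, a double-quoted `id` attribute, and a payload with no literal `</script>` inside it. Parsing the result and selecting `pageProps` would need a JSON library and is left out.

```rust
/// Hypothetical helper (not the webclaw-core API): returns the raw JSON text
/// found inside `<script id="__NEXT_DATA__" ...> ... </script>`, if present.
fn extract_next_data(html: &str) -> Option<&str> {
    // Locate the id attribute that marks the Next.js payload.
    let id_pos = html.find("id=\"__NEXT_DATA__\"")?;

    // Walk back to the nearest '<' and check the attribute sits on a <script> tag.
    let tag_start = html[..id_pos].rfind('<')?;
    if !html[tag_start..].starts_with("<script") {
        return None;
    }

    // The payload starts right after the '>' that closes the opening tag...
    let body_start = id_pos + html[id_pos..].find('>')? + 1;
    // ...and ends at the next closing </script> (assumed not to occur in the JSON).
    let body_len = html[body_start..].find("</script>")?;

    let json = html[body_start..body_start + body_len].trim();
    if json.is_empty() {
        None
    } else {
        Some(json)
    }
}
```

Called on a fetched page body, this returns `Some(json)` for Next.js pages that ship the tag and `None` otherwise, so a caller could leave `structured_data` empty for sites that do not embed it.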

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
2026-04-02 16:04:51 +02:00
webclaw-cli feat: add --cookie-file support for JSON cookie files 2026-03-31 10:54:53 +02:00
webclaw-core feat: extract __NEXT_DATA__ into structured_data 2026-04-02 16:04:51 +02:00
webclaw-fetch style: cargo fmt 2026-04-01 18:25:40 +02:00
webclaw-llm Initial release: webclaw v0.1.0 — web content extraction for LLMs 2026-03-23 18:31:11 +01:00
webclaw-mcp feat: add --cookie-file support for JSON cookie files 2026-03-31 10:54:53 +02:00
webclaw-pdf Initial release: webclaw v0.1.0 — web content extraction for LLMs 2026-03-23 18:31:11 +01:00