webclaw/crates
Valerio 6316b1a6e7
fix: handle raw newlines in JSON-LD strings
Sites like Bluesky emit JSON-LD with literal newline characters inside
string values (technically invalid JSON). Add sanitize_json_newlines()
fallback that escapes control characters inside quoted strings before
retrying the parse. This recovers ProfilePage, Product, and other
structured data that was previously silently dropped.
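
The fallback described above can be sketched as follows. This is a minimal illustration, not the code from this commit: only the name sanitize_json_newlines() comes from the commit message, while its signature, the parse_with_fallback() helper, and the exact set of escaped characters are assumptions. The sketch tracks whether the scanner is inside a quoted string and escapes raw control characters only there, so structural whitespace between tokens is left alone.

```rust
/// Hypothetical sketch of a `sanitize_json_newlines()`-style fallback.
/// Escapes raw control characters that appear inside quoted JSON strings
/// and leaves everything outside strings unchanged.
fn sanitize_json_newlines(input: &str) -> String {
    let mut out = String::with_capacity(input.len());
    let mut in_string = false; // currently inside a quoted string
    let mut escaped = false; // previous character was a backslash inside a string

    for c in input.chars() {
        if !in_string {
            // Outside strings: copy verbatim, only watch for an opening quote.
            if c == '"' {
                in_string = true;
            }
            out.push(c);
            continue;
        }
        if escaped {
            // Character after a backslash belongs to an existing escape sequence.
            escaped = false;
            out.push(c);
            continue;
        }
        match c {
            '\\' => {
                escaped = true;
                out.push(c);
            }
            '"' => {
                in_string = false;
                out.push(c);
            }
            '\n' => out.push_str("\\n"),
            '\r' => out.push_str("\\r"),
            '\t' => out.push_str("\\t"),
            // Any other control character becomes a \uXXXX escape.
            c if (c as u32) < 0x20 => out.push_str(&format!("\\u{:04x}", c as u32)),
            _ => out.push(c),
        }
    }
    out
}

/// Hypothetical helper: try the strict parse first and retry on the
/// sanitized text only if that fails. `parse` stands in for whatever
/// JSON parser the crate actually uses.
fn parse_with_fallback<T, E>(
    raw: &str,
    parse: impl Fn(&str) -> Result<T, E>,
) -> Result<T, E> {
    parse(raw).or_else(|_| parse(&sanitize_json_newlines(raw)))
}
```

Running the sanitizer only after the strict parse fails keeps valid JSON-LD on the fast path and confines the rewrite to documents that would otherwise be dropped.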

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
2026-04-16 11:40:25 +02:00
webclaw-cli feat: add allow_subdomains and allow_external_links to CrawlConfig 2026-04-14 19:33:06 +02:00
webclaw-core fix: handle raw newlines in JSON-LD strings 2026-04-16 11:40:25 +02:00
webclaw-fetch fix: handle raw newlines in JSON-LD strings 2026-04-16 11:40:25 +02:00
webclaw-llm Initial release: webclaw v0.1.0 — web content extraction for LLMs 2026-03-23 18:31:11 +01:00
webclaw-mcp fix: MCP research saves to file, returns compact response 2026-04-03 16:05:45 +02:00
webclaw-pdf Initial release: webclaw v0.1.0 — web content extraction for LLMs 2026-03-23 18:31:11 +01:00