webclaw/crates/webclaw-fetch
Valerio 6316b1a6e7
Some checks are pending
CI / Test (push) Waiting to run
CI / Lint (push) Waiting to run
CI / Docs (push) Waiting to run
fix: handle raw newlines in JSON-LD strings
Sites like Bluesky emit JSON-LD with literal newline characters inside
string values (technically invalid JSON). Add sanitize_json_newlines()
fallback that escapes control characters inside quoted strings before
retrying the parse. This recovers ProfilePage, Product, and other
structured data that was previously silently dropped.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
2026-04-16 11:40:25 +02:00
..
src feat: add allow_subdomains and allow_external_links to CrawlConfig 2026-04-14 19:33:06 +02:00
tests fix: handle raw newlines in JSON-LD strings 2026-04-16 11:40:25 +02:00
Cargo.toml feat: replace custom TLS stack with wreq (BoringSSL), bump v0.3.3 2026-04-01 18:04:55 +02:00