From 23544f8facee15958ea1f60f2befef79f22c001e Mon Sep 17 00:00:00 2001 From: Valerio Date: Sun, 3 May 2026 21:17:23 +0200 Subject: [PATCH] docs(claude): note youtube.rs role and yt-dlp short-circuit in server The webclaw-core youtube module produces structured markdown but no transcript; document that and point at the production server's youtube_transcript.rs short-circuit for the full YoutubeData + caption text shape. --- CLAUDE.md | 1 + 1 file changed, 1 insertion(+) diff --git a/CLAUDE.md b/CLAUDE.md index c33d61f..b30bd84 100644 --- a/CLAUDE.md +++ b/CLAUDE.md @@ -38,6 +38,7 @@ Three binaries: `webclaw` (CLI), `webclaw-mcp` (MCP server), `webclaw-server` (R - `filter.rs` — CSS selector include/exclude filtering (ExtractionOptions) - `diff.rs` — Content change tracking engine (snapshot diffing) - `brand.rs` — Brand identity extraction from DOM structure and CSS +- `youtube.rs` — `ytInitialPlayerResponse` parser, structured markdown for `youtube.com/watch` URLs (title, channel, views, published, duration, description). Produces the legacy markdown shape — for transcripts and a structured `YoutubeData` block see the production server's `youtube_transcript.rs` short-circuit (yt-dlp via proxy pool). ### Fetch Modules (`webclaw-fetch`) - `client.rs` — FetchClient with wreq BoringSSL TLS impersonation; implements the public `Fetcher` trait so callers (including server adapters) can swap in alternative implementations