webclaw/crates/webclaw-fetch/src
devnen fe6e9b5d28 feat(fetch): periodic progress stderr line on slow fetches
Webclaw's default -t timeout is 30s; slow sites previously sat
silently with no feedback. Now during a fetch, every 10s of elapsed
time webclaw writes one line to stderr:

  # webclaw: still fetching <URL> (Ns)

Fetches completing in under 10s emit nothing (the timer never fires).
Stdout output is untouched - pure feedback signal on stderr.

No timeout change. No new flags. Default behavior is augmented at
stderr only.

Implemented via tokio::select! between the fetch future and a
tokio::time::interval. Latency cost: a single tokio task spawn
and a 10s tick - microseconds on the fast path.
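
The control flow described above (wait on the work, wake on a timer, print, keep waiting) can be sketched without tokio. This is only a std-only analogue that uses a helper thread and `mpsc::Receiver::recv_timeout` in place of `tokio::select!` and `tokio::time::interval`; it is not the code in progress.rs. The names `progress_line` and `run_with_progress` are hypothetical, and the report callback stands in for the stderr write so the behavior can be checked in a test.

```rust
use std::sync::mpsc::{self, RecvTimeoutError};
use std::thread;
use std::time::{Duration, Instant};

/// Formats a progress line in the shape shown in the commit message.
/// (Hypothetical helper; the real formatting code is not shown in the source.)
fn progress_line(url: &str, elapsed_secs: u64) -> String {
    format!("# webclaw: still fetching {} ({}s)", url, elapsed_secs)
}

/// Runs `work` on a helper thread and calls `report` each time `interval`
/// passes without a result. Returns `None` if the worker died without
/// sending a value (for example, because it panicked).
fn run_with_progress<T, F, R>(url: &str, interval: Duration, work: F, mut report: R) -> Option<T>
where
    T: Send + 'static,
    F: FnOnce() -> T + Send + 'static,
    R: FnMut(String),
{
    let (tx, rx) = mpsc::channel();
    thread::spawn(move || {
        // The receiver may already be gone; ignoring the send error is fine here.
        let _ = tx.send(work());
    });

    let start = Instant::now();
    loop {
        match rx.recv_timeout(interval) {
            // Fast path: the result arrived before the timer fired.
            Ok(value) => return Some(value),
            // Timer fired first: report progress and keep waiting.
            Err(RecvTimeoutError::Timeout) => {
                report(progress_line(url, start.elapsed().as_secs()));
            }
            Err(RecvTimeoutError::Disconnected) => return None,
        }
    }
}
```

Calling `run_with_progress(url, Duration::from_secs(10), fetch, |line| eprintln!("{}", line))` with some blocking `fetch` closure would mirror the described behavior: work that finishes before the first interval never reaches the `Timeout` arm, so nothing is reported. One difference from a fixed-period interval is that each `recv_timeout` call restarts the wait, so reports can drift slightly later over time.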

10 new tests in webclaw-fetch::progress::tests (none ignored; the
slow-future test uses a 50ms test interval to keep cargo test fast).
Workspace test total: 710 -> 720.

(cherry picked from commit 06f065cb08)
2026-06-09 10:47:52 +02:00
Name | Last commit | Date
extractors | feat(reddit): parse old.reddit.com HTML instead of the dead .json API | 2026-06-04 17:36:02 +02:00
browser.rs | Bump to 0.5.4: SafariIos profile + Chrome fingerprint alignment + locale helper | 2026-04-23 12:58:24 +02:00
client.rs | feat(reddit): parse old.reddit.com HTML instead of the dead .json API | 2026-06-04 17:36:02 +02:00
cloud.rs | feat(core): endpoints module for API surface extraction from HTML and JS (#47) | 2026-05-19 19:05:16 +02:00
crawler.rs | polish(fetch,mcp): robots parser + firefox client cache + Acquire ordering (P3) (#23) | 2026-04-16 20:21:32 +02:00
document.rs | feat: replace custom TLS stack with wreq (BoringSSL), bump v0.3.3 | 2026-04-01 18:04:55 +02:00
error.rs | feat: replace custom TLS stack with wreq (BoringSSL), bump v0.3.3 | 2026-04-01 18:04:55 +02:00
fetcher.rs | feat(fetch): Fetcher trait so vertical extractors work under any HTTP backend | 2026-04-22 21:17:50 +02:00
lib.rs | feat(fetch): periodic progress stderr line on slow fetches | 2026-06-09 10:47:52 +02:00
linkedin.rs | Initial release: webclaw v0.1.0 — web content extraction for LLMs | 2026-03-23 18:31:11 +01:00
locale.rs | Bump to 0.5.4: SafariIos profile + Chrome fingerprint alignment + locale helper | 2026-04-23 12:58:24 +02:00
progress.rs | feat(fetch): periodic progress stderr line on slow fetches | 2026-06-09 10:47:52 +02:00
proxy.rs | Initial release: webclaw v0.1.0 — web content extraction for LLMs | 2026-03-23 18:31:11 +01:00
reddit.rs | feat(reddit): parse old.reddit.com HTML instead of the dead .json API | 2026-06-04 17:36:02 +02:00
sitemap.rs | polish(fetch,mcp): robots parser + firefox client cache + Acquire ordering (P3) (#23) | 2026-04-16 20:21:32 +02:00
tls.rs | fix: harden resource limits, path safety, and WASM build (#46) | 2026-05-19 17:03:52 +02:00
url_security.rs | fix(security): harden local fetch surfaces | 2026-05-12 12:00:25 +02:00