Add Puppeteer fallback for Cloudflare-protected sites

When HTTP requests are blocked with 403 (e.g., B&H Photo's Cloudflare
protection), the scraper now automatically retries using a headless
Chrome browser via Puppeteer. Also updated Dockerfile to include
Chromium dependencies for container deployment.

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
This commit is contained in:
clucraft 2026-01-21 20:55:49 -05:00
parent 7c8ab0721b
commit c96861fefb
4 changed files with 1415 additions and 36 deletions

View file

@ -18,7 +18,8 @@
"express": "^4.18.2",
"jsonwebtoken": "^9.0.2",
"node-cron": "^3.0.3",
"pg": "^8.11.3"
"pg": "^8.11.3",
"puppeteer": "^22.0.0"
},
"devDependencies": {
"@types/bcrypt": "^5.0.2",