Insights & Engineering

Deep dives into web data infrastructure, extraction techniques, and the future of structured data at scale.

Latest Articles

How to Web Scrape with Puppeteer and NodeJS 2026 Guide

TL;DR: Puppeteer gives you full control of a headless Chrome instance from Node.js, making it the go-to tool for scraping JavaScript-rendered pages. This guide walks you through installation, selector-based extraction, infinite scroll, form login, request interception, stealth plugins, structured data export, and Docker deployment, so you can move from a toy script to a production-grade scraper.

Gabriel Cioci15 min read
May 1, 2026