Tutorials.

Working code, real numbers, no fluff. The patterns behind every demo on this site.

7 min · Beginner

Getting Started with Web Scraping in 2026: From Zero to First Working Scraper in 30 Minutes

If you've never scraped a website before, start here. The minimum tools, the first working script, and the three things that will trip you up. Written for total beginners; no Python experience assumed.
3 min · Beginner

How to Scrape eBay Listings in 2026

eBay is the friendliest major e-commerce scraping target — light anti-bot, generous official API (5k requests/day free), and CSS structures that haven't drifted much in years. Here's the working stack.
4 min · Beginner

How to Scrape Reddit in 2026: Use the Official API (It's Cheap and the Workarounds Aren't)

Reddit closed public scraping in 2023 but kept the API affordable. PRAW + free OAuth tier handles 90% of use cases. The DIY scraping route exists but is brittle, ToS-risky, and unnecessary.
4 min · Beginner

How to Scrape Wikipedia (The Easy Target Everyone Overcomplicates)

Wikipedia is the simplest legitimate scraping target on the public internet. CC-BY-SA license, official APIs, no anti-bot. Here are the four ways to extract data and which to use when.
5 min · Beginner

How to Scrape YouTube Videos, Transcripts, and Channel Data in 2026

YouTube has three things you might want to extract: video files, transcripts, and metadata. Each has its own toolchain. yt-dlp + youtube-transcript-api + the Data API v3 cover 99% of use cases.
8 min · Beginner

Web Scraping FAQ: Every Question I Get Asked

Direct answers to the 25 most-common web scraping questions: legality, costs, tools, anti-bot, languages, time-to-build, what to do when sites change. No vendor weasel-language.
7 min · Beginner

Web Scraping Glossary: Every Term Defined Plainly

If you're new to web scraping you'll see jargon everywhere — TLS fingerprinting, headless browsers, user agents, rate limits, residential proxies. This is every term you'll encounter, defined in one or two sentences each, in plain English.
6 min · Beginner

Web Scraping Legal & Ethics: 2026 State of Play

What's legal, what's not, and where the gray zones are. The hiQ ruling, GDPR, CFAA, ToS, robots.txt, and the practical rules I follow on every job. Not legal advice — but the realistic landscape.
6 min · Intermediate

100 Production Web Scrapers, One Repo: The Patterns That Repeat

After shipping 100 scrapers across 40+ brief classes, the patterns are obvious. The full taxonomy of web-scraping work, with a real example for each.
6 min · Intermediate

Why a $5/mo VPS Beats a $1,200/mo ScrapingBee Plan

The actual stack, the actual cost math, and the operational discipline that turns a $5 Hetzner box into a 100k-page-per-day scraping pipeline. Pipeline-as-product, not script-as-deliverable.
5 min · Intermediate

Bypassing Cloudflare, DataDome, and PerimeterX in 2026: A Working Playbook

How to scrape sites behind modern anti-bot stacks without paying $1,200/mo for ScrapingBee. curl_cffi, nodriver, residential rotation, headers, and the math behind each choice.
6 min · Intermediate

Best Residential Proxy Services 2026: Honest Comparison (Webshare vs Bright Data vs Oxylabs vs IPRoyal)

Tested all four for production scraping work. The pricing is opaque, the per-GB math is misleading, and the 'best' depends entirely on your volume and country mix. Here's the math, the gotchas, and which one I actually use.
5 min · Intermediate

How to Scrape Amazon Product Data in 2026 (And Whether You Should)

Amazon is the hardest mainstream scraping target — Cloudflare-equivalent anti-bot, aggressive ToS enforcement, and a paid official API. Here's what actually works, what gets you blocked, and when you should just pay Apify $50 instead.
3 min · Intermediate

How to Scrape Google Search Results in 2026 (and the Two Real Alternatives)

Google search is the highest-friction scraping target on the public web — TLS fingerprinting, CAPTCHA escalation, IP-rotation requirements. Here's what works at small scale, what works at large scale, and the two cheap alternatives that solve 90% of use cases.
4 min · Intermediate

How to Scrape Indeed Job Listings in 2026

Indeed has Cloudflare Turnstile, aggressive anti-bot, and no public API for non-employers. Here's the working DIY approach for low volume, the official ATS partner path for serious work, and the public-data alternatives.
5 min · Intermediate

How to Scrape LinkedIn in 2026 (Honest: You Can't Do It Safely)

LinkedIn has actively litigated scrapers since 2017 (and won most of the contract-law cases). The hiQ ruling does not protect you from the contract claim. Here's the realistic landscape and the four legitimate alternatives.
4 min · Intermediate

How to Scrape Twitter / X in 2026 (Honest: Don't, Use the API)

Twitter/X aggressively litigates scrapers, broke every public scraping library in 2023, and gates content behind login. The official API is your real answer. Here's the unvarnished landscape and a working alternative for the 5% of cases where the API genuinely doesn't fit.
4 min · Intermediate

How to Scrape Yelp Business Listings in 2026

Yelp has anti-bot at the Cloudflare-Turnstile tier and an official API ($95/mo) for the use cases people typically want. Here's the working DIY approach for low volume, the API path for serious work, and the lead-gen alternatives.
5 min · Intermediate

Scrapy vs Playwright vs Selenium: 2026 Decision Tree (with the Honest Verdict)

Three tools for three different jobs. Most tutorials mix them up. Here's when each one wins, when each one loses, and the simple flowchart that picks the right tool for any scraping brief.
6 min · Intermediate

SEC EDGAR + XBRL: From Filings to Clean CSV in 30 Seconds

How to pull structured financial data from SEC filings without paying $20K/year for Bloomberg or $400/month for AlphaSense. The XBRL multi-candidate problem and the resolver that solves it.
6 min · Intermediate

Self-Healing AI Web Extractors: A Complete Implementation Guide

How to build web scrapers that survive site redesigns. LLM + JSON Schema as the contract, not CSS selectors. Stress-tested against full DOM scrambles. Working code, real numbers.
6 min · Intermediate

Web Scraping Tools Comparison 2026: Scrapy vs Playwright vs Beautiful Soup vs ScrapingBee vs DIY

Honest, no-affiliate comparison of every web scraping tool you'll evaluate. When to use Scrapy, when to use Playwright, when to use Beautiful Soup, when to pay for ScrapingBee/Bright Data/Apify, and when to roll your own. Decision tree included.