Tutorials.
Working code, real numbers, no fluff. The patterns behind every demo on this site.
-
7 min · Beginner
Getting Started with Web Scraping in 2026: From Zero to First Working Scraper in 30 Minutes
If you've never scraped a website before, start here. The minimum tools, the first working script, and the three things that will trip you up. Written for total beginners; no Python experience assumed.
-
3 min · Beginner
How to Scrape eBay Listings in 2026
eBay is the friendliest major e-commerce scraping target — light anti-bot, generous official API (5k requests/day free), and CSS structures that haven't drifted much in years. Here's the working stack.
-
4 min · Beginner
How to Scrape Reddit in 2026: Use the Official API (It's Cheap and the Workarounds Aren't)
Reddit closed public scraping in 2023 but kept the API affordable. PRAW plus the free OAuth tier handles 90% of use cases. The DIY scraping route exists but is brittle, ToS-risky, and unnecessary.
-
4 min · Beginner
How to Scrape Wikipedia (The Easy Target Everyone Overcomplicates)
Wikipedia is the simplest legitimate scraping target on the public internet: CC BY-SA licensing, official APIs, and no anti-bot measures. Here are the four ways to extract data and which to use when.
-
5 min · Beginner
How to Scrape YouTube Videos, Transcripts, and Channel Data in 2026
YouTube has three things you might want to extract: video files, transcripts, and metadata. Each has its own toolchain. yt-dlp + youtube-transcript-api + the Data API v3 cover 99% of use cases.
-
8 min · Beginner
Web Scraping FAQ: Every Question I Get Asked
Direct answers to the 25 most common web scraping questions: legality, costs, tools, anti-bot, languages, time-to-build, what to do when sites change. No vendor weasel language.
-
7 min · Beginner
Web Scraping Glossary: Every Term Defined Plainly
If you're new to web scraping you'll see jargon everywhere — TLS fingerprinting, headless browsers, user agents, rate limits, residential proxies. This is every term you'll encounter, defined in one or two sentences each, in plain English.
-
6 min · Beginner
Web Scraping Legal & Ethics: 2026 State of Play
What's legal, what's not, and where the gray zones are. The hiQ ruling, GDPR, CFAA, ToS, robots.txt, and the practical rules I follow on every job. Not legal advice — but the realistic landscape.
-
6 min · Intermediate
100 Production Web Scrapers, One Repo: The Patterns That Repeat
After shipping 100 scrapers across 40+ brief classes, the patterns are obvious. The full taxonomy of web-scraping work, with a real example for each.
-
6 min · Intermediate
Why a $5/mo VPS Beats a $1,200/mo ScrapingBee Plan
The actual stack, the actual cost math, and the operational discipline that turns a $5 Hetzner box into a 100k-page-per-day scraping pipeline. Pipeline-as-product, not script-as-deliverable.
-
5 min · Intermediate
Bypassing Cloudflare, DataDome, and PerimeterX in 2026: A Working Playbook
How to scrape sites behind modern anti-bot stacks without paying $1,200/mo for ScrapingBee. curl_cffi, nodriver, residential rotation, headers, and the math behind each choice.
-
6 min · Intermediate
Best Residential Proxy Services 2026: Honest Comparison (Webshare vs Bright Data vs Oxylabs vs IPRoyal)
Tested all four for production scraping work. The pricing is opaque, the per-GB math is misleading, and the 'best' depends entirely on your volume and country mix. Here's the math, the gotchas, and which one I actually use.
-
5 min · Intermediate
How to Scrape Amazon Product Data in 2026 (And Whether You Should)
Amazon is the hardest mainstream scraping target — Cloudflare-equivalent anti-bot, aggressive ToS enforcement, and a paid official API. Here's what actually works, what gets you blocked, and when you should just pay Apify $50 instead.
-
3 min · Intermediate
How to Scrape Google Search Results in 2026 (and the Two Real Alternatives)
Google search is the highest-friction scraping target on the public web — TLS fingerprinting, CAPTCHA escalation, IP-rotation requirements. Here's what works at small scale, what works at large scale, and the two cheap alternatives that solve 90% of use cases.
-
4 min · Intermediate
How to Scrape Indeed Job Listings in 2026
Indeed has Cloudflare Turnstile, aggressive anti-bot, and no public API for non-employers. Here's the working DIY approach for low volume, the official ATS partner path for serious work, and the public-data alternatives.
-
5 min · Intermediate
How to Scrape LinkedIn in 2026 (Honest: You Can't Do It Safely)
LinkedIn has actively litigated against scrapers since 2017 (and won most of the contract-law cases). The hiQ ruling does not protect you from the contract claim. Here's the realistic landscape and the four legitimate alternatives.
-
4 min · Intermediate
How to Scrape Twitter / X in 2026 (Honest: Don't, Use the API)
Twitter/X aggressively litigates against scrapers, broke every public scraping library in 2023, and gates content behind login. The official API is your real answer. Here's the unvarnished landscape and a working alternative for the 5% of cases where the API genuinely doesn't fit.
-
4 min · Intermediate
How to Scrape Yelp Business Listings in 2026
Yelp has anti-bot at the Cloudflare-Turnstile tier and an official API ($95/mo) for the use cases people typically want. Here's the working DIY approach for low volume, the API path for serious work, and the lead-gen alternatives.
-
5 min · Intermediate
Scrapy vs Playwright vs Selenium: 2026 Decision Tree (with the Honest Verdict)
Three tools for three different jobs. Most tutorials mix them up. Here's when each one wins, when each one loses, and the simple flowchart that picks the right tool for any scraping brief.
-
6 min · Intermediate
SEC EDGAR + XBRL: From Filings to Clean CSV in 30 Seconds
How to pull structured financial data from SEC filings without paying $20K/year for Bloomberg or $400/month for AlphaSense. The XBRL multi-candidate problem and the resolver that solves it.
-
6 min · Intermediate
Self-Healing AI Web Extractors: A Complete Implementation Guide
How to build web scrapers that survive site redesigns. LLM + JSON Schema as the contract, not CSS selectors. Stress-tested against full DOM scrambles. Working code, real numbers.
-
6 min · Intermediate
Web Scraping Tools Comparison 2026: Scrapy vs Playwright vs Beautiful Soup vs ScrapingBee vs DIY
Honest, no-affiliate comparison of every web scraping tool you'll evaluate. When to use Scrapy, when to use Playwright, when to use Beautiful Soup, when to pay for ScrapingBee/Bright Data/Apify, and when to roll your own. Decision tree included.