n8n-nodes-alterlab

Web scraping node for n8n with anti-bot bypass, JavaScript rendering, and structured data extraction.

Get Started Free → $1 free balance on signup — up to 5,000 scrapes.

Why Use AlterLab Instead of the HTTP Request Node?

The n8n HTTP Request node works for simple APIs but fails on real-world websites. It cannot bypass Cloudflare, DataDome, PerimeterX, or Akamai. It cannot render JavaScript. It cannot solve CAPTCHAs.

AlterLab replaces chains of Browserless, proxy rotators, and custom retry logic with one node:

Capability	HTTP Request Node	AlterLab Node
Anti-bot bypass (Cloudflare, DataDome, Akamai)	No	Yes — automatic
JavaScript rendering (React, Angular, Vue SPAs)	No	Yes — full Chromium
Structured data extraction (JSON, Schema.org)	No	Yes — built-in profiles
Smart tier escalation (cheapest method first)	N/A	Yes — saves 60-80% vs browser-only
Residential proxy rotation	No	Yes — 195+ countries
Screenshot and PDF capture	No	Yes
OCR text extraction from images	No	Yes

How Does AlterLab Web Scraping Work?

AlterLab uses a multi-tier scraping architecture that automatically selects the cheapest method capable of fetching each URL:

Curl ($0.0002/req) — Direct HTTP for static pages, RSS feeds, public APIs
HTTP ($0.0003/req) — TLS fingerprint rotation for moderately protected sites
Stealth ($0.0005/req) — Browser impersonation for Cloudflare/DataDome-protected sites
Light JS ($0.0007/req) — Lightweight JS extraction from server-rendered HTML
Browser ($0.001/req) — Full headless Chromium for JavaScript-heavy SPAs

Auto mode starts at Tier 1 and escalates only when blocked. Most websites resolve at Tiers 1-2, so $1 gets you 1,000 to 5,000 scrapes depending on the sites you target.

Installation

Install from n8n UI

In your n8n instance: Settings → Community Nodes → Install → n8n-nodes-alterlab

Install via CLI

npm install n8n-nodes-alterlab

Connect Your Account

Option A — One-click OAuth (recommended):

Add an AlterLab node to your workflow
Click the credential dropdown → Create New → AlterLab OAuth2 API
Click Connect → sign in → done. No API key copy-paste needed.

Option B — API key:

Sign up free and copy your API key
In n8n: Credentials → New → AlterLab API → paste key

Quick Start — Scrape a Page in 30 Seconds

Add the AlterLab node to any workflow
Enter a URL (e.g., https://www.amazon.com/dp/B0BSHF7WHW)
Run

The node returns markdown, structured JSON, raw HTML, and metadata in a flat JSON object — ready for the next node in your workflow:

{{ $json.markdown }}            → Clean markdown (best for LLMs and AI agents)
{{ $json.text }}                → Plain text
{{ $json.json }}                → Structured data (Schema.org, extracted fields)
{{ $json.html }}                → Raw HTML
{{ $json.title }}               → Page title
{{ $json.filteredContent }}     → Custom schema extraction results
{{ $json.billing.cost }}        → Amount charged from your balance
{{ $json.billing.tier }}        → Scraping method used (curl, http, stealth, etc.)
{{ $json.billing.suggestion }}  → Cost optimization tip
{{ $json.screenshotUrl }}       → Screenshot URL (if enabled)
{{ $json.pdfUrl }}              → PDF URL (if enabled)

What Can You Scrape with AlterLab?

E-Commerce Product Data (Amazon, Walmart, Shopify)

Scrape Amazon, Walmart, Target, Best Buy, Shopify stores, and any product page. The Product extraction profile returns structured JSON: product name, price, currency, rating, review count, availability, image URLs, and description.

News Articles and Blog Content

Scrape news sites, blogs, and publications. The Article extraction profile returns: title, author, published date, body text as clean markdown, and featured image URL. Markdown output is optimized for LLM context windows.

Job Board Listings (Indeed, LinkedIn, Glassdoor)

Scrape Indeed, LinkedIn job posts, Glassdoor, and company career pages. The Job Posting profile returns: job title, company, location, salary range, description, and requirements as structured JSON.

Any Website with Custom Extraction

Define a JSON Schema for your exact output structure, or write a natural language extraction prompt like "extract the price, title, and all review texts as an array". AlterLab maps page content to your schema automatically.

n8n Workflow Examples

Price Monitoring Automation

Schedule → AlterLab (Product profile) → Compare to Google Sheet → IF price dropped → Slack notification

Monitor competitor prices daily. AlterLab handles anti-bot protection on e-commerce sites. Compare extracted prices against stored values and alert your team when prices change.

AI-Powered Content Pipeline

Schedule → AlterLab (Article profile, Markdown output) → OpenAI Summarize → Notion Database

Scrape industry news sources, get clean markdown (not messy HTML), summarize with GPT-4, and store in your knowledge base. AlterLab's markdown output is token-efficient for LLM processing.

Lead Generation from Job Boards

Schedule → AlterLab (Job Posting profile) → Filter by keywords → Airtable → Email notification

Monitor job boards for roles that match your product. Extract structured listings, filter for relevant titles, and push qualified leads to your CRM.

Competitor Intelligence Dashboard

Schedule → AlterLab (Custom schema) → Compare to previous scrape → Google Sheets → Looker Studio

Scrape competitor pages weekly. Define a custom schema for the data points you care about (pricing, features, team size). Track changes over time in a dashboard.

Real Estate Listing Monitor

Schedule → AlterLab (Custom schema) → IF new listing → Telegram notification

Monitor property listing sites for new entries matching your criteria. Extract price, location, square footage, and images. Get notified the moment a matching property appears.

Pricing — Pay-As-You-Go Web Scraping

No subscriptions. No monthly minimums. Add balance and use it whenever you need it.

Base Scraping Costs

Tier	Cost per Request	Use Case
Curl	$0.0002	Static pages, RSS feeds, public APIs
HTTP	$0.0003	Sites with basic TLS fingerprinting
Stealth	$0.0005	Cloudflare, DataDome, PerimeterX protected sites
Light JS	$0.0007	Server-rendered pages needing JSON extraction
Browser	$0.001	Full JavaScript SPAs (React, Angular, Vue)

Optional Add-Ons

Add-On	Extra Cost per Request	Description
JavaScript Rendering	+$0.0006	Headless Chromium for dynamic content
Screenshot Capture	+$0.0002	Full-page PNG screenshot
PDF Export	+$0.0004	Rendered page as downloadable PDF
OCR Text Extraction	+$0.001	Extract text from images on the page
Premium Residential Proxy	+$0.0002	Geo-targeted proxy (US, DE, GB, JP, and 190+ more)

Node Reference

Input Parameters

Parameter	Required	Default	Description
URL	Yes	—	The webpage URL to scrape
Mode	No	`auto`	Scraping mode: `auto`, `html`, `js`, `pdf`, `ocr`

Advanced Options

Option	Description
Extraction Profile	Pre-built schemas: Product, Article, Job Posting, FAQ, Recipe, Event
Extraction Prompt	Natural language instructions for custom data extraction
Extraction Schema	JSON Schema definition for structured output
Render JavaScript	Enable full Chromium rendering for SPAs and dynamic content
Screenshot	Capture a full-page PNG screenshot
PDF Export	Generate a PDF of the rendered page
OCR	Extract text from images on the page
Proxy	Route through residential proxies with country targeting
Cache	Cache responses from 60 seconds to 24 hours
Cost Controls	Set maximum spend per request, force specific tiers, prefer cost or speed

Frequently Asked Questions

How does AlterLab bypass anti-bot protection like Cloudflare?

AlterLab uses a multi-tier system that automatically escalates scraping methods. It starts with a simple HTTP request. If the site blocks it, AlterLab retries with TLS fingerprint rotation, then browser impersonation, then a full headless Chromium browser — all transparently. You send a URL and get content back. The anti-bot bypass is fully automatic.

Can AlterLab scrape JavaScript-heavy websites (React, Angular, Vue)?

Yes. Set the mode to js or enable "Render JavaScript" in the Advanced Options. AlterLab spins up a full headless Chromium browser, renders the page including all JavaScript, waits for dynamic content to load, then extracts content from the fully rendered DOM.

How is AlterLab different from Apify, Browserless, or ScrapingBee?

AlterLab starts at $0.0002 per request — 20x cheaper than most scraping APIs — because it only uses expensive browser rendering when a site actually requires it. Most scraping APIs charge browser-tier prices for every request. AlterLab's smart tier escalation means you only pay for what each site requires. No subscriptions, no monthly minimums.

Can I scrape Amazon, Walmart, and other e-commerce sites?

Yes. AlterLab handles all major e-commerce anti-bot protection including Cloudflare, DataDome, PerimeterX, and Akamai. The Product extraction profile returns structured JSON with product name, price, rating, availability, and images — ready for price monitoring or catalog building.

Does AlterLab work with n8n Cloud and self-hosted n8n?

Yes, both. Install via Settings → Community Nodes on n8n Cloud, or npm install n8n-nodes-alterlab for self-hosted. OAuth2 authentication works on both — click Connect, sign in, done.

Is there rate limiting?

Free-tier accounts have rate limits. Adding any balance to your account removes rate limits. Concurrent request limits scale with your balance.

What output format is best for AI and LLM workflows?

Use markdown format (the default). It preserves document structure — headings, tables, lists, links — while being token-efficient. GPT-4, Claude, and other LLMs process markdown significantly better than raw HTML. AlterLab's markdown output is optimized for AI agent context windows.

Can I use AlterLab for large-scale scraping (thousands of URLs)?

Yes. The n8n node processes input items in a loop, so you can feed it thousands of URLs from a spreadsheet, database query, or upstream node. Enable caching and cost controls to manage spend at scale. Async job polling handles long-running scrapes automatically.

Contributing

git clone https://github.com/RapierCraft/n8n-nodes-alterlab.git
cd n8n-nodes-alterlab
npm install
npm run build
npm run lint

CI/CD

This project uses GitHub Actions for automation:

CI — Runs on every push and PR to main. Builds and type-checks across Node.js 18, 20, and 22. Verifies all dist output files are present.
Publish — Automatically publishes to npm when a GitHub Release is created. Uses npm provenance for supply-chain security.
Dependabot — Automated weekly dependency updates for npm packages and GitHub Actions.

Required GitHub Secrets

Secret	Description	Where to Get It
`NPM_TOKEN`	npm automation token for publishing	npmjs.com → Access Tokens → Generate — use "Automation" type

Support

License

MIT

Name		Name	Last commit message	Last commit date
Latest commit History 21 Commits
.github		.github
assets		assets
credentials		credentials
nodes/AlterLab		nodes/AlterLab
.gitignore		.gitignore
.npmignore		.npmignore
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
README.md		README.md
gulpfile.js		gulpfile.js
package-lock.json		package-lock.json
package.json		package.json
tsconfig.json		tsconfig.json

Folders and files

Latest commit

History

Repository files navigation

n8n-nodes-alterlab

Why Use AlterLab Instead of the HTTP Request Node?

How Does AlterLab Web Scraping Work?

Installation

Install from n8n UI

Install via CLI

Connect Your Account

Quick Start — Scrape a Page in 30 Seconds

What Can You Scrape with AlterLab?

E-Commerce Product Data (Amazon, Walmart, Shopify)

News Articles and Blog Content

Job Board Listings (Indeed, LinkedIn, Glassdoor)

Any Website with Custom Extraction

n8n Workflow Examples

Price Monitoring Automation

AI-Powered Content Pipeline

Lead Generation from Job Boards

Competitor Intelligence Dashboard

Real Estate Listing Monitor

Pricing — Pay-As-You-Go Web Scraping

Base Scraping Costs

Optional Add-Ons

Node Reference

Input Parameters

Advanced Options

Frequently Asked Questions

How does AlterLab bypass anti-bot protection like Cloudflare?

Can AlterLab scrape JavaScript-heavy websites (React, Angular, Vue)?

How is AlterLab different from Apify, Browserless, or ScrapingBee?

Can I scrape Amazon, Walmart, and other e-commerce sites?

Does AlterLab work with n8n Cloud and self-hosted n8n?

Is there rate limiting?

What output format is best for AI and LLM workflows?

Can I use AlterLab for large-scale scraping (thousands of URLs)?

Contributing

CI/CD

Required GitHub Secrets

Support

License

About

Topics

Resources

License

Code of conduct

Contributing

Uh oh!

Stars

Watchers

Forks

Releases 9

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages