Amazon Scraper – Modern, robust and responsive full‑stack

This project was also built to learn and improve knowledge about APIs, backend, Node.js, etc.

It's also useful when searching for products.

Backend

[] [] [] [] [] [] [] []

Frontend

[] [] [] [] [] [] []

Testes (frontend)

[] []

Preview

A complete full‑stack project showcasing a modern product search experience for Amazon Brazil, with a production‑ready backend and a performant, responsive frontend.

It combines engineering best practices (security, caching, rate limits, observability) with modern UI/UX (Tailwind, Dark Mode, animations, filters, infinite scroll). Use it as a portfolio piece, learning base, or a foundation for advanced solutions.

Why this project matters

Demonstrates the full cycle: API → parsing (JSDOM) → modern UI → live metrics.
Production‑grade backend: rate limiting, cache, logs, Helmet, compression, health check, metrics, graceful shutdown.
Modern UX: Tailwind, Dark Mode, rich product cards, filters, animations, infinite scroll.
Clear separation of concerns, easy to understand and extend.

Architecture

server/ (Node.js + Express)
- REST endpoints: /api/scrape, /api/health, /api/metrics.
- Scraping with axios + jsdom and robust CSS selectors.
- Security/perf: helmet, compression, morgan, express-rate-limit.
- In‑memory cache with TTL to reduce latency and load.
- Input validation, centralized error handling, graceful shutdown.
client/ (Vite + Tailwind)
- Search with rich feedback (spinner, friendly errors).
- Product cards with hover, rating badge, external link.
- Persistent Dark Mode (localStorage), mobile‑first responsive grid.
- Filters (min/max price, min rating, Prime) and infinite scroll pagination.
- Metrics panel pulling /api/metrics (5s polling).

Simplified flow:

User searches → frontend calls /api/scrape?keyword=....
Backend fetches Amazon HTML, parses with jsdom, normalizes and returns data.
Frontend renders in batches (better UX), supports filters and live metrics.

Getting started

Prereq: Node 20.19+ or 22.12+ (recommended). On Windows, prefer nvm‑windows.

Install dependencies

cd C:\Users\User\Desktop\amazon_scraper_fullstackAPI-master
npm install
npm run install-client

Development (two terminals)

Backend

npm start

Frontend

cd client
npm run dev

Open: http://localhost:5173

Production (serve built frontend)

cd client
npm run build
cd ..
npm start

Open: http://localhost:3000

Environment variables (optional – create .env at the root)

PORT=3000
REQUEST_TIMEOUT_MS=12000
CACHE_TTL_MS=60000
RATE_LIMIT_MAX=15

Endpoints

GET /api/health – basic health check.
GET /api/metrics – in‑memory metrics (totalRequests, scrapeRequests, cacheHits, rateLimited, memory, uptime).
GET /api/scrape?keyword=keyboard – returns products with { id, title, price, rating, reviews, imageUrl, productUrl } and total.

Example (short):

{
  "success": true,
  "keyword": "keyboard",
  "products": [ { "id": 1, "title": "...", "price": "R$ 199,90" } ],
  "total": 24,
  "timestamp": "2024-..."
}

Key improvements

Frontend (UI/UX)
- Tailwind CSS with modern look, gradients and glassmorphism.
- Dark Mode toggle with persistence.
- Rich product cards, rating badge, “View on Amazon” button.
- Smooth loading spinner, friendly error messages, auto‑scroll to results.
- Filters: min/max price (BRL), min rating, Prime only.
- Infinite scroll with batched rendering.
- Live metrics panel (polling /api/metrics).
Backend (hardening)
- Helmet, compression, morgan (logs), CORS, configurable rate limiting.
- In‑memory cache with TTL and cacheHits counter.
- Input validation and clearer error messages.
- /api/metrics for observability.
- Graceful shutdown and centralized error handler.

Technical notes

jsdom parsing is more reliable than regex for dynamic DOM; selectors may need updates as Amazon changes HTML.
In‑memory cache is simple and effective; for production, prefer Redis.
Rate limiting guards against abuse; tune via .env.
Tailwind enables fast, consistent UI iteration.
Infinite scroll improves perceived performance; classic pagination can be added if SEO matters.

Limitations & responsibility

Scraping may violate Terms of Service; use for education, controlled tests, or with authorization.
Amazon’s structure can change; selectors require maintenance.
Not designed to bypass anti‑bot systems or operate at massive scale.

Roadmap ideas

Sorting (price, rating) and more filters (shipping, category).
Favorites and product comparison (localStorage or backend).
External cache (Redis) and queues (BullMQ) for scale.
Auth for a private metrics dashboard.
PWA and E2E tests (Playwright).

Tech stack

Backend: Node.js, Express, Axios, JSDOM, Helmet, Compression, Morgan, Rate Limit.
Frontend: Vite, Tailwind CSS, Font Awesome, Google Fonts (Inter).
Tests (frontend): Vitest + jsdom.

License

MIT – use freely with attribution. Respect Amazon’s policies.

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
client		client
server		server
.env.example		.env.example
.gitignore		.gitignore
CHANGELOG.md		CHANGELOG.md
INSTALACAO.md		INSTALACAO.md
README.md		README.md
STATUS_FINAL.md		STATUS_FINAL.md
deploy.sh		deploy.sh
exemplo_uso.html		exemplo_uso.html
package-lock.json		package-lock.json
package.json		package.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Amazon Scraper – Modern, robust and responsive full‑stack

This project was also built to learn and improve knowledge about APIs, backend, Node.js, etc.

It's also useful when searching for products.

Backend

Frontend

Testes (frontend)

Preview

Why this project matters

Architecture

Getting started

Endpoints

Key improvements

Technical notes

Limitations & responsibility

Roadmap ideas

Tech stack

License

About

Uh oh!

Releases

Packages

Languages

NatanLuz/amazon_scraper_fullstackAPI

Folders and files

Latest commit

History

Repository files navigation

Amazon Scraper – Modern, robust and responsive full‑stack

This project was also built to learn and improve knowledge about APIs, backend, Node.js, etc.

It's also useful when searching for products.

Backend

Frontend

Testes (frontend)

Preview

Why this project matters

Architecture

Getting started

Endpoints

Key improvements

Technical notes

Limitations & responsibility

Roadmap ideas

Tech stack

License

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages