
@ismail-kattakath/mediapipe-react

NPM Version License Build Status codecov PRs Welcome

Production-ready React hooks for MediaPipe AI tasks — GenAI, Vision, and Audio.


Overview

@ismail-kattakath/mediapipe-react is a monorepo that provides React developers with a clean, hooks-based API for integrating Google's MediaPipe AI capabilities. Built with Turborepo and pnpm workspaces, this project is designed for scalability, modularity, and developer experience.

Three Target Areas

  1. GenAI — LLM inference (Gemma, Llama) with Web Worker orchestration
  2. Vision — Hand tracking, face mesh, object detection, pose estimation
  3. Audio — Audio classification and custom model support

The library uses subpath exports to ensure tree-shaking and minimal bundle sizes. Import only what you need.
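For example (`useLlm` is exported from the GenAI subpath; the exact export list is in the Core Package README):

```ts
// Pulls in only the GenAI code path — Vision and Audio stay out of the bundle.
import { useLlm } from '@ismail-kattakath/mediapipe-react/genai';
```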

Architecture

This is a monorepo managed with Turborepo and pnpm workspaces.

Project Structure

mediapipe-react/
├── packages/
│   └── core/              # @ismail-kattakath/mediapipe-react (published to npm)
│       ├── src/
│       │   ├── index.ts   # Core provider and utilities
│       │   ├── genai.ts   # GenAI subpath (useLlm, etc.)
│       │   ├── vision.ts  # Vision subpath (planned)
│       │   └── audio.ts   # Audio subpath (planned)
│       └── package.json
├── apps/
│   ├── playground-next/   # Next.js App Router playground
│   └── playground-vite/   # Vite + React playground
├── turbo.json             # Turborepo configuration
└── pnpm-workspace.yaml    # pnpm workspace config

Key Design Decisions

  • Subpath Exports: Import from @ismail-kattakath/mediapipe-react/genai to avoid bundling unused code
  • SSR Safety: All hooks include isBrowser() guards for Next.js compatibility
  • Web Workers: Heavy inference tasks run in background threads to keep the UI responsive
  • TypeScript-first: Full type safety across all APIs
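The SSR-safety decision can be sketched as follows. This is a hypothetical illustration of the guard pattern, not the package's actual helper — the real implementation lives in the core package, and these names are illustrative:

```typescript
// Hypothetical sketch of the SSR guard the hooks rely on.
const isBrowser = (): boolean =>
  typeof (globalThis as { window?: unknown }).window !== 'undefined';

// Browser-only work (e.g. creating a Worker or loading a model) is skipped
// on the server, so components render safely under Next.js SSR.
function guardedLoad<T>(load: () => T): T | null {
  if (!isBrowser()) return null; // server render: no-op
  return load();
}

console.log(guardedLoad(() => 'model loaded')); // null on the server, the value in the browser
```

Because every hook checks the environment before touching browser APIs, components using them can be rendered on the server without crashing.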

Installation

For library users, see the Core Package README for detailed usage instructions.

pnpm add @ismail-kattakath/mediapipe-react

Roadmap

We are following a phased rollout to cover the full breadth of MediaPipe's capabilities while maintaining a React-idiomatic developer experience.

Phase 1: Generative AI ✅

  • LLM Inference: Support for Gemma and Llama models
  • Web Worker Orchestration: Offload heavy inference to background threads
  • Streaming Hooks: Real-time token streaming for chat interfaces
  • Hooks: useLlm, useLlmChat
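A streaming chat integration might look like the sketch below. The hook's options and return shape here are illustrative, not the package's confirmed signature — consult the Core Package README for the actual API:

```tsx
import { useLlmChat } from '@ismail-kattakath/mediapipe-react/genai';

// Hypothetical usage — option and field names are assumptions.
function Chat() {
  const { messages, send, isGenerating } = useLlmChat({ model: '/models/gemma.task' });

  return (
    <div>
      {messages.map((m, i) => (
        <p key={i}>{m.text}</p>
      ))}
      <button disabled={isGenerating} onClick={() => send('Hello!')}>
        Send
      </button>
    </div>
  );
}
```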

Phase 2: Vision Core (In Progress)

  • Hand Tracking: 2D and 3D hand landmark detection
  • Face Mesh: High-fidelity face landmark detection
  • Object Detection: Identifying and locating multiple objects in images/video
  • Hooks: useHandTracking, useFaceMesh, useObjectDetection

Phase 3: Advanced Perception

  • Holistic Tracking: Simultaneous tracking of body, hands, and face
  • Selfie Segmentation: Real-time background removal/blurring
  • Pose Tracking: 3D body pose estimation
  • Hooks: useHolistic, useSelfieSegmentation, usePoseTracking

Phase 4: Audio & Customization

  • Audio Classification: Identify sounds from a predefined set of categories
  • Custom Model Assets: Support for uploading and using custom .tflite or GenAI model files
  • Hooks: useAudioClassifier, useCustomModel

Contributing

We welcome contributions! This project is designed to be contributor-friendly.

Getting Started

  1. Clone the repository:

    git clone https://github.com/ismail-kattakath/mediapipe-react.git
    cd mediapipe-react
  2. Install dependencies:

    pnpm install
  3. Run the development playgrounds:

    pnpm dev

    This starts both the Next.js playground (port 3000) and Vite playground (port 5173).

  4. Make changes to packages/core/src/

  5. Run tests:

    pnpm test
  6. Create a changeset (if your changes affect the public API):

    pnpm changeset

Development Workflow

  • Turborepo automatically rebuilds packages/core when you edit files
  • Lint-staged runs on pre-commit to enforce code quality
  • Changesets manages versioning and changelog generation
  • GitHub Actions runs CI on every PR (lint, build, test)

For detailed contribution guidelines, see CONTRIBUTING.md.

🤝 Call for Contributors

We are actively looking for help with Vision capabilities!

If you're interested in implementing the vision.ts subpath using our established Web Worker pattern (see genai.ts as a reference), we'd love to collaborate with you.
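As a rough orientation, the worker pattern boils down to correlating requests with responses across the thread boundary. The sketch below is a hypothetical, simplified version of that "ping-pong" correlation — all names are illustrative, and the worker is simulated with a local handler so the sketch runs standalone; see genai.ts for the actual implementation:

```typescript
// Each request carries an id; the matching response resolves the pending promise.
type WorkerRequest = { id: number; prompt: string };
type WorkerResponse = { id: number; text: string };

class PingPongChannel {
  private pending = new Map<number, (res: WorkerResponse) => void>();
  private nextId = 0;

  // In the real library this would wrap worker.postMessage / onmessage.
  constructor(private worker: (req: WorkerRequest) => WorkerResponse) {}

  send(prompt: string): Promise<WorkerResponse> {
    const id = this.nextId++;
    return new Promise((resolve) => {
      this.pending.set(id, resolve);
      this.receive(this.worker({ id, prompt })); // simulate the worker replying
    });
  }

  private receive(res: WorkerResponse): void {
    const resolve = this.pending.get(res.id);
    if (resolve) {
      this.pending.delete(res.id);
      resolve(res);
    }
  }
}

const channel = new PingPongChannel((req) => ({ id: req.id, text: req.prompt.toUpperCase() }));
channel.send('hello').then((res) => console.log(res.text)); // prints "HELLO"
```

The id-based correlation is what lets several in-flight requests share one worker without their responses getting crossed.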

Help us make MediaPipe the standard for AI in React!

License

MIT © Ismail Kattakath

About

An open-source NPM package abstracting complex multi-threaded orchestration in browsers—including web worker offloading, ping-pong synchronization, and zero-copy transfers—for zero-friction, idiomatic integration of Google MediaPipe API in React.js and Next.js with full SSR-safety.
