A simple pose estimation tool that detects human body poses in video and draws a skeleton overlay using MediaPipe and OpenCV.
- Detects 33 body landmarks (joints) per person
- Draws skeleton overlay with joints and bone connections
- Face landmarks excluded by default (enable with --with-face)
- Processes video files with progress reporting
- Optional live preview window
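The "face landmarks excluded by default" feature can be sketched as a simple index filter: MediaPipe Pose numbers its 33 landmarks 0-32, with indices 0-10 covering the face (nose, eyes, ears, mouth) and 11-32 the body joints. The names `FACE_LANDMARK_IDS` and `filter_connections` below are illustrative, not part of MediaPipe's API:

```python
# MediaPipe Pose landmark indices: 0-10 are face points, 11-32 are body joints.
FACE_LANDMARK_IDS = set(range(0, 11))   # excluded unless --with-face is given
BODY_LANDMARK_IDS = set(range(11, 33))  # always drawn

def filter_connections(connections, with_face=False):
    """Drop any bone whose endpoint is a face landmark, unless requested."""
    if with_face:
        return list(connections)
    return [(a, b) for a, b in connections
            if a not in FACE_LANDMARK_IDS and b not in FACE_LANDMARK_IDS]

# Example with a few landmark-index pairs in the style of POSE_CONNECTIONS:
sample = [(0, 1), (11, 12), (11, 13), (9, 10)]
print(filter_connections(sample))  # → [(11, 12), (11, 13)]
```

The same filter applies to the joints themselves: skip drawing any landmark whose index is below 11 unless face output was requested.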
- Python 3.8+
- OpenCV
- MediaPipe
Install Python (if not already installed)
- Download from https://www.python.org/downloads/
- During installation, check "Add Python to PATH"
Create a virtual environment (recommended)
python -m venv venv
venv\Scripts\activate
Install dependencies
pip install opencv-python mediapipe
Verify installation
python -c "import cv2; import mediapipe; print('Ready!')"
Ensure WSL is installed
# In PowerShell (as Administrator)
wsl --install
Open WSL terminal and install Python
sudo apt update
sudo apt install python3 python3-pip python3-venv
Create a virtual environment
python3 -m venv venv
source venv/bin/activate
Install dependencies
pip install opencv-python mediapipe
Note on video preview in WSL
- The --preview flag requires a display server
- If using WSL2 with WSLg (Windows 11), it should work automatically
- Otherwise, skip --preview and just generate output files
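The note above can be turned into a runtime guard: check for a display server before enabling the preview. This is a minimal sketch; `display_available` is a hypothetical helper, and it assumes the standard `DISPLAY` / `WAYLAND_DISPLAY` environment variables that X11 and WSLg set:

```python
import os

def display_available() -> bool:
    """Heuristic check: a GUI preview needs an X11 or Wayland display server."""
    return bool(os.environ.get("DISPLAY") or os.environ.get("WAYLAND_DISPLAY"))

if __name__ == "__main__":
    if display_available():
        print("Preview should work (display server detected).")
    else:
        print("No display server found; skip --preview and write an output file.")
```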
# Process a video and save output
python pose_estimator.py input.mp4 -o output.mp4
# Windows example with full paths
python pose_estimator.py "C:\Users\Name\Videos\workout.mp4" -o "C:\Users\Name\Videos\output.mp4"
# WSL example (accessing Windows files)
python pose_estimator.py /mnt/c/Users/Name/Videos/workout.mp4 -o /mnt/c/Users/Name/Videos/output.mp4

# Show live preview while processing (press 'q' to quit)
python pose_estimator.py input.mp4 -o output.mp4 --preview
# Include face landmarks (eyes, ears, nose, mouth)
python pose_estimator.py input.mp4 -o output.mp4 --with-face
# Adjust detection confidence (higher = fewer false positives)
python pose_estimator.py input.mp4 -o output.mp4 --detection-confidence 0.7
# Adjust tracking confidence (higher = more stable between frames)
python pose_estimator.py input.mp4 -o output.mp4 --tracking-confidence 0.7

usage: pose_estimator.py [-h] [-o OUTPUT] [--preview]
[--detection-confidence DETECTION_CONFIDENCE]
[--tracking-confidence TRACKING_CONFIDENCE]
[--with-face]
input
positional arguments:
input Input video file path
options:
-h, --help Show help message
-o, --output Output video file path
--preview Show live preview window
--detection-confidence
Minimum detection confidence 0.0-1.0 (default: 0.5)
--tracking-confidence
Minimum tracking confidence 0.0-1.0 (default: 0.5)
--with-face Include face landmarks
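The interface above maps directly onto Python's `argparse`. This is a sketch reconstructed from the usage text, not the tool's actual source; flag names and the 0.5 defaults come from the help output shown above:

```python
import argparse

def build_parser() -> argparse.ArgumentParser:
    # Mirrors the usage text; argparse derives attribute names like
    # args.detection_confidence from the --detection-confidence flag.
    p = argparse.ArgumentParser(prog="pose_estimator.py")
    p.add_argument("input", help="Input video file path")
    p.add_argument("-o", "--output", help="Output video file path")
    p.add_argument("--preview", action="store_true",
                   help="Show live preview window")
    p.add_argument("--detection-confidence", type=float, default=0.5,
                   help="Minimum detection confidence 0.0-1.0 (default: 0.5)")
    p.add_argument("--tracking-confidence", type=float, default=0.5,
                   help="Minimum tracking confidence 0.0-1.0 (default: 0.5)")
    p.add_argument("--with-face", action="store_true",
                   help="Include face landmarks")
    return p

args = build_parser().parse_args(["input.mp4", "-o", "output.mp4", "--preview"])
print(args.input, args.output, args.preview)  # → input.mp4 output.mp4 True
```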
# 1. Activate virtual environment
# Windows:
venv\Scripts\activate
# Linux/macOS/WSL:
source venv/bin/activate
# 2. Process a workout video
python pose_estimator.py workout.mp4 -o workout_pose.mp4
# 3. Check the output
# The output video will have skeleton overlay drawn on detected persons

If imports fail with ModuleNotFoundError, reinstall the dependencies:
pip install opencv-python
pip install mediapipe

If the input video fails to open:
- Ensure the input video codec is supported
- Try converting input to MP4 with H.264 codec first
If the preview window does not appear:
- WSL requires WSLg or an X server for GUI
- Run without the --preview flag and view the output file instead
If processing is slow:
- Lower resolution videos process faster
- Consider reducing model_complexity in the code (0=lite, 1=full, 2=heavy)
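The trade-off above can be sketched as a small helper that builds the keyword arguments for `mp.solutions.pose.Pose()`. The helper name `pose_kwargs` and the name table are illustrative; `model_complexity`, `min_detection_confidence`, and `min_tracking_confidence` are the parameters MediaPipe Pose actually accepts:

```python
# 0=lite (fastest), 1=full (default), 2=heavy (most accurate, slowest).
COMPLEXITY_NAMES = {0: "lite", 1: "full", 2: "heavy"}

def pose_kwargs(model_complexity=1, detection=0.5, tracking=0.5):
    """Build keyword arguments for mp.solutions.pose.Pose()."""
    if model_complexity not in COMPLEXITY_NAMES:
        raise ValueError("model_complexity must be 0, 1, or 2")
    return {
        "model_complexity": model_complexity,
        "min_detection_confidence": detection,
        "min_tracking_confidence": tracking,
    }

print(pose_kwargs(0))  # lite model: fastest, least accurate
```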
See HOW_IT_WORKS.md for a technical explanation of the pose estimation pipeline.
MIT