This repository is a fork of the LipNet implementation by Nicholas Renotte. Watch his tutorial on YouTube
-
Automatic Face Tracking:
- Utilizes dlib's facial landmark detector to locate and track faces in video frames.
-
ROI Extraction for Lips:
- Extracts a dynamic Region of Interest (ROI) around the lips from the detected facial landmarks.
-
Tracked Videos Caching
- Extracted ROIs are cached to avoid reruning dlib tracking during data loading.
