This repository provides a serverless-ready, locally runnable deployment of ByteDance's LatentSync 1.6 lip-sync model. It supports explicit environment selection for local, staging, and production deployments. The system was run and tested on NVIDIA RTX 3090 and A40 GPUs and consumed roughly 19 GB of VRAM.
- Serverless GPU inference (RunPod compatible)
- Explicit environment selection (`local`, `stag`, `prod`)
- Dockerized CUDA environment
- Preloaded models (UNet, Whisper, VAE, InsightFace)
- No runtime model downloads
- Global pipeline reuse
- Clean runtime cleanup & GPU memory handling
Important: the `level` field is mandatory for all runs.
```json
{
  "level": "local",
  "ref_video_path": "/absolute/path/to/video.mp4",
  "ref_audio_path": "/absolute/path/to/audio.wav"
}
```
- Uses the local filesystem
- No cloud credentials required
- Intended for development and debugging only
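For local development it helps to validate a request before handing it to the pipeline. The checker below is a hypothetical sketch (field names taken from the example payload above); the real `app.py` may validate differently.

```python
import os

REQUIRED_FIELDS = ("level", "ref_video_path", "ref_audio_path")

def validate_local_request(payload: dict) -> list[str]:
    """Return a list of problems with a local-mode request (empty = valid)."""
    problems = [f for f in REQUIRED_FIELDS if f not in payload]
    if payload.get("level") == "local":
        for key in ("ref_video_path", "ref_audio_path"):
            path = payload.get(key, "")
            if path and not os.path.isabs(path):
                problems.append(f"{key} must be an absolute path in local mode")
    return problems
```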
```json
{
  "level": "stag",
  "ref_video_path": "s3://staging-bucket/path/video.mp4",
  "ref_audio_path": "s3://staging-bucket/path/audio.wav"
}
```
- Uses staging AWS resources
- Separate credentials and buckets
- Mirrors production setup safely
```json
{
  "level": "prod",
  "ref_video_path": "s3://production-bucket/path/video.mp4",
  "ref_audio_path": "s3://production-bucket/path/audio.wav"
}
```
- Uses production AWS infrastructure
- Strict access and IAM policies
- Intended for live workloads
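The three levels can be dispatched with a small lookup before any inference runs. The table and helper below are an illustrative sketch, not the repository's actual code; field names are assumptions.

```python
# Illustrative mapping of each deployment level to its storage backend.
LEVELS = {
    "local": {"storage": "filesystem", "requires_aws": False},
    "stag":  {"storage": "s3", "requires_aws": True},
    "prod":  {"storage": "s3", "requires_aws": True},
}

def resolve_level(payload: dict) -> dict:
    """Pick the environment config for a request; `level` is mandatory."""
    level = payload.get("level")
    if level not in LEVELS:
        raise ValueError(f"'level' must be one of {sorted(LEVELS)}, got {level!r}")
    return LEVELS[level]
```

Rejecting an unknown or missing `level` up front keeps a mis-addressed request from ever touching production buckets.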
```json
{
  "aleef": true
}
```
Returns service metadata without running inference.
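A handler can short-circuit on this flag before loading anything heavy. The sketch below assumes the RunPod convention of wrapping payloads under an `input` key; the metadata fields returned are illustrative.

```python
def handler(event: dict) -> dict:
    """Serverless entry point: answer health checks without inference."""
    payload = event.get("input", event)  # RunPod wraps payloads in "input"
    if payload.get("aleef"):
        # Health check: report metadata, never touch the GPU pipeline.
        return {
            "service": "latentsync-lipsync-serverless",
            "model": "LatentSync 1.6",
            "status": "ready",
        }
    # ... otherwise fall through to the lip-sync inference path ...
    raise NotImplementedError("inference path omitted in this sketch")
```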
```
.
├── app.py
├── Dockerfile
├── requirements.txt
├── utils/
├── LatentSync/
├── checkpoints/
└── test_input.json
```
```shell
docker build -t latentsync-lipsync-serverless .
```
All models are preloaded at build time, ensuring fully offline runtime execution.
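Once built, the image can be started locally for a smoke test. The flags below are a sketch: adjust them to the actual Dockerfile entrypoint and your GPU setup.

```shell
# Run with GPU access; models are baked into the image,
# so no network access is needed at inference time.
docker run --gpus all --rm latentsync-lipsync-serverless
```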
- Python 3.10
- PyTorch (CUDA)
- Diffusers
- LatentSync 1.6
- Whisper
- InsightFace
- RunPod Serverless
- AWS S3
- Temp files created in `/tmp`
- GPU memory cleared after each job
- Global pipeline reused across warm invocations
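The reuse-and-cleanup behaviour follows the usual serverless warm-start pattern, sketched below. `loader` and `infer` are placeholders for the real pipeline constructor and inference call; on a CUDA host you would also call `torch.cuda.empty_cache()` in the `finally` block.

```python
import shutil
import tempfile

_PIPELINE = None  # module-level cache: survives across warm invocations

def get_pipeline(loader):
    """Load the heavy pipeline once; later warm calls reuse the same object."""
    global _PIPELINE
    if _PIPELINE is None:
        _PIPELINE = loader()
    return _PIPELINE

def run_job(loader, infer):
    """Run one job in a throwaway /tmp workdir that is always cleaned up."""
    workdir = tempfile.mkdtemp(prefix="latentsync-", dir="/tmp")
    try:
        return infer(get_pipeline(loader), workdir)
    finally:
        shutil.rmtree(workdir, ignore_errors=True)
        # On CUDA hosts: also clear GPU memory here (torch.cuda.empty_cache()).
```

The `finally` block guarantees temp files are removed even when inference fails, while the module-level cache keeps model loading off the hot path for warm invocations.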
- LatentSync: Apache 2.0
- Other dependencies follow upstream licenses
✔ Local, staging, and production modes supported
✔ Serverless Docker image deployed
✔ Models preloaded and locked
🙏 Acknowledgement
Special thanks to the ByteDance LatentSync team for their outstanding work on this model. This deployment builds on their research and engineering excellence, and we acknowledge their contribution with deep respect and gratitude.