Wan 2.0 Video Generation Project

Generate stunning AI-powered videos from text prompts or images using the Wan 2.0 model by Alibaba.

✨ Features

🎬 Text-to-Video Generation - Create videos from descriptive text prompts
🖼️ Image-to-Video Animation - Bring static images to life with AI
🌐 Modern Web Interface - Beautiful, responsive UI with glassmorphism design
⚡ Real-time Processing - Fast video generation with progress tracking
📱 Mobile Responsive - Works seamlessly on all devices
🎨 Customizable Settings - Control resolution, FPS, and duration

🚀 Quick Start

Prerequisites

Python 3.8 or higher
CUDA-compatible GPU (recommended) or CPU
8GB+ RAM (16GB+ recommended)

Installation

Clone or navigate to the project directory:

cd d:\Freelancing\ImagetoVideo

Create a virtual environment:

python -m venv venv

Activate the virtual environment:

Windows:

venv\Scripts\activate

Linux/Mac:

source venv/bin/activate

Install dependencies:

pip install -r requirements.txt

Set up environment variables:

copy .env.example .env

Edit .env file to configure your settings (optional).

Running the Application

Start the Flask server:

python app.py

Open your browser and navigate to:

http://localhost:5000

Start creating videos!

📖 Usage

Text-to-Video

Select the Text to Video tab
Enter a descriptive prompt (e.g., "A majestic dragon flying over a medieval castle at sunset")
Optionally add a negative prompt to avoid unwanted elements
Configure settings (duration, FPS, resolution)
Click Generate Video
Wait for processing and download your video!

Image-to-Video

Select the Image to Video tab
Upload an image (PNG, JPG, JPEG, or WEBP)
Optionally add a motion prompt (e.g., "gentle camera zoom")
Configure settings
Click Generate Video
Download your animated video!

🎨 Example Prompts

Text-to-Video Examples:

"A serene lake at sunrise with mist rising from the water, cinematic 4k"
"Futuristic city with flying cars and neon lights, cyberpunk style"
"Ocean waves crashing on a rocky shore, slow motion"
"Northern lights dancing in the night sky over snowy mountains"

Image-to-Video Motion Prompts:

"Slow zoom in with gentle camera movement"
"Pan from left to right smoothly"
"Character walking forward naturally"
"Leaves rustling in the wind"

⚙️ Configuration

Edit the .env file to customize:

# Model Configuration
MODEL_NAME=alibaba-pai/wan-2.0-5b
DEVICE=cuda  # Options: cuda, cpu, mps
USE_FP16=True

# Generation Settings
DEFAULT_RESOLUTION=720
DEFAULT_FPS=24
DEFAULT_DURATION=5
MAX_VIDEO_LENGTH=10

# Server Settings
HOST=0.0.0.0
PORT=5000

🏗️ Project Structure

ImagetoVideo/
├── app.py                 # Flask backend server
├── model_handler.py       # Wan 2.0 model integration
├── requirements.txt       # Python dependencies
├── .env.example          # Environment configuration template
├── .gitignore            # Git ignore rules
├── static/               # Frontend files
│   ├── index.html        # Main web interface
│   ├── css/
│   │   └── style.css     # Styling with glassmorphism
│   └── js/
│       └── app.js        # Client-side JavaScript
├── uploads/              # Temporary image uploads
└── outputs/              # Generated videos

🔧 API Endpoints

Health Check

GET /api/health

Text-to-Video

POST /api/text-to-video
Content-Type: application/json

{
  "prompt": "Your prompt here",
  "negative_prompt": "Optional",
  "duration": 5,
  "fps": 24,
  "resolution": 720
}

Image-to-Video

POST /api/image-to-video
Content-Type: multipart/form-data

image: <file>
prompt: "Optional motion description"
duration: 5
fps: 24
resolution: 720

Download Video

GET /api/download/<filename>

🎯 System Requirements

Minimum Requirements:

CPU: 4+ cores
RAM: 8GB
Storage: 10GB free space
GPU: Not required (CPU mode available)

Recommended Requirements:

CPU: 8+ cores
RAM: 16GB+
Storage: 20GB+ free space
GPU: NVIDIA GPU with 8GB+ VRAM (RTX 3060 or better)

🐛 Troubleshooting

Model Loading Issues

If the model fails to load, the application will run in mock mode for demonstration purposes. To use the actual Wan 2.0 model:

Ensure you have sufficient GPU memory
Try setting USE_FP16=True in .env to reduce memory usage
Consider using a smaller model variant
Check your internet connection for model downloads

CUDA Out of Memory

Reduce resolution in settings
Decrease video duration
Set USE_FP16=True
Close other GPU-intensive applications

Slow Generation

Use GPU instead of CPU (set DEVICE=cuda)
Reduce resolution and FPS
Enable half-precision (USE_FP16=True)

📝 Notes

First run will download the Wan 2.0 model (~10GB), which may take time
Video generation time depends on duration, resolution, and hardware
Generated videos are saved in the outputs/ directory
Uploaded images are temporarily stored and automatically cleaned up

🌟 Technologies Used

Backend: Flask, Python
AI/ML: PyTorch, Hugging Face Diffusers, Wan 2.0
Frontend: HTML5, CSS3, Vanilla JavaScript
Video Processing: OpenCV, imageio

📄 License

This project is for educational and demonstration purposes. Please refer to the Wan 2.0 model license for commercial usage restrictions.

🤝 Contributing

Contributions are welcome! Feel free to submit issues or pull requests.

🔗 Resources

Made with ❤️ using Wan 2.0 AI

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Wan 2.0 Video Generation Project

✨ Features

🚀 Quick Start

Prerequisites

Installation

Running the Application

📖 Usage

Text-to-Video

Image-to-Video

🎨 Example Prompts

Text-to-Video Examples:

Image-to-Video Motion Prompts:

⚙️ Configuration

🏗️ Project Structure

🔧 API Endpoints

Health Check

Text-to-Video

Image-to-Video

Download Video

🎯 System Requirements

Minimum Requirements:

Recommended Requirements:

🐛 Troubleshooting

Model Loading Issues

CUDA Out of Memory

Slow Generation

📝 Notes

🌟 Technologies Used

📄 License

🤝 Contributing

🔗 Resources

About

Uh oh!

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
outputs		outputs
static		static
uploads		uploads
.env.example		.env.example
.gitattributes		.gitattributes
.gitignore		.gitignore
README.md		README.md
api_client.py		api_client.py
app.py		app.py
model_handler.py		model_handler.py
requirements.txt		requirements.txt

TechScape/ImagetoVideo

Folders and files

Latest commit

History

Repository files navigation

Wan 2.0 Video Generation Project

✨ Features

🚀 Quick Start

Prerequisites

Installation

Running the Application

📖 Usage

Text-to-Video

Image-to-Video

🎨 Example Prompts

Text-to-Video Examples:

Image-to-Video Motion Prompts:

⚙️ Configuration

🏗️ Project Structure

🔧 API Endpoints

Health Check

Text-to-Video

Image-to-Video

Download Video

🎯 System Requirements

Minimum Requirements:

Recommended Requirements:

🐛 Troubleshooting

Model Loading Issues

CUDA Out of Memory

Slow Generation

📝 Notes

🌟 Technologies Used

📄 License

🤝 Contributing

🔗 Resources

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages