The Complete Resource for Alibaba Wan Video Generation Models
The world's leading AI video generation model family — open-source models (2.1/2.2) & cutting-edge API (2.5/2.6). Wan 2.7 coming March 2026! Curated tools, prompts, guides, and resources.
🚀 Generate Wan Videos via API — from $0.03/s 🚀
25% bonus on first recharge (up to $100) • Uncensored Wan Spicy available • Wan 2.7 coming soon!
🔒 Enterprise-Grade Security — Atlas Cloud is SOC I & II Certified | HIPAA Compliant | 99.9% uptime SLA.
🎨 NSFW Whitelist Update — In addition to Seedance and Kling, the Vidu series (Q3-Pro, Q3-Turbo) is now also whitelisted for uncensored content generation on Atlas Cloud.
- Wan 2.7 — Coming Soon
- About Wan Models
- Model Comparison
- Quick Start
- Prompt Collection
- Local Deployment Guide
- Prompt Engineering
- API Pricing Comparison
- Official Resources
- Community & Ecosystem
- FAQ
- Star History
- Contributing
- License
Status: Expected March 2026
The Wan model family has followed a rapid and consistent improvement trajectory: 2.1 → 2.2 → 2.5 → 2.6 → 2.7. Each version has brought major leaps in capability, and Wan 2.7 is poised to continue that trend.
Wan 2.6 already set a high bar with its feature set:
- Multi-shot narrative generation — create coherent stories across multiple shots in a single generation
- Character roleplay — maintain character identity across scenes using reference videos (R2V)
- Voice-driven generation — native audio synchronization for dialogue and ambient sound
- 15-second video duration — the longest in the Wan series
- 5 video reference inputs — use multiple reference clips for character and scene consistency
- 1080p resolution — full HD output quality
Based on the version history progression and community feedback, Wan 2.7 is expected to push boundaries further:
| Area | Wan 2.6 (Current) | Wan 2.7 (Expected) |
|---|---|---|
| Video Duration | Up to 15s | 20s+ (likely) |
| Resolution | 1920×1080 | 4K support possible |
| Motion Control | Good | More precise & controllable |
| Character Consistency | Reference-based | Enhanced identity preservation |
| Audio Sync | Supported | Improved lip sync & sound design |
These are speculative improvements based on the consistent pattern of each Wan release addressing top community requests. Alibaba has not yet published official Wan 2.7 specs.
| Version | Release | Key Milestone |
|---|---|---|
| Wan 1.0 | 2024 Q3 | Initial release |
| Wan 2.1 | 2024 Q4 | VBench #1 (86.22%), open-source 14B & 1.3B |
| Wan 2.2 | 2025 Q1 | Quality improvements, Spicy variant |
| Wan 2.5 | 2025 Q2 | Audio sync, 1080p, 10s duration, API-only |
| Wan 2.6 | 2025 Q3 | Multi-shot, 15s, character roleplay, R2V |
| Wan 2.7 | 2026 Q1 (Expected) | Coming Soon |
🚀 Wan 2.7 will be available on Atlas Cloud upon release. Sign up now to get early access notifications and a 25% bonus on first recharge.
Wan is Alibaba's video generation model series that has taken the AI world by storm. Wan 2.1 and 2.2 are fully open-source (Apache 2.0) with weights available on Hugging Face, while Wan 2.5 and 2.6 are closed-source commercial API models with advanced capabilities. From its debut to its current #1 position on the VBench leaderboard, Wan has consistently pushed the boundaries of video AI.
- 🏆 VBench Champion — Wan 2.1 scored 86.22% on VBench, surpassing Sora (84.28%), Luma (83.61%), and all other models
- 🔓 Open Source (2.1/2.2) — Unlike Sora, Kling, or Runway, Wan 2.1/2.2 are fully open-source (Apache 2.0) with weights available on Hugging Face for self-deployment
- 💻 Consumer GPU Friendly — The 1.3B version needs only 8.19GB VRAM, runs on a single RTX 3060
- 📊 Massive Training Data — Trained on 1.5 billion videos + 10 billion images
- 🔥 2.2M+ Downloads — Proven and validated by the global AI community on Hugging Face and ModelScope
- 🎬 Uncensored Option — Wan 2.2 Spicy enables unrestricted creative freedom
Wan 1.0 (2024 Q3)
└── Wan 2.1 (2024 Q4) — VBench #1, 14B & 1.3B [Open Source, Apache 2.0]
└── Wan 2.2 (2025 Q1) — Improved quality, Spicy variant 🌶️ [Open Source, Apache 2.0]
├── Wan 2.2 Spicy — NSFW/Uncensored generation [Proprietary, Atlas Cloud LoRA]
└── Wan 2.5 (2025 Q2) — Audio sync, 1080p, 10s [Closed Source, API only]
└── Wan 2.6 (2025 Q3) — Multi-shot, 15s, character stability [Closed Source, API only]
└── Wan 2.7 (March 2026?) — 🔜 Longer duration, higher res, better control
| Feature | Wan 2.1/2.2 | Wan 2.5/2.6 | Sora | Kling | Runway Gen-3 |
|---|---|---|---|---|---|
| Open Source | ✅ Apache 2.0 | ❌ API only | ❌ | ❌ | ❌ |
| Local Deployment | ✅ | ❌ | ❌ | ❌ | ❌ |
| Max Resolution | 720p | 1920×1080 | 1920×1080 | 1920×1080 | 1920×1080 |
| Max Duration | 5s | 15s | 20s | 10s | 10s |
| Native Audio | ❌ | ✅ | ✅ | ❌ | ❌ |
| Multi-Shot | ❌ | ✅ | ❌ | ❌ | ❌ |
| NSFW Support | ✅ (Spicy) | ❌ | ❌ | ❌ | ❌ |
| Price / Video | Self-host | From $0.03 | $0.50+ | $0.30+ | $0.50+ |
| Consumer GPU | ✅ (1.3B) | N/A | N/A | N/A | N/A |
| Fine-tuning | ✅ LoRA | ❌ | ❌ | ❌ | ❌ |
| Version | Parameters | Resolution | Duration | Audio | Multi-Shot | Open Source | NSFW | API Price |
|---|---|---|---|---|---|---|---|---|
| Wan 2.7 🔜 | TBD | 4K? | 20s+? | ✅ | ✅ | TBD | TBD | Coming Soon |
| Wan 2.6 | 14B | 1080p | Up to 15s | ✅ | ✅ | ❌ (API only) | ❌ | $0.07 |
| Wan 2.5 | 14B | 1080p | Up to 10s | ✅ | ❌ | ❌ (API only) | ❌ | Lower |
| Wan 2.2 Spicy 🌶️ | 14B | 720p | Up to 8s | ❌ | ❌ | ✅ | ✅ | $0.03 |
| Wan 2.2 | 14B / 1.3B | 720p | Up to 5s | ❌ | ❌ | ✅ | ❌ | — |
| Wan 2.1 | 14B / 1.3B | 720p | Up to 5s | ❌ | ❌ | ✅ | ❌ | — |
| Model | Type | Price | Highlights |
|---|---|---|---|
| Wan 2.6 T2V | Text-to-Video | from $0.07/s (70% off) | 5/10/15s, up to 1920×1080, multi-shot, audio |
| Wan 2.6 I2V | Image-to-Video | Available | High-fidelity image animation |
| Wan 2.6 I2V Flash | Image-to-Video | Faster & cheaper | Speed-optimized variant |
| Wan 2.6 V2V | Video-to-Video | Available | Style transfer & editing |
| Wan 2.6 Image-Edit | Image Editing | Available | AI-powered image editing |
| Wan 2.6 T2I | Text-to-Image | Available | High-quality image generation |
| Wan 2.5 T2V | Text-to-Video | Available | Reliable video generation |
| Wan 2.5 I2V | Image-to-Video | Available | Stable image animation |
| Wan 2.5 Fast | Text-to-Video | Available | Speed-optimized |
| Wan 2.2 Spicy I2V 🌶️ | Image-to-Video | from $0.03/s | Uncensored generation |
| Wan 2.2 Spicy I2V LoRA | Image-to-Video | Available | LoRA fine-tuning support |
| Wan 2.2 Character Swap | Video Effects | Available | animate-mix face swap |
| Wan 2.2 Image-to-Animation | Animation | Available | animate-move motion transfer |
| Van 2.6 / 2.5 | Optimized | Available | Atlas Cloud optimized variants |
Prices shown are starting prices per second of generated video. Actual cost depends on resolution, duration, and model selected.
🎁 New user offer: 25% bonus on first recharge (up to $100). Sign up now →
cURL Example — Wan 2.6 Text-to-Video
# Wan 2.6 Text-to-Video API call
curl -X POST "https://api.atlascloud.ai/v1/video/generate" \
-H "Authorization: Bearer YOUR_API_KEY" \
-H "Content-Type: application/json" \
-d '{
"model": "wan-2.6-t2v",
"prompt": "A majestic dragon soaring through clouds at sunset, cinematic lighting, 4K quality",
"duration": 10,
"resolution": "1920x1080",
"audio": true
}'cURL Example — Wan 2.2 Spicy (Uncensored)
# Wan 2.2 Spicy Uncensored Image-to-Video API call — only from $0.03/s
curl -X POST "https://api.atlascloud.ai/v1/video/generate" \
-H "Authorization: Bearer YOUR_API_KEY" \
-H "Content-Type: application/json" \
-d '{
"model": "alibaba/wan-2.2-spicy/image-to-video",
"image_url": "https://example.com/your-image.jpg",
"prompt": "Smooth natural movement, artistic expression, high quality"
}'Python Example — Full Pipeline
"""
Complete Wan video generation example
Supports Wan 2.6 T2V, I2V, and Wan 2.2 Spicy
"""
import requests
import time
# Atlas Cloud API configuration
API_KEY = "YOUR_API_KEY"
BASE_URL = "https://api.atlascloud.ai/v1"
def generate_video(prompt: str, model: str = "wan-2.6-t2v", **kwargs) -> str:
"""
Call Wan model to generate video
Args:
prompt: Video description text
model: Model name, defaults to Wan 2.6
**kwargs: Other parameters such as duration, resolution, etc.
Returns:
Video download URL
"""
headers = {
"Authorization": f"Bearer {API_KEY}",
"Content-Type": "application/json"
}
payload = {
"model": model,
"prompt": prompt,
**kwargs
}
# Submit generation task
response = requests.post(f"{BASE_URL}/video/generate", json=payload, headers=headers)
task_id = response.json()["task_id"]
# Poll for results
while True:
status = requests.get(f"{BASE_URL}/video/status/{task_id}", headers=headers)
result = status.json()
if result["status"] == "completed":
return result["video_url"]
elif result["status"] == "failed":
raise Exception(f"Generation failed: {result.get('error', 'Unknown error')}")
time.sleep(2) # Check status every 2 seconds
# ====== Usage Examples ======
# Example 1: Wan 2.6 Text-to-Video (supports multi-shot + audio)
video_url = generate_video(
prompt="A chef preparing sushi in a traditional Japanese kitchen, warm lighting, close-up shots",
model="wan-2.6-t2v",
duration=10,
resolution="1920x1080",
audio=True
)
print(f"Video generated: {video_url}")
# Example 2: Wan 2.2 Spicy Uncensored generation (only $0.03)
spicy_url = generate_video(
prompt="Artistic figure study, classical painting style, natural movement",
model="alibaba/wan-2.2-spicy/image-to-video"
)
print(f"Spicy video generated: {spicy_url}")# 1. Install ComfyUI
git clone https://github.com/comfyanonymous/ComfyUI.git
cd ComfyUI
# 2. Install dependencies
pip install -r requirements.txt
# 3. Download Wan 2.6 model weights
# Download from Hugging Face (requires approximately 28GB disk space)
huggingface-cli download Wan-AI/Wan2.6-T2V-14B --local-dir models/wan2.6/
# 4. Install Wan ComfyUI nodes
cd custom_nodes
git clone https://github.com/Wan-Video/ComfyUI-Wan.git
pip install -r ComfyUI-Wan/requirements.txt
# 5. Launch ComfyUI
cd ..
python main.py --listen 0.0.0.0 --port 8188"""
Run Wan 2.1 locally using the Hugging Face Diffusers library
Suitable for users with NVIDIA GPUs
"""
from diffusers import WanPipeline
import torch
# Load model (1.3B version only needs 8.19GB VRAM)
pipe = WanPipeline.from_pretrained(
"Wan-AI/Wan2.1-T2V-1.3B",
torch_dtype=torch.float16
)
pipe = pipe.to("cuda")
# Generate video
video = pipe(
prompt="A golden retriever playing in autumn leaves, slow motion, warm sunlight",
num_frames=81, # approximately 5 second video
height=480,
width=720,
guidance_scale=7.5
).frames[0]
# Save video
from diffusers.utils import export_to_video
export_to_video(video, "output.mp4", fps=16)
print("Video saved to output.mp4")# Wan 2.6 Docker deployment configuration
FROM nvidia/cuda:12.1-runtime-ubuntu22.04
# Install system dependencies
RUN apt-get update && apt-get install -y \
python3 python3-pip git wget \
&& rm -rf /var/lib/apt/lists/*
# Install Python dependencies
RUN pip3 install torch torchvision --index-url https://download.pytorch.org/whl/cu121
RUN pip3 install diffusers transformers accelerate safetensors
# Download Wan model
RUN huggingface-cli download Wan-AI/Wan2.1-T2V-1.3B --local-dir /models/wan
# Copy inference script
COPY inference.py /app/inference.py
WORKDIR /app
CMD ["python3", "inference.py"]# Build and run Docker container
docker build -t wan-video .
docker run --gpus all -p 8080:8080 wan-video30+ carefully crafted prompts optimized for Wan models. Each includes the full prompt text, recommended model, settings, and tips.
1. Epic Mountain Landscape
- Prompt:
Sweeping aerial shot over snow-capped mountains at golden hour, dramatic clouds casting long shadows across alpine valleys, a winding river reflecting the amber sky, cinematic camera movement slowly pushing forward, volumetric lighting, 8K film grain - Model: Wan 2.6 T2V
- Duration: 15s
- Resolution: 1920×1080
- Tips: Enable multi-shot for a more dynamic sequence. Add audio for ambient wind sounds.
2. Urban Night Chase
- Prompt:
Tracking shot through neon-lit Tokyo streets at night, rain-soaked asphalt reflecting pink and blue neon signs, a figure in a dark coat running through crowds, handheld camera shake, Blade Runner aesthetic, moody atmosphere, cinematic color grading - Model: Wan 2.6 T2V
- Duration: 10s
- Resolution: 1920×1080
- Tips: Use negative prompt "blurry, low quality, static" for sharper motion.
3. Historical Battle Scene
- Prompt:
Wide establishing shot of a medieval battlefield at dawn, thousands of soldiers in formation, banners waving in the wind, mist rolling across the field, dramatic orchestral music feel, Lord of the Rings inspired cinematography, epic scale, film grain - Model: Wan 2.6 T2V
- Duration: 15s
- Resolution: 1920×1080
- Tips: Multi-shot mode works exceptionally well here for cut transitions between wide and close-up shots.
4. Underwater Documentary
- Prompt:
Slow underwater tracking shot through a vibrant coral reef, tropical fish swimming in formation, sunbeams penetrating the crystal-clear blue water, a sea turtle gliding gracefully through the frame, National Geographic style, 4K clarity - Model: Wan 2.6 T2V
- Duration: 10s
- Resolution: 1920×1080
- Tips: Add "smooth camera movement, steady" to avoid shaky underwater artifacts.
5. Noir Detective Scene
- Prompt:
Close-up of a detective in a dimly lit office, cigarette smoke curling through a single shaft of light from venetian blinds, black and white film noir style, slow camera dolly in, dramatic shadows, 1940s aesthetic, grain texture - Model: Wan 2.6 T2V
- Duration: 10s
- Resolution: 1280×720
- Tips: Specify "black and white, monochrome" explicitly for consistent noir look.
6. Classical Figure Study
- Prompt:
Renaissance-style figure study, soft diffused lighting on skin, classical painting composition, marble sculpture coming to life with subtle breathing movement, warm golden tones, museum gallery setting, artistic and elegant - Model: Wan 2.2 Spicy I2V
- Price: from $0.03/s
- Tips: Pair with a classical painting or sculpture reference image for best results.
7. Fashion Editorial Avant-Garde
- Prompt:
High fashion editorial, model in sheer flowing fabric, dramatic wind effect, stark white studio lighting, Helmut Newton inspired, bold poses transitioning fluidly, haute couture, magazine quality - Model: Wan 2.2 Spicy I2V
- Price: from $0.03/s
- Tips: Use LoRA variant for consistent style across multiple generations.
8. Artistic Body Movement
- Prompt:
Contemporary dance performance, fluid body movement, dramatic stage lighting, dancer's silhouette against colored backdrop, artistic expression through motion, modern ballet, intimate camera angles, professional choreography - Model: Wan 2.2 Spicy I2V
- Price: from $0.03/s
- Tips: Image-to-video works best — provide a dancer reference image.
9. Mythological Scene
- Prompt:
Birth of Venus reimagined, goddess emerging from ocean waves, flowing golden hair, seashells and foam, Pre-Raphaelite painting style, ethereal glow, mythological beauty, Botticelli inspired but cinematic - Model: Wan 2.2 Spicy I2V
- Price: from $0.03/s
- Tips: Works beautifully with classical art reference images.
10. Cyberpunk Character
- Prompt:
Cyberpunk character in revealing neon-lit outfit, holographic tattoos glowing on skin, futuristic nightclub setting, pulsing LED lights, confident walking motion, sci-fi fashion, Blade Runner 2049 aesthetic - Model: Wan 2.2 Spicy I2V LoRA
- Price: from $0.03/s
- Tips: Use LoRA for consistent character appearance across scenes.
11. Luxury Watch Showcase
- Prompt:
Macro close-up of a luxury watch rotating on a black velvet surface, golden light reflections dancing on polished metal, mechanical movement visible through transparent caseback, premium product photography, smooth 360-degree rotation - Model: Wan 2.6 T2V
- Duration: 10s
- Resolution: 1920×1080
- Tips: Slow, controlled rotation works best. Add "studio lighting, product photography" for commercial quality.
12. Food Commercial — Coffee Pour
- Prompt:
Slow motion pour of steaming espresso into a white ceramic cup, rich crema forming on surface, steam rising in soft morning light, rustic wooden table, out-of-focus bakery background, food photography, warm color palette, 120fps slow motion feel - Model: Wan 2.6 T2V
- Duration: 5s
- Resolution: 1920×1080
- Tips: Short duration with high detail works better for food content.
13. Sneaker Product Launch
- Prompt:
Dynamic sneaker reveal, shoe floating and rotating in mid-air against gradient background, particles and light streaks surrounding the product, dramatic lighting transitions, Nike commercial style, clean product visualization - Model: Wan 2.6 T2V
- Duration: 10s
- Resolution: 1920×1080
- Tips: Use multi-shot for different angles of the same product.
14. Perfume Ad — Ethereal
- Prompt:
Glass perfume bottle on a reflective surface, golden liquid catching light, slow motion flower petals falling around it, soft bokeh background, luxury brand commercial aesthetic, Chanel No.5 inspired visual, elegant and sophisticated - Model: Wan 2.6 T2V
- Duration: 10s
- Resolution: 1920×1080
- Tips: Add "glass reflections, caustics, luxury" for premium feel.
15. Tech Product Unboxing
- Prompt:
Cinematic unboxing of a sleek smartphone, hands slowly lifting the lid of a minimalist white box, device rising with a subtle glow, holographic interface appearing above the screen, Apple keynote style presentation, clean and modern - Model: Wan 2.6 I2V
- Duration: 10s
- Resolution: 1920×1080
- Tips: Use I2V with an actual product image for brand accuracy.
16. Anime Sakura Scene
- Prompt:
Anime style, a young girl standing under a blooming cherry blossom tree, pink petals falling in slow motion, gentle breeze moving her hair and school uniform, soft watercolor background, Studio Ghibli aesthetic, warm spring lighting - Model: Wan 2.6 T2V
- Duration: 10s
- Resolution: 1280×720
- Tips: Add "anime, cel-shaded, 2D animation" for consistent style.
17. Pixel Art Adventure
- Prompt:
Retro pixel art game scene, 16-bit hero character walking through a mystical forest, glowing mushrooms, floating fireflies, parallax scrolling background layers, SNES-era RPG aesthetic, vibrant pixel colors - Model: Wan 2.5 T2V
- Duration: 5s
- Resolution: 1280×720
- Tips: Lower resolution enhances the retro pixel art feel.
18. 3D Cartoon Character
- Prompt:
Pixar-style 3D animated character, a small robot with big expressive eyes discovering a flower in a post-apocalyptic landscape, WALL-E inspired, smooth character animation, subsurface scattering on plastic materials, Pixar render quality - Model: Wan 2.6 T2V
- Duration: 10s
- Resolution: 1920×1080
- Tips: Wan 2.6 handles 3D animation styles remarkably well.
19. Comic Book Action
- Prompt:
Marvel comic book style action sequence, superhero landing pose with impact shockwave, bold ink outlines, halftone dot shading, dynamic camera angle from below, speech bubble appearing, vibrant primary colors, Jack Kirby inspired - Model: Wan 2.6 T2V
- Duration: 5s
- Resolution: 1280×720
- Tips: Short bursts of action work best for comic book style.
20. Watercolor Storybook
- Prompt:
Watercolor illustration coming to life, a fox and rabbit having a tea party in an enchanted forest clearing, soft paint bleeding effects, storybook page turning animation, children's book illustration style, gentle pastel colors, whimsical - Model: Wan 2.6 T2V
- Duration: 10s
- Resolution: 1280×720
- Tips: "Watercolor, illustration, painted" keywords help maintain the artistic style.
21. Celebrity Impression
- Prompt:
Professional business presentation, confident speaker at a podium, formal attire, corporate event, smooth gestures and natural speech patterns - Model: Wan 2.2 Character Swap (animate-mix)
- Tips: Upload a source video of the presentation and a target face image. Works best with front-facing, well-lit source faces.
22. Historical Figure Revival
- Prompt:
Historical documentary narration, period-appropriate setting, speaking directly to camera, educational content - Model: Wan 2.2 Character Swap (animate-mix)
- Tips: Best results with clear, high-resolution reference photos. Maintain consistent lighting between source and target.
23. Music Video Face Swap
- Prompt:
Music performance, singer on stage, dynamic lighting, emotional expression, concert atmosphere - Model: Wan 2.2 Character Swap (animate-mix)
- Tips: Works well with music videos that have stable face tracking. Avoid extreme head rotations.
24. Dance Choreography Transfer
- Prompt:
Professional dance routine, hip-hop choreography, studio setting, full body visible, clean background - Model: Wan 2.2 Image-to-Animation (animate-move)
- Tips: Upload a dance video as motion source and a character image. Full-body shots with clean backgrounds work best.
25. Cartoon Character Animation
- Prompt:
Animated character performing complex movements, cartoon style, expressive body language - Model: Wan 2.2 Image-to-Animation (animate-move)
- Tips: Use a real person's motion video to animate any 2D or 3D character image.
26. Product Mascot Animation
- Prompt:
Brand mascot performing fun actions, waving, jumping, dancing, engaging personality - Model: Wan 2.2 Image-to-Animation (animate-move)
- Tips: Perfect for bringing static brand mascots to life with natural movement.
27. Time-Lapse City
- Prompt:
Hyper-lapse of a modern city skyline from day to night, clouds racing across the sky, lights gradually turning on in skyscrapers, traffic becoming streams of light, star trails appearing, 24-hour cycle compressed into seconds, urban time-lapse - Model: Wan 2.6 T2V
- Duration: 15s
- Resolution: 1920×1080
- Tips: 15s duration is ideal for day-to-night transitions.
28. Nature Macro — Butterfly Emergence
- Prompt:
Extreme macro shot of a monarch butterfly emerging from chrysalis, iridescent wing scales unfolding in slow motion, morning dew drops on wings, shallow depth of field, nature documentary quality, BBC Earth style - Model: Wan 2.6 T2V
- Duration: 10s
- Resolution: 1920×1080
- Tips: Macro subjects with slow, deliberate motion produce stunning results.
29. Sci-Fi Space Station
- Prompt:
Exterior shot of a massive space station orbiting Earth, ISS-inspired design but futuristic, solar panels rotating to track the sun, a shuttle approaching for docking, Earth's blue glow illuminating the structure, Interstellar cinematography - Model: Wan 2.6 T2V
- Duration: 15s
- Resolution: 1920×1080
- Tips: Add audio for immersive space ambience. Multi-shot for interior/exterior cuts.
30. Abstract Art in Motion
- Prompt:
Abstract fluid art in motion, vibrant acrylic paint pouring and mixing, golden veins forming between color pools, Kandinsky meets Jackson Pollock, mesmerizing organic patterns, satisfying ASMR-like visual, overhead camera view - Model: Wan 2.5 T2V
- Duration: 10s
- Resolution: 1280×720
- Tips: Abstract content is very forgiving and consistently produces beautiful results.
31. Wan 2.6 R2V — Character Reference
- Prompt:
Upload a reference video of a person (captures look + voice), then generate them in a new scene: walking through a futuristic city, interacting with holographic displays, maintaining exact appearance and voice characteristics - Model: Wan 2.6 R2V (Reference-to-Video)
- Duration: 10s
- Resolution: 1920×1080
- Tips: Wan 2.6 R2V maintains character identity (appearance + voice) from a reference video. Provide 3-5 second reference clip for best results.
32. Multi-Shot Narrative
- Prompt:
Shot 1: Wide establishing shot of a cozy café on a rainy Parisian street. Shot 2: Close-up of hands wrapping around a steaming coffee cup. Shot 3: Looking out the window as rain streams down the glass. Shot 4: Slow zoom out revealing the full café scene with warm interior lighting. - Model: Wan 2.6 T2V (Multi-Shot)
- Duration: 15s
- Resolution: 1920×1080
- Tips: Use the multi-shot feature by separating scenes with "Shot N:" prefix. Wan 2.6 will create smooth transitions between shots.
| Component | Wan 2.6 (14B) | Wan 2.1 (14B) | Wan 2.1 (1.3B) |
|---|---|---|---|
| GPU VRAM | 24GB+ | 24GB+ | 8.19GB |
| Recommended GPU | RTX 4090 / A100 | RTX 4090 / A100 | RTX 3060 12GB |
| System RAM | 32GB+ | 32GB+ | 16GB |
| Disk Space | ~56GB | ~28GB | ~5GB |
| OS | Linux / Windows | Linux / Windows | Linux / Windows |
| CUDA | 12.1+ | 11.8+ | 11.8+ |
# ======== Step 1: Install ComfyUI ========
git clone https://github.com/comfyanonymous/ComfyUI.git
cd ComfyUI
# Create virtual environment (Recommended)
python -m venv venv
source venv/bin/activate # Linux/Mac
# venv\Scripts\activate # Windows
# Install PyTorch (CUDA 12.1)
pip install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/cu121
# Install ComfyUI dependencies
pip install -r requirements.txt
# ======== Step 2: Install Wan Nodes ========
cd custom_nodes
git clone https://github.com/Wan-Video/ComfyUI-Wan.git
cd ComfyUI-Wan
pip install -r requirements.txt
cd ../..
# ======== Step 3: Download Models ========
# Option A: Full 14B model (Recommended, but requires 24GB+ VRAM)
huggingface-cli download Wan-AI/Wan2.6-T2V-14B \
--local-dir models/wan2.6-14b/
# Option B: Lightweight 1.3B model (only needs 8.19GB VRAM)
huggingface-cli download Wan-AI/Wan2.1-T2V-1.3B \
--local-dir models/wan2.1-1.3b/
# ======== Step 4: Launch ComfyUI ========
python main.py --listen 0.0.0.0 --port 8188
# Open in browser http://localhost:8188
# Import Wan workflow nodes to start using"""
VRAM optimization tips to help run Wan models on consumer GPUs
"""
import torch
from diffusers import WanPipeline
# Tip 1: Use half-precision floating point (save approximately 50% VRAM)
pipe = WanPipeline.from_pretrained(
"Wan-AI/Wan2.1-T2V-14B",
torch_dtype=torch.float16 # Use FP16 instead of FP32
)
# Tip 2: Enable attention slicing (significantly reduce peak VRAM usage)
pipe.enable_attention_slicing(slice_size="auto")
# Tip 3: Enable VAE slicing (progressively decode video frames)
pipe.enable_vae_slicing()
# Tip 4: Enable CPU offload (automatically move inactive modules to CPU)
pipe.enable_model_cpu_offload()
# Tip 5: Use xformers for acceleration (requires additional installation of xformers)
# pip install xformers
pipe.enable_xformers_memory_efficient_attention()
# Tip 6: Reduce resolution and frame count
video = pipe(
prompt="Your prompt here",
num_frames=49, # Reduce frame count (49 frames ≈ 3 seconds)
height=480, # Reduce resolution
width=720,
guidance_scale=7.5
).frames[0]
# Tip 7: Use 1.3B model instead of 14B
# 1.3B model only needs 8.19GB VRAM, quality remains excellent
pipe_light = WanPipeline.from_pretrained(
"Wan-AI/Wan2.1-T2V-1.3B",
torch_dtype=torch.float16
).to("cuda")| Category | Effective Keywords |
|---|---|
| Quality | 8K, cinematic, film grain, high detail, professional, masterpiece |
| Camera | tracking shot, dolly zoom, aerial shot, close-up, macro, wide angle |
| Lighting | golden hour, volumetric lighting, rim lighting, neon glow, soft diffused |
| Motion | slow motion, time-lapse, smooth pan, gentle movement, dynamic action |
| Style | photorealistic, anime, watercolor, oil painting, cyberpunk, noir |
| Mood | dramatic, ethereal, cozy, epic, melancholic, whimsical |
# Recommended universal negative prompts
blurry, low quality, distorted, deformed, ugly, bad anatomy,
watermark, text overlay, logo, pixelated, noise, artifacts,
oversaturated, underexposed, static image, no movement,
extra fingers, missing limbs, unnatural proportions
# Multi-shot narrative template
Shot 1: [Wide/Establishing] - Set the scene with environment and context
Shot 2: [Medium] - Introduce the subject or character
Shot 3: [Close-up] - Emotional detail or key action
Shot 4: [Wide/Pull-back] - Resolution or reveal
# Example: Coffee commercial multi-shot
Shot 1: Aerial view of a Colombian coffee plantation at sunrise, morning mist.
Shot 2: Farmer's weathered hands picking ripe coffee cherries.
Shot 3: Extreme close-up of coffee beans being roasted, oils glistening.
Shot 4: Steam rising from a freshly poured cup in a modern café.
Wan 2.6 supports native audio synchronization. Tips for best results:
- Environmental sounds are generated automatically based on visual content
- Add audio-specific cues in your prompt:
"birds chirping","rain on window","crowd cheering" - Audio quality improves with longer duration (10-15s)
- Use
audio: truein API calls to enable audio generation
| Resolution | VRAM Usage | Generation Time | Best For |
|---|---|---|---|
| 1920×1080 | ~24GB | ~3-5 min | Final production |
| 1280×720 | ~16GB | ~1-3 min | Draft & testing |
| 854×480 | ~10GB | ~30-60s | Rapid prototyping |
| 640×360 | ~8GB | ~15-30s | Quick preview |
| Provider | Wan 2.6 T2V | Wan 2.5 T2V | Wan Spicy 🌶️ | Key Features |
|---|---|---|---|---|
| Atlas Cloud ⭐ | from $0.07/s (70% off) | Lower | from $0.03/s | ✅ All models, NSFW, LoRA, V2V, Multi-shot |
| Alibaba Cloud | $0.10+ | $0.08+ | N/A | |
| Replicate | $0.15+ | $0.12+ | Limited | Basic API only |
| Self-hosted | GPU cost/hr | GPU cost/hr | GPU cost/hr | ✅ Full control, ❌ maintenance burden |
Prices shown are starting prices per second of generated video. Actual cost depends on resolution, duration, and model selected.
| Use Case | Model | Videos/Month | Atlas Cloud Cost | Self-Hosted (A100) |
|---|---|---|---|---|
| Social Media Creator | Wan 2.6 T2V | 500 | $35/mo | ~$500/mo |
| NSFW Art Creator | Wan Spicy | 1,000 | $30/mo | ~$500/mo |
| Commercial Production | Wan 2.6 T2V | 100 | $7/mo | ~$500/mo |
| Hobbyist | Wan 2.5 T2V | 50 | <$5/mo | ~$500/mo |
💡 Pro tip: Sign up for Atlas Cloud and get 25% bonus on your first recharge (up to $100 bonus). That's up to 1,785 Wan Spicy videos or 1,785+ Wan 2.6 videos for $100+$25 bonus!
| Model | fal.ai Price | Atlas Cloud Price | You Save |
|---|---|---|---|
| Kling | $0.224/sec (5s = $1.12) | from $0.204/s | 82% cheaper |
| Seedance | ~$0.26/video | from $0.044/s | 15% cheaper |
| Wan 2.5 | $0.05/sec (5s = $0.25) | from $0.05/s | 80% cheaper |
| Wan 2.6 | Similar pricing | from $0.07/s | Competitive |
| Veo 3 | $0.40/sec (8s = $3.20) | TBD | Coming soon |
| Vidu Q3-Pro | — | from $0.06/s | Atlas exclusive |
| Vidu Q3-Turbo | — | from $0.034/s | Atlas exclusive |
Prices shown are starting prices per second of generated video. Actual cost depends on resolution, duration, and model selected.
💡 Atlas Cloud offers the lowest prices across all major video models. Switch from fal.ai and save up to 82% on your video generation costs.
| Resource | Link | Notes |
|---|---|---|
| Hugging Face — Wan-AI | huggingface.co/Wan-AI | All model weights |
| ModelScope — Wan | modelscope.cn/Wan-Video | Alternative download (faster in Asia) |
| Wan 2.6 T2V 14B | HF Link | Latest text-to-video |
| Wan 2.6 I2V 14B | HF Link | Latest image-to-video |
| Wan 2.1 T2V 1.3B | HF Link | Lightweight, consumer GPU |
| Paper | Link | Description |
|---|---|---|
| Wan 2.1 Technical Report | arXiv | Architecture & training details |
| VBench Leaderboard | vbench.org | Benchmark rankings |
| Repository | Description |
|---|---|
| Wan-Video/Wan2.6 | Wan 2.6 official code |
| Wan-Video/Wan2.1 | Wan 2.1 official code |
| Extension | Description | Stars |
|---|---|---|
| ComfyUI-Wan | Official Wan nodes for ComfyUI | ⭐ |
| ComfyUI-WanVideoWrapper | Community wrapper with extra features | ⭐ |
| Resource | Description |
|---|---|
| Civitai — Wan LoRAs | Community-trained LoRA models |
| Wan LoRA Training Guide | Official fine-tuning documentation |
| Platform | Link |
|---|---|
| Reddit — r/WanVideo | reddit.com/r/WanVideo |
| Discord — Wan Community | Community Discord server |
| Twitter/X — #WanVideo | Search #WanVideo |
| Title | Type | Link |
|---|---|---|
| Wan 2.6 Complete Guide | Video | YouTube |
| ComfyUI + Wan Workflow | Article | Blog |
| Wan LoRA Training Tutorial | Video | YouTube |
| Wan vs Kling vs Sora Comparison | Article | Blog |
What is the Wan AI video model?
Wan is Alibaba's video generation model series. Wan 2.1 and 2.2 are open-source (Apache 2.0) with weights available on Hugging Face for self-deployment, while Wan 2.5 and 2.6 are closed-source commercial API models. It uses a diffusion transformer architecture trained on 1.5 billion videos and 10 billion images. The Wan 2.1 model achieved the #1 position on the VBench leaderboard with a score of 86.22%, surpassing Sora (84.28%) and Luma (83.61%). The latest version is Wan 2.6, which supports multi-shot 1080p video generation up to 15 seconds with native audio synchronization, available via API.
How to run Wan locally?
You can run Wan locally using:
- ComfyUI — Install the Wan ComfyUI nodes and download model weights
- Hugging Face Diffusers — Use the
WanPipelineclass in Python - Docker — Deploy with our provided Dockerfile
The 1.3B model requires only 8.19GB VRAM, making it accessible on consumer GPUs like the RTX 3060. The 14B model needs 24GB+ VRAM (RTX 4090 or A100).
See the Local Deployment Guide for step-by-step instructions.
Wan vs Seedance vs Kling — Which is better?
| Feature | Wan 2.6 | Seedance | Kling |
|---|---|---|---|
| Open Source | ❌ (2.1/2.2 are open) | ❌ | ❌ |
| Max Duration | 15s | 10s | 10s |
| Audio | ✅ | ❌ | ❌ |
| Multi-Shot | ✅ | ❌ | ❌ |
| NSFW | ✅ (Spicy) | ❌ | ❌ |
| Local Deploy | ❌ (2.1/2.2 can) | ❌ | ❌ |
| API Price | From $0.03 | $0.20+ | $0.30+ |
Wan wins on: pricing, NSFW support, and features. Wan 2.1/2.2 offer open-source flexibility with local deployment, while Wan 2.5/2.6 provide cutting-edge quality via API. Seedance and Kling may have slightly different motion quality for specific use cases, but Wan's rapid iteration makes it the best long-term choice.
What is the best GPU for Wan 2.6?
| GPU | VRAM | Can Run | Performance |
|---|---|---|---|
| RTX 3060 12GB | 12GB | ✅ (1.3B only) | Basic |
| RTX 3090 | 24GB | ✅ (14B with optimization) | Good |
| RTX 4090 | 24GB | ✅ (14B full speed) | Excellent |
| A100 40GB | 40GB | ✅ (all models, batch) | Professional |
| A100 80GB | 80GB | ✅ (all models, high batch) | Enterprise |
Budget recommendation: RTX 4090 for local deployment, or use Atlas Cloud API from $0.03/s if you don't want to invest in hardware.
Is there an uncensored Wan model?
Yes! The Wan 2.2 Spicy 🌶️ is an uncensored/NSFW variant that allows unrestricted creative content generation. It's available via:
- Atlas Cloud API at just from $0.03/s — the cheapest uncensored video generation API available
- Models:
alibaba/wan-2.2-spicy/image-to-videoandalibaba/wan-2.2-spicy/image-to-video-lora(with LoRA support)
How much does Wan video generation cost via API?
Atlas Cloud offers the most affordable Wan API pricing:
- Wan 2.2 Spicy (NSFW): from $0.03/s
- Wan 2.6 T2V (1080p, 15s): from $0.07/s (70% discount)
- Wan 2.5 variants: Even lower pricing
- 25% bonus on first recharge (up to $100)
Compare this to Sora ($0.50+/video) or Runway ($0.50+/video). Get started →
What is Wan 2.6 R2V?
Wan 2.6 R2V (Reference-to-Video) is a unique feature where you upload a character reference video that captures both the person's appearance and voice. The model then generates new video scenes featuring that same character with consistent identity. This is perfect for creating virtual avatars, personalized content, and character-driven narratives.
When is Wan 2.7 coming out?
Wan 2.7 is expected around March 2026, following the consistent release cadence of the Wan series (2.1 in Q4 2024, 2.2 in Q1 2025, 2.5 in Q2 2025, 2.6 in Q3 2025). Based on each version's progression, Wan 2.7 is likely to bring longer video duration (20s+), possible 4K resolution support, better motion control, improved character consistency, and enhanced audio synchronization. See the Wan 2.7 section for a detailed breakdown.
Wan 2.7 will be available on Atlas Cloud upon release. Sign up for Atlas Cloud to be notified when Wan 2.7 launches.
From $0.03/s with Wan Spicy to $0.07/s for Wan 2.6 — the most affordable AI video generation API.
| ✅ Uncensored | Wan 2.2 Spicy for unrestricted creativity |
| ✅ Open Source (2.1/2.2) | Run locally or use via API |
| ✅ Wan 2.7 Coming March 2026 | Longer duration, higher resolution, better control |
| ✅ 25% Bonus | On first top-up (up to $100 bonus) |
| ✅ All Models | T2V, I2V, V2V, Image-Edit, Character Swap, Animation |
No GPU required. No setup. Just prompts and videos.
If you find this resource helpful, please give it a star! It helps others discover the Wan ecosystem.
Contributions are welcome! Please read the contribution guidelines before submitting a PR.
- Fork this repository
- Create your feature branch:
git checkout -b add-new-resource - Add your resource in the appropriate section
- Commit your changes:
git commit -m 'Add new resource' - Push to the branch:
git push origin add-new-resource - Submit a pull request
- New Wan model tools, extensions, and integrations
- Community tutorials and guides
- LoRA models and fine-tuning resources
- Prompt examples with sample outputs
- Benchmark results and comparisons
- Translations to other languages
This work is licensed under the MIT License.
🌊 Wan — Pushing the Boundaries of AI Video Generation 🌊
Made with ❤️ by the open-source community