A memory-efficient enhancement of the WanVaceToVideo node that supports multiple control inputs (pose, depth, edge) in a single node, solving the VAE triple-loading memory crisis.
- Single VAE Instance: Uses one VAE for all control processing, saving ~20-30GB VRAM
- Multi-Control Support: Process pose, depth, and edge controls simultaneously
- Granular Control: Independent strength controls for video and mask per control type
- Full Backward Compatibility: Works seamlessly with existing WanVaceToVideo workflows
- Multiple Combination Modes: Choose how controls are combined (multiply, add, average, max)
-
Navigate to your ComfyUI custom nodes directory:
cd ComfyUI/custom_nodes/ -
Clone this repository:
git clone https://github.com/yourusername/WanVaceToVideoMC.git
-
Restart ComfyUI
The node appears as "WAN VACE to Video (Multi-Control)" in the node menu under conditioning/video_models.
New inputs for multi-control:
control_video_pose/control_masks_pose- Pose control inputscontrol_video_depth/control_masks_depth- Depth control inputscontrol_video_edge/control_masks_edge- Edge control inputsstrength_video_*- Control strength for video influence (0.0-10.0)strength_mask_*- Control strength for mask influence (0.0-10.0)multi_control_mode- How to combine controls: multiply, add, average, max
For backward compatibility, you can still use the original inputs:
control_video- Single control video inputcontrol_masks- Single control maskreference_image- Reference image for style
Note: You cannot use both legacy and multi-control inputs simultaneously.
Traditional approach (3 separate WanVaceToVideo nodes):
- VAE memory usage: ~30-45GB (3 × 10-15GB)
WanVaceToVideoMultiControl:
- VAE memory usage: ~10-15GB (single instance)
- Savings: ~20-30GB VRAM
- multiply: Most restrictive - all controls must agree
- add: Most permissive - any control can influence
- average: Balanced combination (default)
- max: Strongest signal wins
- ComfyUI (latest version)
- PyTorch >= 2.0.0
- CUDA-capable GPU (tested on 3x A6000 setup)
Developed at Zerospace for production workflows with WAN VACE 14B video generation.
AGPL-3.0 License - See LICENSE file for details.