Skip to content
Open
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
3 changes: 2 additions & 1 deletion docs.json
Original file line number Diff line number Diff line change
Expand Up @@ -245,7 +245,8 @@
"group": "Video Generation",
"icon": "video",
"pages": [
"video-generation"
"video-generation",
"examples/video-generation/ltx-video-2-3"
]
},
{
Expand Down
122 changes: 122 additions & 0 deletions examples/video-generation/ltx-video-2-3.mdx
Original file line number Diff line number Diff line change
@@ -0,0 +1,122 @@
---
title: Generating Video with LTX-2.3 and ComfyUI
slug: ltx-video-2-3-vast
createdAt: Thu Mar 13 2026 00:00:00 GMT+0000 (Coordinated Universal Time)
updatedAt: Mon Mar 31 2026 00:00:00 GMT+0000 (Coordinated Universal Time)
---

<script type="application/ld+json" dangerouslySetInnerHTML={{
__html: JSON.stringify({
"@context": "https://schema.org",
"@type": "HowTo",
"name": "How to Generate Video and Audio with LTX-2.3 on Vast.ai",
"description": "Deploy Lightricks LTX-2.3, a 22B parameter joint audio-video model, on Vast.ai using the pre-built ComfyUI template. Generate text-to-video, image-to-video, audio-conditioned video, and keyframe interpolation through a visual interface.",
"step": [
{
"@type": "HowToStep",
"name": "Deploy the LTX-2.3 Template",
"text": "Go to the LTX-2.3 model page on Vast.ai and click Deploy Now. Select a GPU instance and click Rent. The template automatically downloads all model weights during provisioning."
},
{
"@type": "HowToStep",
"name": "Open ComfyUI",
"text": "Once the instance finishes provisioning, click the Open button to launch ComfyUI in your browser. Four pre-configured LTX-2.3 workflows are available in the sidebar."
},
{
"@type": "HowToStep",
"name": "Run a Workflow",
"text": "Select a workflow from the sidebar: text-to-video, image-to-video, image+audio-to-video, or first-last-frame interpolation. Enter a prompt, optionally upload input media, and click Run to generate video."
}
]
})
}} />

[LTX-2.3](https://huggingface.co/Lightricks/LTX-2.3) is the first open-source model that generates synchronized video and audio together in a single pass. Built by Lightricks, this 22B-parameter Diffusion Transformer doesn't just create video — it produces matching audio at the same time, whether that's dialogue with lip-synced characters, ambient soundscapes, or music-driven visuals.

What makes it stand out:

- **Joint audio-video generation** — no need to generate video and audio separately and try to sync them
- **Multiple generation modes** — text-to-video, image-to-video, audio-conditioned video, and keyframe interpolation
- **Fast inference** — the distilled pipeline generates in just 8 denoising steps
- **Open source** — full model weights available on HuggingFace, trainable with LoRA in under an hour

Running LTX-2.3 requires a powerful GPU. Vast.ai gives you on-demand access to the hardware you need, and the pre-built ComfyUI template means you can go from zero to generating videos in minutes — no CLI, SSH, or manual setup required.

## Find and rent your GPU

1. **Setup your Vast account and add credit:** Review the [quickstart guide](/documentation/get-started/quickstart) if you do not have an account with credits loaded.
2. **Deploy the LTX-2.3 template:** Go to the [LTX-2.3 model page](https://vast.ai/model/ltx-2-3) and click **Deploy Now**. This takes you to the Vast console with the LTX-2.3 ComfyUI template pre-selected.
3. **Select a GPU:** Choose an instance from the list and click **Rent**.

<Frame caption="Vast.ai console showing available RTX 5090 instances with the LTX-2.3 ComfyUI template">
![Vast.ai console showing available RTX 5090 instances with the LTX-2.3 ComfyUI template](/images/ltx-2-3/02-select-gpu.jpg)
</Frame>

## Wait for provisioning

After renting, the instance automatically downloads all required model weights. You'll see a loading screen while models download. On a fast connection this takes just a few minutes. Once complete, the instance status shows a green **Running** indicator.

<Frame caption="Running instance with Open button ready">
![Running instance showing verified status](/images/ltx-2-3/03-instance-running.jpg)
</Frame>

## Open ComfyUI

Click the **Open** button on your instance to launch the Instance Portal. Click **Launch Application** under **ComfyUI** to open the visual workflow editor.

<Frame caption="Instance Portal with ComfyUI and other available services">
![Instance Portal showing ComfyUI, API Wrapper, Jupyter, and other services](/images/ltx-2-3/04-instance-portal.jpg)
</Frame>

In the left sidebar under **Workflows > Browse**, you'll see four pre-loaded workflows:

| Workflow | Description |
|----------|-------------|
| `video_ltx2_3_t2v` | Text-to-Video |
| `video_ltx2_3_i2v` | Image-to-Video |
| `video_ltx2_3_ia2v` | Image + Audio-to-Video |
| `video_ltx2_3_flf2v` | First & Last Frame Interpolation |

## Text-to-Video

Select **video_ltx2_3_t2v** from the sidebar. Enter a descriptive prompt in the **Video Generation** node — describe camera angles, lighting, and motion cinematically. Adjust width, height, and frame count if desired (defaults: 1280x720, 121 frames, 25 fps). Click **Run**.

<Frame caption="Text-to-Video workflow generating a video of high-tech machinery">
![Text-to-Video workflow in ComfyUI](/images/ltx-2-3/05-workflow-t2v.jpg)
</Frame>

The workflow includes automatic prompt enhancement powered by Gemma 3, which expands short prompts into detailed cinematic descriptions.

## Image-to-Video

Select **video_ltx2_3_i2v**. Upload a reference image in the **Load Image** node (a sample Egyptian queen image is included). Enter a prompt describing how the image should come to life. Click **Run**. The model uses your image as the first frame and generates consistent motion.

<Frame caption="Image-to-Video workflow animating an Egyptian queen portrait">
![Image-to-Video workflow in ComfyUI](/images/ltx-2-3/06-workflow-i2v.jpg)
</Frame>

## Image + Audio-to-Video

Select **video_ltx2_3_ia2v**. Upload a reference image and an audio file (a sample MP3 is included). Enter a prompt describing the scene. Click **Run**. The model generates video synchronized to the audio — lip movements match dialogue, and scene energy follows the audio's rhythm.

<Frame caption="Image + Audio-to-Video workflow syncing a character to an audio track">
![Image+Audio-to-Video workflow in ComfyUI](/images/ltx-2-3/07-workflow-ia2v.jpg)
</Frame>

## First & Last Frame Interpolation

Select **video_ltx2_3_flf2v**. Load two images — a first frame and a last frame (sample car images are included). Enter a prompt describing the transition. Click **Run**. The model generates a smooth video interpolation between your two keyframes.

<Frame caption="First & Last Frame interpolation generating a smooth camera transition between two car shots">
![First-Last-Frame interpolation workflow in ComfyUI](/images/ltx-2-3/08-workflow-flf2v.jpg)
</Frame>

## Cleanup

When finished, go to the [Vast.ai console](https://cloud.vast.ai/instances/) and click **Delete** on your instance to stop charges.

## Resources

- [LTX-2.3 on HuggingFace](https://huggingface.co/Lightricks/LTX-2.3)
- [LTX-2 GitHub Repository](https://github.com/Lightricks/LTX-2)
- [Vast.ai Model Page](https://vast.ai/model/ltx-2-3)
Binary file added images/ltx-2-3/01-instance-details.jpg
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added images/ltx-2-3/02-select-gpu.jpg
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added images/ltx-2-3/03-instance-running.jpg
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added images/ltx-2-3/04-instance-portal.jpg
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added images/ltx-2-3/05-workflow-t2v.jpg
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added images/ltx-2-3/06-workflow-i2v.jpg
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added images/ltx-2-3/07-workflow-ia2v.jpg
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added images/ltx-2-3/08-workflow-flf2v.jpg
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.