Add LTX 2.0 Video Pipelines #12915
base: main
Conversation
LTX 2.0 Vocoder Implementation
LTX 2.0 Video VAE Implementation
Cc: @matanby if you want to test this PR on your end. We will shortly be adding the upsampling pipeline as well.

no audio encode?

@bghira, so that I understand correctly, is the request for an analogue of

the audio autoencoder is missing the encode() function, which exists in the LTX-2 repo from Lightricks; ComfyUI has audio encoding as well

@bghira thanks for the clarification! We will support the audio VAE encoder.
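To make the request concrete: the ask is for the audio autoencoder to expose an encode() that is symmetric with its existing decode(). The sketch below is purely illustrative (a toy class, not the diffusers API or the LTX-2 model) and only shows the interface shape being discussed.

```python
# Hypothetical stand-in (NOT the actual diffusers/LTX-2 API) illustrating
# the symmetric encode()/decode() interface requested for the audio VAE.
class ToyAudioAutoencoder:
    """encode() maps a waveform to latents; decode() maps latents back."""

    def __init__(self, downsample_factor: int = 4):
        self.downsample_factor = downsample_factor

    def encode(self, waveform):
        # Toy "compression": average every `downsample_factor` samples.
        f = self.downsample_factor
        return [sum(waveform[i:i + f]) / f for i in range(0, len(waveform), f)]

    def decode(self, latents):
        # Toy "decompression": repeat each latent value back out.
        f = self.downsample_factor
        return [v for v in latents for _ in range(f)]
```

The real encoder is of course a learned network; the point here is only that encoding is the missing inverse of the decode path that already ships.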
sayakpaul left a comment
Some small comments.
tests/models/autoencoders/test_models_autoencoder_kl_ltx2_audio.py
num_rope_elems = num_pos_dims * 2

# 3. Create a 1D grid of frequencies for RoPE
freqs_dtype = torch.float64 if self.double_precision else torch.float32
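For context on what the snippet above is setting up: a 1D RoPE frequency grid is conventionally one frequency per channel pair, theta^(-2i/d). The sketch below is a pure-Python stand-in (the PR's actual tensor code and constants may differ) just to show the shape of that computation; the dtype line in the diff decides whether this grid is built in float64 or float32.

```python
def rope_freqs_1d(num_elems: int, theta: float = 10000.0):
    """Sketch (assumed, not the PR's exact code) of a 1D RoPE frequency
    grid: one frequency per channel pair, computed as theta^(-2i / d)."""
    return [theta ** (-2.0 * i / num_elems) for i in range(num_elems // 2)]
```

Each position index is later multiplied by these frequencies to produce the rotation angles applied to query/key channel pairs.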
(nit): we could set self.freqs_dtype in __init__ to avoid computing it on every call.
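The suggested refactor looks something like the following sketch (illustrative names, strings standing in for torch dtypes so the idea is self-contained):

```python
# Sketch of the nit above: pick the dtype once in __init__ rather than
# re-branching on self.double_precision in every forward pass.
# "float64"/"float32" stand in for torch.float64/torch.float32 here.
class RopeModule:
    def __init__(self, double_precision: bool = False):
        self.double_precision = double_precision
        # Cached once (the suggestion):
        self.freqs_dtype = "float64" if double_precision else "float32"

    def forward(self):
        # Callers now reuse the cached attribute.
        return self.freqs_dtype
```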
video_cross_attn_rotary_emb = self.cross_attn_rope(video_coords[:, 0:1, :], device=hidden_states.device)
audio_cross_attn_rotary_emb = self.cross_attn_audio_rope(
    audio_coords[:, 0:1, :], device=audio_hidden_states.device
)
(nit): would be nice to have a comment explaining the small indexing going on there.
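For readers following along, the indexing in question is `coords[:, 0:1, :]`: slicing with `0:1` (rather than plain `0`) selects only the first positional dimension while preserving the middle axis, so downstream code still sees a 3D shape. A nested-list stand-in (assuming `coords` has shape (batch, pos_dims, tokens); the real code operates on tensors):

```python
def slice_first_pos_dim(coords):
    """Mimics tensor indexing coords[:, 0:1, :] on shape (B, D, N):
    keep only the first positional dimension, but keep the axis."""
    return [batch[0:1] for batch in coords]

coords = [[[0, 1, 2],    # dim 0 (e.g. frame index) for 3 tokens
           [3, 4, 5],    # dim 1 (e.g. height)
           [6, 7, 8]]]   # dim 2 (e.g. width)
sliced = slice_first_pos_dim(coords)  # shape (1, 1, 3), axis preserved
```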
* Initial implementation of LTX 2.0 latent upsampling pipeline
* Add new LTX 2.0 spatial latent upsampler logic
* Add test script for LTX 2.0 latent upsampling
* Add option to enable VAE tiling in upsampling test script
* Get latent upsampler working with video latents
* Fix typo in BlurDownsample
* Add latent upsample pipeline docstring and example
* Remove deprecated pipeline VAE slicing/tiling methods
* make style and make quality
* When returning latents, return unpacked and denormalized latents for T2V and I2V
* Add model_cpu_offload_seq for latent upsampling pipeline

Co-authored-by: Daniel Gu <[email protected]>
What does this PR do?
This PR adds pipelines for the LTX 2.0 video generation model (code, weights). LTX 2.0 is an audio-video foundation model that generates videos with synced audio; it supports generation tasks such as text-to-video (T2V), text-image-to-video (TI2V), and more.
You can try out T2V generation as follows:
python scripts/ltx2_test_full_pipeline.py \
    --model_id Lightricks/LTX-2 \
    --revision refs/pr/3 \
    --cpu_offload

Note that LTX 2.0 video generation uses a lot of memory; CPU offloading is necessary even on an A100 with 80 GB of VRAM (assuming no memory optimizations other than bf16 inference are used).

Similarly, you can try out I2V generation with:

python scripts/ltx2_test_full_pipeline_i2v.py \
    --model_id Lightricks/LTX-2 \
    --revision refs/pr/3 \
    --image_path https://huggingface.co/datasets/huggingface/documentation-images/resolve/main/diffusers/astronaut.jpg \
    --cpu_offload

Here is an I2V sample from the above:
ltx2_i2v_sample.mp4
Who can review?
Anyone in the community is free to review the PR once the tests have passed. Feel free to tag
members/contributors who may be interested in your PR.
@yiyixuxu
@sayakpaul
@ofirbb