Add Dream 7B Diffusion Large Language Model Pipeline #12091

dg845 · 2025-08-07T09:21:19Z

What does this PR do?

This PR implements a pipeline for the Dream 7B diffusion large language model (blog post, weights and code, repo). Dream is a masked (discrete) diffusion model for text which claims to perform comparably to similarly sized SOTA autoregressive LLMs such as Qwen 2.5 7B on NLP tasks and have superior performance on planning tasks.

Fixes #12017.

Before submitting

This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
Did you read the contributor guideline?
Did you read our philosophy doc (important for complex PRs)?
Was this discussed/approved via a GitHub issue or the forum? Please add a link to it if that's the case.
Did you make sure to update the documentation with your changes? Here are the
documentation guidelines, and
here are tips on formatting docstrings.
Did you write any new necessary tests?

Who can review?

Anyone in the community is free to review the PR once the tests have passed. Feel free to tag
members/contributors who may be interested in your PR.

@yiyixuxu
@a-r-r-o-w
@ntoxeg

…for inference

dg845 · 2025-08-15T03:44:14Z

@yiyixuxu @a-r-r-o-w the Dream 7B model uses a transformers-style custom tokenizer, which I believe is based on GPT2Tokenizer but with different pre-tokenization rules. Since this tokenizer is not in transformers, should I open a PR there to add it? And if so, do you think the Dream transformer should also be added to transformers? The original transformer implementation is also transformers-compatible.

a-r-r-o-w · 2025-08-16T22:21:46Z

In case of certain custom implementations, we try to implement and keep the relevant files within diffusers. Some examples I could quickly find are:

Maybe in this case you could create a tokenizer_gpt.py file within the pipeline directory to use it? WDYT @yiyixuxu?

I don't think the model implementation can live in transformers if we're using it for diffusion sampling. For example, Cosmos 1.0 released with an autoregressive and diffusion version, but we have two different implementations and PRs for support it in both libraries. So, let's maintain it here :)

dg845 added 2 commits August 7, 2025 01:57

Initial implementation of Dream Transformer model

362f10a

make style and make quality

90ed142

dg845 mentioned this pull request Aug 7, 2025

Dream 7B #12017

Open

2 tasks

ntoxeg mentioned this pull request Aug 8, 2025

TensorRT Support DreamLM/Dream#45

Open

dg845 added 7 commits August 11, 2025 15:41

Initial implementation of Dream masked diffusion scheduler

af38299

make style and make quality

ee6f585

Improve Dream transformer forward method and add embed_tokens method …

0b52b1f

…for inference

Add comment in scheduler further explaining shifting model_outputs

4b7ef3b

Initial commit for Dream 7B LLM pipeline

b7e6991

make style and make quality

ed50d57

Merge branch 'main' into dream-7b-pipeline

94d3cf5

dg845 added 7 commits August 19, 2025 17:13

Add transformers-style Dream tokenizer from the original code

506354e

make style and make quality

733bde7

Merge branch 'main' into dream-7b-pipeline

a49fe39

Add DreamTokenizer to src/diffusers/__init__.py

c1b1a27

Force Dream RoPE freqs to be calculated in float32

89b868a

Fix shape errors in Dream transformer

ff03fc2

Add modeling tests for the Dream transformer

c83abf5

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add Dream 7B Diffusion Large Language Model Pipeline #12091

Add Dream 7B Diffusion Large Language Model Pipeline #12091

Uh oh!

dg845 commented Aug 7, 2025

Uh oh!

dg845 commented Aug 15, 2025

Uh oh!

a-r-r-o-w commented Aug 16, 2025

Uh oh!

Uh oh!

Add Dream 7B Diffusion Large Language Model Pipeline #12091

Are you sure you want to change the base?

Add Dream 7B Diffusion Large Language Model Pipeline #12091

Uh oh!

Conversation

dg845 commented Aug 7, 2025

What does this PR do?

Before submitting

Who can review?

Uh oh!

dg845 commented Aug 15, 2025

Uh oh!

a-r-r-o-w commented Aug 16, 2025

Uh oh!

Uh oh!