Add support for torch.compile dynamic shapes #30560
Conversation
Thank you for working on this! Just left a minor comment but it looks good!
(Note that afaik Transformers does not enforce line length violations; see line 6 in 0ae789e: `ignore = ["C901", "E501", "E741", "F402", "F823" ]`.)
Could you add a test for `dynamic=True`?
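A minimal sketch of what such a test could look like; the checkpoint name, tolerances, and test structure here are illustrative assumptions, not taken from this PR:

```python
# Hypothetical test sketch: checkpoint, tolerances, and structure are
# assumptions for illustration, not the actual test added in this PR.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

def test_sdpa_compile_dynamic():
    name = "hf-internal-testing/tiny-random-MistralForCausalLM"  # assumed tiny checkpoint
    tokenizer = AutoTokenizer.from_pretrained(name)
    model = AutoModelForCausalLM.from_pretrained(name, attn_implementation="sdpa").eval()
    compiled_model = torch.compile(model, dynamic=True)

    # Vary the sequence length so the compiled graph must handle dynamic
    # shapes rather than specializing on a single input size.
    for prompt in ["Hi", "Hi there, this is a deliberately longer prompt."]:
        inputs = tokenizer(prompt, return_tensors="pt")
        with torch.no_grad():
            eager_logits = model(**inputs).logits
            compiled_logits = compiled_model(**inputs).logits
        torch.testing.assert_close(eager_logits, compiled_logits, rtol=1e-4, atol=1e-4)
```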
LGTM @fxmarty feel free to merge when CIs are green if it's fine with you!
The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.
@fxmarty @ArthurZucker I didn't touch Llava. The current "Tensor-likes are not close" error shouldn't have anything to do with this PR. It should be ready to go from my end.

No worries, rebasing on main should most probably fix this!
Merging as this is quite important. Let's keep an eye on the slow tests that will be triggered!
This PR adds support for compiling models with dynamic shapes (`dynamic=True`) to almost all models with SDPA attention implementations that currently do not support them. #30442 added support for Llama, Gemma, OLMo, & Cohere.

The only model not modified is DBRX, which needs the changes from both #30070 and #30442 to add support for SDPA's Flash Attention kernel and for dynamic shapes, as I believe it suffers from the same training memory issues detailed in #30010.
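For context, a minimal sketch of the usage this enables; the checkpoint name here is an illustrative assumption, and any SDPA-backed model is meant to behave the same way:

```python
# Minimal usage sketch (checkpoint is an assumption for illustration).
# dynamic=True asks the compiler to trace symbolic sequence lengths instead
# of specializing and recompiling for every new input shape.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

name = "hf-internal-testing/tiny-random-LlamaForCausalLM"
tokenizer = AutoTokenizer.from_pretrained(name)
model = AutoModelForCausalLM.from_pretrained(name, attn_implementation="sdpa")
compiled_model = torch.compile(model, dynamic=True)

# Inputs of different lengths reuse the same compiled graph.
for prompt in ["short", "a noticeably longer input sequence"]:
    inputs = tokenizer(prompt, return_tensors="pt")
    with torch.no_grad():
        logits = compiled_model(**inputs).logits
```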
As mentioned in #30442, moving the `is_causal` dispatch logic from inline to an `if` statement is required to support both `fullgraph=True` and `dynamic=True` (see the sketch below). I kept the `q_len > 1` comments but could remove them if we want to match Llama, which doesn't have them.

cc @ArthurZucker and @fxmarty
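For reference, a sketch of the inline-to-`if` dispatch change described above, modeled on the Llama-style SDPA attention code; the function wrapper and variable names are illustrative, as the real change lives inside each model's SDPA attention `forward`:

```python
import torch
import torch.nn.functional as F

def sdpa_attention(query_states, key_states, value_states, causal_mask, q_len):
    # Previously the boolean was computed inline in the SDPA call:
    #   is_causal=causal_mask is None and q_len > 1
    # which breaks compilation with dynamic=True. Dispatching via an explicit
    # statement yields a plain Python bool and supports both fullgraph=True
    # and dynamic=True.
    is_causal = True if causal_mask is None and q_len > 1 else False
    return F.scaled_dot_product_attention(
        query_states,
        key_states,
        value_states,
        attn_mask=causal_mask,
        is_causal=is_causal,
    )
```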