
Conversation

@fmassa (Member) commented Aug 8, 2025:

This PR makes the bucket sizes for all-gather and reduce-scatter the same for 1D FSDP.

@fmassa fmassa requested a review from wconstab August 8, 2025 09:15
@meta-cla bot added the CLA Signed label (managed by the Meta Open Source bot) on Aug 8, 2025
```python
        (global_batch_size, job_config.training.seq_len),
        device=torch.device("cuda"),
    ),
    return (
```
@fmassa (Member, Author) commented:
Sorry for the unrelated lint changes, my editor decided to annoy me here

```python
assert parallel_dims.pp_enabled is False, "PP not supported yet"

torch._inductor.config.bucket_all_gathers_fx_bucket_size_determinator = (
    lambda bucket_idx: 500 / parallel_dims.tp
)
```
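The determinator in the snippet above returns a bucket size scaled inversely by the tensor-parallel degree, so the volume communicated per bucket stays constant as TP grows. A minimal, torch-free sketch of that logic (the base size of 500 and its unit are taken from the diff; the helper name `make_determinator` is illustrative, not part of torchtitan or Inductor):

```python
# Sketch of the bucket-size determinator used in the diff above.
# Assumption: 500 is the target bucket size (in MB) before dividing
# by the tensor-parallel degree `tp_degree`.
BASE_BUCKET_SIZE_MB = 500


def make_determinator(tp_degree: int):
    """Return a function mapping bucket index -> bucket size (MB).

    Registering the same determinator for both all-gather and
    reduce-scatter is what keeps their bucket sizes equal for 1D FSDP.
    """
    return lambda bucket_idx: BASE_BUCKET_SIZE_MB / tp_degree


# With tp=1 every bucket is 500 MB; with tp=2 each is 250 MB,
# regardless of the bucket index.
determinator = make_determinator(2)
print(determinator(0))  # 250.0
```

In the actual PR this callable is assigned to `torch._inductor.config.bucket_all_gathers_fx_bucket_size_determinator`; the equivalent reduce-scatter hook would be configured the same way so both collectives bucket identically.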
A Contributor commented:
We may want to consolidate at some point: the other Inductor configs that control which passes run, and in which modes, live in titan today, are CLI-driven, and could even change which bucketing pass is used.

The same Contributor followed up:
Oops, just realized this is torchtitan...

@fmassa fmassa merged commit 4712163 into autoparallel Aug 8, 2025
2 checks passed
@fmassa fmassa deleted the fmassa/fix_bucket_size branch August 8, 2025 13:29
