Pass inductor config for static cuda launcher to workers #153382
Conversation
🔗 Helpful Links
🧪 See artifacts and rendered test results at hud.pytorch.org/pr/153382
Note: Links to docs will display an error until the docs builds have been completed.
❗ 1 Active SEV: there is 1 currently active SEV. If your PR is affected, please view it below.
✅ You can merge normally! (2 unrelated failures) As of commit 7124256 with merge base dc47295:
FLAKY - The following job failed but was likely due to flakiness present on trunk.
BROKEN TRUNK - The following job failed, but the failure was already present on the merge base. 👉 Rebase onto the `viable/strict` branch to avoid these failures.
This comment was automatically generated by Dr. CI and updates every 15 minutes.
```python
# We can release this memory in the compile subprocesses:
linecache.clearcache()
return kernel, elapsed_ns // 1000

import torch
```
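(Background on the snippet above, not part of the PR: linecache caches entire source files in memory for traceback rendering, which a long-lived compile worker no longer needs once compilation is done. A minimal illustration:)

```python
import linecache

# Reading any line populates linecache's in-memory copy of the whole file.
first_line = linecache.getline(__file__, 1)

# Clearing the cache releases that memory, e.g. in a compile subprocess.
linecache.clearcache()
```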
@masnesral, is it okay to import torch here? I figure load_kernel, precompile, etc. will all import torch anyway, so there should be no difference.
Yeah, we already import torch in the subproc: https://fburl.com/code/0ncpgxr5. In the future it might be nice not to import all of torch, though, because it really slows down worker startup. So, nit: it's fine if we change to just import config more narrowly, right? IIUC that still imports all of torch, so it's effectively the same. But Jason was previously playing with a hack where he'd mock out torch so that it's possible to import torch.foo.bar without importing all of torch. Maybe we'll try something like that again in the future.
Ohh I wasn't aware that there was a difference (I had assumed that importing foo.bar.baz was equivalent to first importing foo then foo.bar then foo.bar.baz), but yes, I can change!
Oh yeah. I think it is the same. I meant that in the future I can imagine us doing some shenanigans with torch imports in the compile subprocess specifically, and mocking foo such that foo.bar.baz imports a mocked foo and the real foo.bar.baz.
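(For illustration only: a minimal sketch of that mocking idea, with hypothetical package names. This is not code from the PR.)

```python
import importlib
import sys
import types

def import_submodule_without_parent(parent_name, parent_dir, submodule):
    # Register a lightweight stub for the parent package so the import
    # system skips the parent's heavy __init__.py. `parent_dir` is the
    # on-disk directory containing the package's modules.
    stub = types.ModuleType(parent_name)
    stub.__path__ = [parent_dir]  # lets submodules still be located on disk
    sys.modules[parent_name] = stub
    return importlib.import_module(submodule)

# Hypothetical usage: load foo.bar.baz without running foo/__init__.py.
# baz = import_submodule_without_parent("foo", "/path/to/foo", "foo.bar.baz")
```

This only helps if the submodule itself doesn't import the rest of the parent package at module scope.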
masnesral left a comment
One nit about making the import slightly more friendly to potential future changes?
@pytorchbot merge
Merge started. Your change will be merged once all checks pass (ETA 0-4 hours). Learn more about merging in the wiki. Questions? Feedback? Please reach out to the PyTorch DevX Team.
Stack from ghstack (oldest at bottom):
Async compile workers generally don't respect inductor configs that change mid-execution, because the workers warm up early. StaticCudaLauncher is especially susceptible to this because it affects Triton compilation without being part of the inductor meta. So we pass it in via extra configs on each worker run.
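A minimal sketch of the idea, assuming the knob lives at torch._inductor.config.use_static_cuda_launcher; the function names (snapshot_extra_config, worker_compile) are illustrative, not the PR's actual API:

```python
import torch._inductor.config as inductor_config

def snapshot_extra_config():
    # Read the parent process's current value at submission time, so a
    # mid-run config change is observed even though the worker pool
    # warmed up (and cached its view of the config) earlier.
    return {"use_static_cuda_launcher": inductor_config.use_static_cuda_launcher}

def worker_compile(kernel_source, extra_config):
    # Runs inside the async compile subprocess: apply the parent's
    # config before kicking off Triton compilation, since this knob is
    # not part of the inductor meta that Triton already receives.
    for name, value in extra_config.items():
        setattr(inductor_config, name, value)
    ...  # proceed with compilation under the updated config
```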
cc @voznesenskym @penguinwu @EikanWang @jgong5 @Guobing-Chen @XiaobingSuper @zhuhaozhe @blzheng @wenzhe-nrv @jiayisunx @ipiszy @chenyang78 @kadeng @muchulee8 @amjames @chauhang @aakhundov