[compile] Switch off inference_mode for fake prop while compiling #149072
Conversation
🔗 Helpful Links
🧪 See artifacts and rendered test results at hud.pytorch.org/pr/149072
Note: Links to docs will display an error until the docs builds have been completed.
❌ 4 New Failures, 1 Unrelated Failure as of commit 78df81b with merge base dcc502f.
NEW FAILURES - The following jobs have failed:
BROKEN TRUNK - The following job failed but was already present on the merge base. 👉 Rebase onto the `viable/strict` branch to avoid these failures.
This comment was automatically generated by Dr. CI and updates every 15 minutes.
torch/_subclasses/meta_utils.py
Outdated
    id=self.get_tensor_id(t),
    storage=storage,
-   is_inference=t.is_inference(),
+   is_inference=False if DISABLE_INFERENCE_MODE else t.is_inference(),
This line is the main change.
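For context, a minimal sketch of what this masking affects (illustrative only; DISABLE_INFERENCE_MODE is the flag introduced further down in this diff):

    import torch

    # Tensors created under inference_mode carry the is_inference() bit:
    with torch.inference_mode():
        t = torch.randn(2)
    print(t.is_inference())  # True

    # The changed line records is_inference=False in the fake-tensor descriptor
    # for such tensors when the flag is set, so fake propagation during compile
    # treats them like ordinary tensors instead of inference tensors.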
torch/_subclasses/meta_utils.py
Outdated
    assert a == b, f"{a} != {b}"

+ DISABLE_INFERENCE_MODE = False
Nit: can we make this a real config somewhere, just in case someone actually needs to flip it if we end up breaking inference code in a subtle way?
I guess the annoying part is that we don't have a global config for fake tensor... Maybe in dynamo config?
Added a config. The behavior is controlled in convert_frame.py.
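A sketch of how such an escape hatch might be used, assuming it lands as a Dynamo config option (the flag name below is illustrative, not confirmed by this PR):

    import torch

    def fn(x):
        return x.sin()

    # Hypothetical flag name for illustration; torch._dynamo.config.patch is the
    # usual way to flip a Dynamo config option for a scoped region, e.g. to
    # restore the old behavior if masking inference_mode breaks a model.
    with torch._dynamo.config.patch(fake_tensor_disable_inference_mode=False):
        out = torch.compile(fn)(torch.randn(4))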
    unimplemented_v2(
        gb_type="Encountered torch.is_inference_mode_enabled during tracing",
        context="",
        explanation="torch.is_inference_mode_enabled() is not supported",
On the explanation: our claim is basically that if you are using compile, we want people to use no_grad instead of inference_mode, since it gives the same perf under compile and is more composable. Should we mention that in these explanations?
That's a better message. I can do that.
Updated the graph break hint messages.
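For reference, the pattern the updated hint steers users toward (a minimal sketch; `fn` is just a stand-in function):

    import torch

    def fn(x):
        return x.sin() + x.cos()

    compiled = torch.compile(fn)
    x = torch.randn(4)

    # Preferred under compile: no_grad gives the same performance as
    # inference_mode once the graph is compiled, and composes better with
    # tracing. Wrapping the compiled call in inference_mode is what the new
    # graph break hint discourages.
    with torch.no_grad():
        out = compiled(x)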
bdhirsh left a comment
🤞
Seems like with the last change, CI is broken everywhere. This requires more work :(
jansel left a comment
Re-request review once tests are passing
    cnts = torch._dynamo.testing.CompileCounter()
-   opt_fn = torch.compile(fn, backend=cnts, fullgraph=True)
+   opt_fn = torch.compile(fn, backend=cnts, fullgraph=False)
I mentioned this offline, but there is some is_inference switching code in flash-attention: https://github.com/Dao-AILab/flash-attention/blob/main/flash_attn/layers/rotary.py#L419
Parts of this codebase are fullgraph-compileable and parts aren't (I don't remember which), so we should check that we're not regressing anything here.
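For illustration, a sketch of the kind of is_inference()-dependent branching being referred to (not the actual flash-attention code); with the new flag, compiled fake prop would always see is_inference() as False and trace the non-inference branch:

    import torch

    def save_for_backward_if_needed(x: torch.Tensor) -> torch.Tensor:
        # Hypothetical helper: inference tensors can never require grad, so
        # autograd bookkeeping (e.g. cloning inputs for backward) is skipped.
        if x.is_inference():
            return x
        return x.clone()

    with torch.inference_mode():
        y = save_for_backward_if_needed(torch.randn(3))  # inference branch in eager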
Closing in favor of #149321
Stack from ghstack (oldest at bottom):
cc @H-Huang @awgu @kwen2501 @wanchaol @fegin @fduwjj @wz337 @wconstab @d4l3k @c-p-i-o @voznesenskym @penguinwu @EikanWang @jgong5 @Guobing-Chen @XiaobingSuper @zhuhaozhe @blzheng @wenzhe-nrv @jiayisunx @chenyang78 @kadeng @chauhang @amjames