Fix torch.compile FunctionalTensor inputs for higherOrderOps #107604
Conversation
🔗 Helpful Links: 🧪 See artifacts and rendered test results at hud.pytorch.org/pr/107604
Note: Links to docs will display an error until the docs builds have been completed.
✅ 1 Unrelated Failure as of commit 4b2410e with merge base 67bb3c0. UNSTABLE: the following job failed but was likely due to flakiness present on trunk and has been marked as unstable.
This comment was automatically generated by Dr. CI and updates every 15 minutes.
test/dynamo/test_subclasses.py
Outdated
```python
actual = normalize_gm(
    backend.graphs[exp_n_graph - 1].print_readable(print_output=False)
)
self.assertEqual(actual, exp_graph)
```
nit: can we assertExpectedInline somehow on this?
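Something along these lines, roughly; the expected string below is just a placeholder that expecttest would regenerate when the test is run with `EXPECTTEST_ACCEPT=1`:

```python
actual = normalize_gm(
    backend.graphs[exp_n_graph - 1].print_readable(print_output=False)
)
# Inline-expect variant: the second argument holds the expected graph text and
# can be auto-updated by running the test with EXPECTTEST_ACCEPT=1.
self.assertExpectedInline(
    actual,
    """\
class GraphModule(torch.nn.Module):
    ...
""",
)
```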
torch/_subclasses/fake_tensor.py
Outdated
```diff
-def is_fake(x):
-    if isinstance(x, FakeTensor):
+def is_fake(x, fake_mode=None):
```
I don't really understand why we need to pass in a fake_mode to is_fake. Are we saying that a FakeTensor might have a different fake_mode than expected?
Yes, this could happen when the input is a FakeTensor created outside of the compiled region, e.g. `make_fx(torch.func.functionalize(compiled_f))(x)`. But theoretically, FakeTensors should already be fakified by dynamo by the time we call the first `is_fake` right now. I guess it's mostly to add more safety checks and enforce an invariant that all fake tensors in dynamo have the same fake_mode as the current InstructionTranslator.
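A rough sketch of that scenario (not the exact test in this PR; `cond` is just a representative higherOrderOp, and this is the kind of path that used to hit the assertion error):

```python
import torch
from torch.fx.experimental.proxy_tensor import make_fx
from functorch.experimental.control_flow import cond  # representative higherOrderOp


@torch.compile(backend="eager", fullgraph=True)
def compiled_f(x):
    # The compiled region contains a higher-order op, so dynamo has to wrap
    # its inputs via wrap_fx_proxy with an example_value.
    return cond(x.sum() > 0, lambda x: x.sin(), lambda x: x.cos(), [x])


x = torch.randn(3)
# Tracing the compiled function under an outer functionalize/make_fx call means
# the tensors reaching dynamo are wrappers created outside its own fake mode.
gm = make_fx(torch.func.functionalize(compiled_f))(x)
```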
My problem right now is that the `is_fake(x, fake_mode)` API doesn't really make sense. If you want to uphold an invariant that all fake tensors have the same fake_mode as the current InstructionTranslator, then instead of complicating the `is_fake` check we should add an assertion somewhere (maybe post-fakification?) that all FakeTensors have the same fake mode.
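Roughly something like the following; the helper name and the exact hook point (`tx`, the collection of tensors to check) are illustrative, not a concrete proposal for where it should live:

```python
from torch._subclasses.fake_tensor import FakeTensor


def assert_single_fake_mode(tx, tensors):
    # Post-fakification sanity check: every FakeTensor that dynamo tracks must
    # belong to the current InstructionTranslator's fake mode.
    for t in tensors:
        if isinstance(t, FakeTensor):
            assert t.fake_mode is tx.fake_mode, (
                "found a FakeTensor created by a different FakeTensorMode "
                "than the current InstructionTranslator's"
            )
```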
Yeah, that also sounds good to me! I can change the implementation.
@pytorchbot merge

Merge started. Your change will be merged once all checks pass (ETA 0-4 hours). Learn more about merging in the wiki. Questions? Feedback? Please reach out to the PyTorch DevX Team.
Stack from ghstack (oldest at bottom):
Before this PR, the added test, which feeds FunctionalTensorWrapper inputs to a higherOrderOperator, hits an assertion error at this line of code.
The key change in this PR is this check:
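```python
elif (
    isinstance(example_value, FakeTensor)
    and example_value.fake_mode is tx.fake_mode
):
```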
The original intention seems to be handling the case where we wrap an fx proxy for an intermediate fake tensor that is produced by some tensor ops and an example value is provided (as is the case for higherOrderOps here). A fakified FunctionalTensorWrapper(FakeTensor) always fails this check. This PR changes the check to whether the example value is already fakified by tx.fake_mode.
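Conceptually, the new condition behaves like the helper sketched below. The name `is_fakified_by` is illustrative and not the actual code in the diff; it assumes the private helpers `torch._is_functional_tensor` and `torch._from_functional_tensor` for unwrapping a FunctionalTensorWrapper.

```python
import torch
from torch._subclasses.fake_tensor import FakeTensor


def is_fakified_by(t, fake_mode) -> bool:
    # "Already fakified by fake_mode" should also cover a FunctionalTensorWrapper
    # whose inner tensor is a FakeTensor owned by that mode, not just a plain
    # FakeTensor.
    if isinstance(t, FakeTensor):
        return t.fake_mode is fake_mode
    if isinstance(t, torch.Tensor) and torch._is_functional_tensor(t):
        inner = torch._from_functional_tensor(t)
        return isinstance(inner, FakeTensor) and inner.fake_mode is fake_mode
    return False
```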
cc @voznesenskym @penguinwu @anijain2305 @EikanWang @jgong5 @Guobing-Chen @XiaobingSuper @zhuhaozhe @blzheng @Xia-Weiwen @wenzhe-nrv @jiayisunx @ipiszy @chenyang78 @aakhundov