[inductor] Fix bugs in emulate_precision_casts #163520

jansel · 2025-09-22T15:02:18Z

Stack from ghstack (oldest at bottom):

cc @ezyang @EikanWang @jgong5 @wenzhe-nrv @voznesenskym @penguinwu @Guobing-Chen @XiaobingSuper @zhuhaozhe @blzheng @jiayisunx @ipiszy @chenyang78 @kadeng @muchulee8 @amjames @chauhang @aakhundov @coconutruben

[ghstack-poisoned]

pytorch-bot · 2025-09-22T15:02:22Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/163520

📄 Preview Python docs built from this PR
📄 Preview C++ docs built from this PR
❓ Need help or want to give feedback on the CI? Visit the bot commands wiki

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit 49b28d4 with merge base 51152ef ():
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

Fixes #163449 ghstack-source-id: 2c62fb2 Pull-Request: #163520

[ghstack-poisoned]

eellison

nice!

eellison · 2025-09-23T16:42:44Z

torch/fx/experimental/proxy_tensor.py

-    if (
-        not isinstance(func, torch._ops.OpOverload)
-        or torch.Tag.pointwise not in func.tags
-    ):


eellison · 2025-09-23T16:52:41Z

torch/fx/experimental/proxy_tensor.py

+    if not output_low_precision:
+        for input_node in last_node.all_input_nodes:
+            val = input_node.meta.get("val") if hasattr(input_node, "meta") else None
+            if isinstance(val, torch.Tensor) and val.dtype in low_pr_fp:
+                output_low_precision = True
+                break


Thinking out loud:

For something like

x: bfloat16 y = x.to(float32)

This would set low_precision_pointwise_barrier on the x.to(float32) output. I guess this is okay because its actual dtype later in lowering will be float32, so we'll ignore it.

And the decomps themselves should be upcasting intermediaries to be fp32, so those will also get ignored, e.g. gelu here.

[ghstack-poisoned]

pytorchmergebot · 2025-09-23T23:31:12Z

Starting merge as part of PR stack under #163482

Fixes #163457 Pull Request resolved: #163482 Approved by: https://github.com/eellison ghstack dependencies: #163386, #163398, #163387, #163414, #163415, #163419, #163434, #163393, #163412, #163422, #163481, #163520

This reverts commit a8cd437. See #163481 (comment) This PR might also cause issues with cudagraphs. Pull Request resolved: #163737 Approved by: https://github.com/ezyang ghstack dependencies: #163386, #163398, #163387, #163414, #163415, #163419, #163434, #163393, #163412, #163422, #163481, #163520, #163482

Fixes pytorch#163449 Pull Request resolved: pytorch#163520 Approved by: https://github.com/eellison ghstack dependencies: pytorch#163386, pytorch#163398, pytorch#163387, pytorch#163414, pytorch#163415, pytorch#163419, pytorch#163434, pytorch#163393, pytorch#163412, pytorch#163422, pytorch#163481

Fixes pytorch#163457 Pull Request resolved: pytorch#163482 Approved by: https://github.com/eellison ghstack dependencies: pytorch#163386, pytorch#163398, pytorch#163387, pytorch#163414, pytorch#163415, pytorch#163419, pytorch#163434, pytorch#163393, pytorch#163412, pytorch#163422, pytorch#163481, pytorch#163520

This reverts commit a8cd437. See pytorch#163481 (comment) This PR might also cause issues with cudagraphs. Pull Request resolved: pytorch#163737 Approved by: https://github.com/ezyang ghstack dependencies: pytorch#163386, pytorch#163398, pytorch#163387, pytorch#163414, pytorch#163415, pytorch#163419, pytorch#163434, pytorch#163393, pytorch#163412, pytorch#163422, pytorch#163481, pytorch#163520, pytorch#163482

Fixes #163449 Pull Request resolved: #163520 Approved by: https://github.com/eellison ghstack dependencies: #163386, #163398, #163387, #163414, #163415, #163419, #163434, #163393, #163412, #163422, #163481

Fixes #163457 Pull Request resolved: #163482 Approved by: https://github.com/eellison ghstack dependencies: #163386, #163398, #163387, #163414, #163415, #163419, #163434, #163393, #163412, #163422, #163481, #163520

This reverts commit a8cd437. See #163481 (comment) This PR might also cause issues with cudagraphs. Pull Request resolved: #163737 Approved by: https://github.com/ezyang ghstack dependencies: #163386, #163398, #163387, #163414, #163415, #163419, #163434, #163393, #163412, #163422, #163481, #163520, #163482

Fixes #163449 ghstack-source-id: e173d84 Pull-Request: pytorch/pytorch#163520

Update

5405cea

[ghstack-poisoned]

pytorch-bot bot added ciflow/inductor module: inductor release notes: fx release notes category labels Sep 22, 2025

jansel mentioned this pull request Sep 22, 2025

Better decomp for torch.eye #163386

Closed

jansel added a commit that referenced this pull request Sep 22, 2025

[inductor] Fix bugs in emulate_precision_casts

ebeaaaf

Fixes #163449 ghstack-source-id: 2c62fb2 Pull-Request: #163520

jansel mentioned this pull request Sep 22, 2025

[inductor] Support out_dtype arg to matmul #163393

Closed

facebook-github-bot added the fx label Sep 22, 2025

Update

f0ce460

[ghstack-poisoned]

jansel requested a review from eellison September 22, 2025 15:13

Update

c9d16b1

[ghstack-poisoned]

jansel mentioned this pull request Sep 22, 2025

Update conv1d meta kernel to match eager #163584

Closed

eellison approved these changes Sep 23, 2025

View reviewed changes

Update

49b28d4

[ghstack-poisoned]

pytorchmergebot added the Merged label Sep 24, 2025

pytorchmergebot closed this in 6fa9727 Sep 24, 2025

jeffdaily mentioned this pull request Sep 24, 2025

DISABLED test_emulate_precision_casts_mean_ratio_chain (__main__.CudaReproTests) #163765

Open

github-actions bot deleted the gh/jansel/546/head branch October 25, 2025 02:12

Khanaksahu pushed a commit to Khanaksahu/pytorch that referenced this pull request Nov 17, 2025

[inductor] Fix bugs in emulate_precision_casts

25510d0

Fixes #163449 ghstack-source-id: e173d84 Pull-Request: pytorch/pytorch#163520

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[inductor] Fix bugs in emulate_precision_casts #163520

[inductor] Fix bugs in emulate_precision_casts #163520

Uh oh!

jansel commented Sep 22, 2025 •

edited by pytorch-bot bot

Loading

Uh oh!

pytorch-bot bot commented Sep 22, 2025 •

edited

Loading

Uh oh!

eellison left a comment

Uh oh!

eellison Sep 23, 2025

Uh oh!

eellison Sep 23, 2025

Uh oh!

pytorchmergebot commented Sep 23, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

[inductor] Fix bugs in emulate_precision_casts #163520

[inductor] Fix bugs in emulate_precision_casts #163520

Uh oh!

Conversation

jansel commented Sep 22, 2025 • edited by pytorch-bot bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

pytorch-bot bot commented Sep 22, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/163520

✅ No Failures

Uh oh!

eellison left a comment

Choose a reason for hiding this comment

Uh oh!

eellison Sep 23, 2025

Choose a reason for hiding this comment

Uh oh!

eellison Sep 23, 2025

Choose a reason for hiding this comment

Uh oh!

pytorchmergebot commented Sep 23, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

jansel commented Sep 22, 2025 •

edited by pytorch-bot bot

Loading

pytorch-bot bot commented Sep 22, 2025 •

edited

Loading