Support more foreach ops for tensor beta support #134170

mlazos · 2024-08-21T22:42:18Z

Add more foreach ops so we don't have fallbacks.

Stack from ghstack (oldest at bottom):

cc @voznesenskym @penguinwu @EikanWang @jgong5 @Guobing-Chen @XiaobingSuper @zhuhaozhe @blzheng @wenzhe-nrv @jiayisunx @ipiszy @yf225 @chenyang78 @kadeng @muchulee8 @ColinPeppler @amjames @desertfire @chauhang

[ghstack-poisoned]

pytorch-bot · 2024-08-21T22:42:21Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/134170

📄 Preview Python docs built from this PR
📄 Preview C++ docs built from this PR
❓ Need help or want to give feedback on the CI? Visit the bot commands wiki or our office hours

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit f87e590 with merge base a1b22e3 ():
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

eellison

wonder if we could do this more programatically..

mlazos · 2024-08-22T06:54:44Z

wonder if we could do this more programatically..

Hmm I'm not sure, I think for testing I could use the op info tests for for each. For this I'm not sure mainly because there might be patterns or weird overloads that aren't support yet. Which is why we've incrementally added. This is kind of a consequence of the foreach API itself which has been incrementally updated as needed and not all overloads are supported for every op.

eellison

meant to accept

eellison · 2024-08-22T16:08:35Z

Like, would it be sufficient to register

for op in foreach:
    equiv_aten_op = op.name[op.name.find("aten._foreach_"):]
    register_foreach_pointwise(op, lowerings[getattr(aten, equiv_aten_op)])

mlazos · 2024-08-22T18:02:27Z

Like, would it be sufficient to register


for op in foreach:

    equiv_aten_op = op.name[op.name.find("aten._foreach_"):]

    register_foreach_pointwise(op, lowerings[getattr(aten, equiv_aten_op)])

The issue is I also need to generate tests for them, but let me see if I can use some sort of operator schema to generate the tests (which would be pretty cool)

Add more foreach ops so we don't have fallbacks. cc voznesenskym penguinwu EikanWang jgong5 Guobing-Chen XiaobingSuper zhuhaozhe blzheng wenzhe-nrv jiayisunx ipiszy yf225 chenyang78 kadeng muchulee8 ColinPeppler amjames desertfire chauhang [ghstack-poisoned]

mlazos · 2024-08-30T18:35:02Z

@pytorchbot merge

Add more foreach ops so we don't have fallbacks. cc voznesenskym penguinwu EikanWang jgong5 Guobing-Chen XiaobingSuper zhuhaozhe blzheng wenzhe-nrv jiayisunx ipiszy yf225 chenyang78 kadeng muchulee8 ColinPeppler amjames desertfire chauhang [ghstack-poisoned]

mlazos · 2024-10-16T01:01:11Z

@pytorchbot merge

pytorchmergebot · 2024-10-16T01:02:53Z

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging

Check the merge workflow status
here

pytorchmergebot · 2024-10-16T02:04:50Z

Merge failed

Reason: 4 mandatory check(s) failed. The first few are:

pull / linux-focal-cuda12.1-py3.10-gcc9-sm86 / test (default, 2, 5, lf.linux.g5.4xlarge.nvidia.gpu)
pull / linux-focal-cuda12.1-py3.10-gcc9-sm86 / test (default, 3, 5, lf.linux.g5.4xlarge.nvidia.gpu)
pull / linux-focal-cuda12.1-py3.10-gcc9-sm86 / test (default, 4, 5, lf.linux.g5.4xlarge.nvidia.gpu)
pull / linux-focal-cuda12.1-py3.10-gcc9-sm86 / test (default, 5, 5, lf.linux.g5.4xlarge.nvidia.gpu)

Dig deeper by viewing the failures on hud

Details for Dev Infra team

Raised by workflow job

Failing merge rule: Core Maintainers

Add more foreach ops so we don't have fallbacks. cc voznesenskym penguinwu EikanWang jgong5 Guobing-Chen XiaobingSuper zhuhaozhe blzheng wenzhe-nrv jiayisunx ipiszy yf225 chenyang78 kadeng muchulee8 ColinPeppler amjames desertfire chauhang [ghstack-poisoned]

mlazos · 2024-10-16T09:03:50Z

@pytorchbot merge

pytorchmergebot · 2024-10-16T09:05:51Z

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging

Check the merge workflow status
here

pytorchmergebot · 2024-10-16T15:04:30Z

The merge job was canceled or timed out. This most often happen if two merge requests were issued for the same PR, or if merge job was waiting for more than 6 hours for tests to finish. In later case, please do not hesitate to reissue the merge command
For more information see pytorch-bot wiki.

mlazos · 2024-10-16T17:26:34Z

@pytorchbot merge

pytorchmergebot · 2024-10-16T17:28:14Z

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging

Check the merge workflow status
here

pytorchmergebot · 2024-10-16T23:26:56Z

The merge job was canceled or timed out. This most often happen if two merge requests were issued for the same PR, or if merge job was waiting for more than 6 hours for tests to finish. In later case, please do not hesitate to reissue the merge command
For more information see pytorch-bot wiki.

mlazos · 2024-10-17T17:49:07Z

@pytorchbot merge

pytorchmergebot · 2024-10-17T17:51:20Z

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging

Check the merge workflow status
here

# Motivation Fix #138577. # Solution 1. All UTs in `test/inductor/test_compiled_optimizers.py` are fixed by #134170 2. UT in `test/inductor/test_pattern_matcher.py` is introduced by #138089, we will skip this UT due to the unsupported feature `max_autotune_gemm_backends:Triton`. 3. We have a new impl related to `histc`, so we remove the expected failure from `test/inductor/test_torchinductor_opinfo.py` 4. We support `avg_pool3d` for `fp16` data type, so we remove the expected failure from `test/inductor/test_torchinductor_opinfo.py` 5. CUDA-bias code is introduced by #138472, we just generalize it to `GPU_TYPE`. # Additional Context > Why update torch-xpu-ops commit pin here? We have to update commit pin to avoid the build failure raised by the code change [C10_UNUSED](#138364). > What does the feature of torch-xpu-ops update? 1. Add some foreach ops, like `unary ops` and `foreach_clamp_max` etc; 2. Add some maxpool ops forward and backward, like `averge_pool3d` and `max_pool3d` 3. Add some other ops, like `log_normal_`, `index_copy`, and `mode` etc; 4. fix build failure related to `C10_UNUSED`; Pull Request resolved: #138548 Approved by: https://github.com/malfet, https://github.com/EikanWang

ghstack-source-id: 84d4ab8 Pull Request resolved: pytorch/pytorch#134170 Fix test counts fix counts

Support more foreach ops for tensor beta support

f8c9b7c

[ghstack-poisoned]

pytorch-bot bot added ciflow/inductor module: inductor labels Aug 21, 2024

mlazos requested a review from eellison August 21, 2024 22:55

mlazos added the release notes: inductor label Aug 21, 2024

eellison reviewed Aug 21, 2024

View reviewed changes

eellison approved these changes Aug 22, 2024

View reviewed changes

mlazos added 2 commits August 27, 2024 09:27

mlazos added 3 commits August 29, 2024 03:48

pytorchmergebot added the merging label Oct 16, 2024

pytorchmergebot removed the merging label Oct 16, 2024

pytorchmergebot added the merging label Oct 16, 2024

pytorchmergebot added the Merged label Oct 17, 2024

pytorchmergebot closed this in 0b2c12c Oct 17, 2024

pytorchmergebot removed the merging label Oct 17, 2024

This was referenced Oct 22, 2024

[CI] Fix XPU CI failure #138548

Closed

[Break XPU] C10_UNUSED change and newly add Inductor UTs break XPU CI. #138577

Closed

github-actions bot deleted the gh/mlazos/76/head branch November 17, 2024 02:12

desai0007 pushed a commit to desai0007/test-repo-pytorch that referenced this pull request Feb 26, 2025

Support more foreach ops for tensor beta support

eaa026c

ghstack-source-id: 84d4ab8 Pull Request resolved: pytorch/pytorch#134170 Fix test counts fix counts

Support more foreach ops for tensor beta support #134170

Support more foreach ops for tensor beta support #134170

Uh oh!

Conversation

mlazos commented Aug 21, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

pytorch-bot bot commented Aug 21, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/134170

✅ No Failures

Uh oh!

eellison left a comment

Choose a reason for hiding this comment

Uh oh!

mlazos commented Aug 22, 2024

Uh oh!

eellison left a comment

Choose a reason for hiding this comment

Uh oh!

eellison commented Aug 22, 2024

Uh oh!

mlazos commented Aug 22, 2024

Uh oh!

mlazos commented Aug 30, 2024

Uh oh!

mlazos commented Oct 16, 2024

Uh oh!

pytorchmergebot commented Oct 16, 2024

Merge started

Uh oh!

pytorchmergebot commented Oct 16, 2024

Merge failed

Uh oh!

mlazos commented Oct 16, 2024

Uh oh!

pytorchmergebot commented Oct 16, 2024

Merge started

Uh oh!

pytorchmergebot commented Oct 16, 2024

Uh oh!

mlazos commented Oct 16, 2024

Uh oh!

pytorchmergebot commented Oct 16, 2024

Merge started

Uh oh!

pytorchmergebot commented Oct 16, 2024

Uh oh!

mlazos commented Oct 17, 2024

Uh oh!

pytorchmergebot commented Oct 17, 2024

Merge started

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

mlazos commented Aug 21, 2024 •

edited

Loading

pytorch-bot bot commented Aug 21, 2024 •

edited

Loading