Skip to content

Conversation

@bobrenjc93
Copy link
Contributor

@bobrenjc93 bobrenjc93 commented Oct 25, 2024

[ghstack-poisoned]
@pytorch-bot
Copy link

pytorch-bot bot commented Oct 25, 2024

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/138922

Note: Links to docs will display an error until the docs builds have been completed.

❌ 30 New Failures, 1 Unrelated Failure

As of commit 794f6a4 with merge base c3fbec7 (image):

NEW FAILURES - The following jobs have failed:

FLAKY - The following job failed but was likely due to flakiness present on trunk:

This comment was automatically generated by Dr. CI and updates every 15 minutes.

bobrenjc93 added a commit that referenced this pull request Oct 25, 2024
ghstack-source-id: 284179d
Pull Request resolved: #138922
@bobrenjc93
Copy link
Contributor Author

@pytorchbot rebase -b main

@pytorchmergebot
Copy link
Collaborator

@pytorchbot started a rebase job onto refs/remotes/origin/main. Check the current status here

[ghstack-poisoned]
@pytorchmergebot
Copy link
Collaborator

Successfully rebased gh/bobrenjc93/90/orig onto refs/remotes/origin/main, please pull locally before adding more changes (for example, via ghstack checkout https://github.com/pytorch/pytorch/pull/138922)

pytorchmergebot pushed a commit that referenced this pull request Oct 31, 2024
ghstack-source-id: 9919435
Pull Request resolved: #138922
cc voznesenskym penguinwu EikanWang jgong5 Guobing-Chen XiaobingSuper zhuhaozhe blzheng wenzhe-nrv jiayisunx chenyang78 kadeng chauhang amjames rec

[ghstack-poisoned]
cc voznesenskym penguinwu EikanWang jgong5 Guobing-Chen XiaobingSuper zhuhaozhe blzheng wenzhe-nrv jiayisunx chenyang78 kadeng chauhang amjames rec

[ghstack-poisoned]
cc voznesenskym penguinwu EikanWang jgong5 Guobing-Chen XiaobingSuper zhuhaozhe blzheng wenzhe-nrv jiayisunx chenyang78 kadeng chauhang amjames rec

[ghstack-poisoned]
cc voznesenskym penguinwu EikanWang jgong5 Guobing-Chen XiaobingSuper zhuhaozhe blzheng wenzhe-nrv jiayisunx chenyang78 kadeng chauhang amjames rec

[ghstack-poisoned]
cc voznesenskym penguinwu EikanWang jgong5 Guobing-Chen XiaobingSuper zhuhaozhe blzheng wenzhe-nrv jiayisunx chenyang78 kadeng chauhang amjames rec

[ghstack-poisoned]
bobrenjc93 added a commit that referenced this pull request Nov 1, 2024
ghstack-source-id: f97721c
Pull Request resolved: #138922
cc voznesenskym penguinwu EikanWang jgong5 Guobing-Chen XiaobingSuper zhuhaozhe blzheng wenzhe-nrv jiayisunx chenyang78 kadeng chauhang amjames rec

[ghstack-poisoned]
cc voznesenskym penguinwu EikanWang jgong5 Guobing-Chen XiaobingSuper zhuhaozhe blzheng wenzhe-nrv jiayisunx chenyang78 kadeng chauhang amjames rec

[ghstack-poisoned]
@yf225
Copy link
Contributor

yf225 commented Nov 25, 2024

@bobrenjc93 I believe this might be causing perf regression on torchbench: https://hud.pytorch.org/benchmark/torchbench/inductor_no_cudagraphs?dashboard=torchinductor&startTime=Mon,%2018%20Nov%202024%2022:00:43%20GMT&stopTime=Mon,%2025%20Nov%202024%2022:00:43%20GMT&granularity=hour&mode=training&dtype=amp&deviceName=cuda%20(a100)&lBranch=gh/bobrenjc93/90/head&lCommit=e67d26992e8a473aec6ede13b8cb928208c3df34&rBranch=main&rCommit=1bdb92cbff6f8f6fd1842e8fd9969bca688eaea3

I also bisected on torchbench llama model (on A100 devvm):
python benchmarks/dynamo/torchbench.py --performance --cold-start-latency --inductor --disable-cudagraphs --device cuda --inference --bfloat16 --print-memory --output output.csv --only llama --batch-size 32
5268754 (11/22 nightly): 2.177902x
995e307: 2.196x
c513f01: 2.198x
11c786d: 2.160x
ba5253d: 1.748x <- this PR
51b6126 (11/23 nightly): 1.781900x

@yf225
Copy link
Contributor

yf225 commented Nov 25, 2024

Discussed in Inductor group chat and the group agrees to revert this PR

@yf225
Copy link
Contributor

yf225 commented Nov 26, 2024

@pytorchbot revert -m "perf regression on torchbench"

@pytorch-bot
Copy link

pytorch-bot bot commented Nov 26, 2024

❌ 🤖 pytorchbot command failed:

@pytorchbot revert: error: the following arguments are required: -c/--classification

usage: @pytorchbot revert -m MESSAGE -c
                          {nosignal,ignoredsignal,landrace,weird,ghfirst}

Try @pytorchbot --help for more info.

@yf225
Copy link
Contributor

yf225 commented Nov 26, 2024

@pytorchbot revert -m "perf regression on torchbench" -c nosignal

@pytorchmergebot
Copy link
Collaborator

@pytorchbot successfully started a revert job. Check the current status here.
Questions? Feedback? Please reach out to the PyTorch DevX Team

pytorchmergebot added a commit that referenced this pull request Nov 26, 2024
@pytorchmergebot
Copy link
Collaborator

@bobrenjc93 your PR has been successfully reverted.

[ghstack-poisoned]
bobrenjc93 added a commit that referenced this pull request Nov 26, 2024
ghstack-source-id: bb8d10a
Pull Request resolved: #138922
@ezyang
Copy link
Contributor

ezyang commented Nov 26, 2024

Endorsing the NN module spec changes, although we will have to carefully check if this actually fixes all the benchmarks

@bobrenjc93 bobrenjc93 closed this Nov 30, 2024
Ryo-not-rio pushed a commit to Ryo-not-rio/pytorch that referenced this pull request Dec 2, 2024
Ryo-not-rio pushed a commit to Ryo-not-rio/pytorch that referenced this pull request Dec 2, 2024
This reverts commit 6d779d0.

Reverted pytorch#138922 on behalf of https://github.com/huydhn due to Sorry for reverting your change but there is some slow tests failing after this land ([comment](pytorch#138922 (comment)))
Ryo-not-rio pushed a commit to Ryo-not-rio/pytorch that referenced this pull request Dec 2, 2024
Ryo-not-rio pushed a commit to Ryo-not-rio/pytorch that referenced this pull request Dec 2, 2024
pobin6 pushed a commit to pobin6/pytorch that referenced this pull request Dec 5, 2024
pobin6 pushed a commit to pobin6/pytorch that referenced this pull request Dec 5, 2024
This reverts commit 6d779d0.

Reverted pytorch#138922 on behalf of https://github.com/huydhn due to Sorry for reverting your change but there is some slow tests failing after this land ([comment](pytorch#138922 (comment)))
pobin6 pushed a commit to pobin6/pytorch that referenced this pull request Dec 5, 2024
pobin6 pushed a commit to pobin6/pytorch that referenced this pull request Dec 5, 2024
Esquains pushed a commit to Esquains/study1 that referenced this pull request Dec 15, 2024
ghstack-source-id: 27c7ecc
Pull Request resolved: pytorch/pytorch#138922
Esquains pushed a commit to Esquains/study1 that referenced this pull request Dec 15, 2024
ghstack-source-id: e83c7fd
Pull Request resolved: pytorch/pytorch#138922
Esquains pushed a commit to Esquains/study1 that referenced this pull request Dec 15, 2024
ghstack-source-id: 3abea0f
Pull Request resolved: pytorch/pytorch#138922
@github-actions github-actions bot deleted the gh/bobrenjc93/90/head branch December 31, 2024 02:04
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

ci-no-td Do not run TD on this PR ciflow/inductor ciflow/slow ciflow/trunk Trigger trunk jobs on your pull request keep-going Don't stop on first failure, keep running tests until the end Merged module: dynamo module: inductor Reverted topic: not user facing topic category

Projects

None yet

Development

Successfully merging this pull request may close these issues.

7 participants