Skip to content

Conversation

@huydhn
Copy link
Contributor

@huydhn huydhn commented Nov 23, 2024

Binary build is failing in trunk after #139206 lands, for example, https://github.com/pytorch/pytorch/actions/runs/11981181986/job/33410250461#step:17:539. It's a bit tricky to spot the issue but the difference is between 3.2.0+35c6c7c628 set by PyTorch and 3.2.0+git35c6c7c6 from triton (look closely one has the length of 10, the other of 8 characters)

Triton now has its own nightly build logic in triton-lang/triton#4812 that takes only 8 characters by default while the original logic from PT took 10. So, PT nightly couldn't find the dependency.

@huydhn huydhn requested review from a team and atalman November 23, 2024 01:30
@pytorch-bot
Copy link

pytorch-bot bot commented Nov 23, 2024

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/141410

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit 5b7817c with merge base eb954ef (image):
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@pytorch-bot pytorch-bot bot added the topic: not user facing topic category label Nov 23, 2024
@huydhn huydhn requested a review from malfet November 23, 2024 01:30
@huydhn huydhn changed the title Fix pytorch-triton nightly checksum suffix Fix pytorch-triton nightly checksum shorthand Nov 23, 2024
@seemethere
Copy link
Member

@pytorchbot merge

@pytorch-bot pytorch-bot bot added the ciflow/trunk Trigger trunk jobs on your pull request label Nov 23, 2024
@pytorchmergebot
Copy link
Collaborator

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging
Check the merge workflow status
here

@malfet
Copy link
Contributor

malfet commented Nov 23, 2024

@pytorchbot merge -f "Let's not wait for trunk"

@pytorchmergebot
Copy link
Collaborator

The merge job was canceled or timed out. This most often happen if two merge requests were issued for the same PR, or if merge job was waiting for more than 6 hours for tests to finish. In later case, please do not hesitate to reissue the merge command
For more information see pytorch-bot wiki.

@pytorchmergebot
Copy link
Collaborator

Merge started

Your change will be merged immediately since you used the force (-f) flag, bypassing any CI checks (ETA: 1-5 minutes). Please use -f as last resort and instead consider -i/--ignore-current to continue the merge ignoring current failures. This will allow currently pending tests to finish and report signal before the merge.

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging
Check the merge workflow status
here

@huydhn
Copy link
Contributor Author

huydhn commented Nov 23, 2024

@pytorchbot revert -m 'There is still a small tweak that I need to do 35c6c7c628 is now git35c6c7c6 so a prefix is needed, going to revert and reland this' -c ignoredsignal

@pytorchmergebot
Copy link
Collaborator

@pytorchbot successfully started a revert job. Check the current status here.
Questions? Feedback? Please reach out to the PyTorch DevX Team

pytorchmergebot added a commit that referenced this pull request Nov 23, 2024
This reverts commit 9f8a191.

Reverted #141410 on behalf of https://github.com/huydhn due to There is still a small tweak that I need to do 35c6c7c628 is now git35c6c7c6 so a prefix is needed, going to revert and reland this ([comment](#141410 (comment)))
@pytorchmergebot
Copy link
Collaborator

@huydhn your PR has been successfully reverted.

@pytorchmergebot pytorchmergebot added Reverted ci-no-td Do not run TD on this PR labels Nov 23, 2024
@huydhn huydhn changed the title Fix pytorch-triton nightly checksum shorthand Fix pytorch-triton nightly checksum shorthash Nov 23, 2024
@huydhn huydhn removed the ci-no-td Do not run TD on this PR label Nov 23, 2024
@huydhn
Copy link
Contributor Author

huydhn commented Nov 23, 2024

@pytorchbot merge -f 'Land and run in trunk to avoid ephemeral queue'

@pytorchmergebot
Copy link
Collaborator

Merge started

Your change will be merged immediately since you used the force (-f) flag, bypassing any CI checks (ETA: 1-5 minutes). Please use -f as last resort and instead consider -i/--ignore-current to continue the merge ignoring current failures. This will allow currently pending tests to finish and report signal before the merge.

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging
Check the merge workflow status
here

@bertmaher
Copy link
Contributor

@huydhn Thank you for fixing this! 🙏

pobin6 pushed a commit to pobin6/pytorch that referenced this pull request Dec 5, 2024
Binary build is failing in trunk after pytorch#139206 lands, for example, https://github.com/pytorch/pytorch/actions/runs/11981181986/job/33410250461#step:17:539.  It's a bit tricky to spot the issue but the difference is between `3.2.0+35c6c7c628` set by PyTorch and `3.2.0+git35c6c7c6` from triton (look closely one has the length of 10, the other of 8 characters)

Triton now has its own nightly build logic in triton-lang/triton#4812 that takes only 8 characters by default while the original logic from PT took 10. So, PT nightly couldn't find the dependency.
Pull Request resolved: pytorch#141410
Approved by: https://github.com/seemethere, https://github.com/malfet
pobin6 pushed a commit to pobin6/pytorch that referenced this pull request Dec 5, 2024
This reverts commit 9f8a191.

Reverted pytorch#141410 on behalf of https://github.com/huydhn due to There is still a small tweak that I need to do 35c6c7c628 is now git35c6c7c6 so a prefix is needed, going to revert and reland this ([comment](pytorch#141410 (comment)))
pobin6 pushed a commit to pobin6/pytorch that referenced this pull request Dec 5, 2024
Binary build is failing in trunk after pytorch#139206 lands, for example, https://github.com/pytorch/pytorch/actions/runs/11981181986/job/33410250461#step:17:539.  It's a bit tricky to spot the issue but the difference is between `3.2.0+35c6c7c628` set by PyTorch and `3.2.0+git35c6c7c6` from triton (look closely one has the length of 10, the other of 8 characters)

Triton now has its own nightly build logic in triton-lang/triton#4812 that takes only 8 characters by default while the original logic from PT took 10. So, PT nightly couldn't find the dependency.
Pull Request resolved: pytorch#141410
Approved by: https://github.com/seemethere, https://github.com/malfet
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

Projects

None yet

Development

Successfully merging this pull request may close these issues.

5 participants