Skip to content

Conversation

@jaglinux
Copy link
Contributor

@jaglinux jaglinux commented Nov 16, 2020

The PR #47898 fixes the global tests. Hence enabling the tests.

Signed-off-by: Jagadish Krishnamoorthy [email protected]

The PR pytorch#47898 fixes the global tests. Hence enabling the tests.

Signed-off-by: Jagadish Krishnamoorthy <[email protected]>
@facebook-github-bot facebook-github-bot added cla signed oncall: distributed Add this issue/PR to distributed oncall triage queue labels Nov 16, 2020
@jaglinux
Copy link
Contributor Author

jaglinux commented Nov 16, 2020

@jeffdaily @pruthvistony @KyleCZH please review.

@jeffdaily jeffdaily added the module: rocm AMD GPU support for Pytorch label Nov 16, 2020
@jeffdaily
Copy link
Collaborator

LGTM. Let's confirm with the ROCm CI results.

Copy link
Contributor

@rohan-varma rohan-varma left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

For context, I think the distributed multi-gpu tests were disabled as part of #47703, we are currently working on re-enabling them after which we should get good additional signal for this PR. We are checking if we can re-enable after the earlier fix in #47639.

@codecov
Copy link

codecov bot commented Nov 16, 2020

Codecov Report

Merging #48023 (15dfdcf) into master (9aaf7fb) will decrease coverage by 0.15%.
The diff coverage is n/a.

@@            Coverage Diff             @@
##           master   #48023      +/-   ##
==========================================
- Coverage   81.30%   81.14%   -0.16%     
==========================================
  Files        1839     1839              
  Lines      198444   198440       -4     
==========================================
- Hits       161342   161028     -314     
- Misses      37102    37412     +310     

@gchanan gchanan added the triaged This issue has been looked at a team member, and triaged and prioritized into an appropriate module label Nov 17, 2020
@jaglinux
Copy link
Contributor Author

Sure, test_all_reduce_sum_cuda_async test was disabled as part of ROCm CI failure.
We can wait for #47639 to be merged.

@jaglinux
Copy link
Contributor Author

jaglinux commented Dec 4, 2020

@rohan-varma can you please approve the PR ?

@rohan-varma rohan-varma self-requested a review December 4, 2020 23:09
Copy link
Contributor

@rohan-varma rohan-varma left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks for the ping, LGTM

Copy link
Contributor

@facebook-github-bot facebook-github-bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@rohan-varma has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

@facebook-github-bot
Copy link
Contributor

@rohan-varma merged this pull request in 03abd81.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

cla signed Merged module: rocm AMD GPU support for Pytorch oncall: distributed Add this issue/PR to distributed oncall triage queue open source triaged This issue has been looked at a team member, and triaged and prioritized into an appropriate module

Projects

None yet

Development

Successfully merging this pull request may close these issues.

6 participants