[ROCm] Enable skipped distributed global tests #48023

jaglinux · 2020-11-16T18:10:53Z

The PR #47898 fixes the global tests. Hence enabling the tests.

Signed-off-by: Jagadish Krishnamoorthy [email protected]

The PR pytorch#47898 fixes the global tests. Hence enabling the tests. Signed-off-by: Jagadish Krishnamoorthy <[email protected]>

jaglinux · 2020-11-16T18:12:58Z

@jeffdaily @pruthvistony @KyleCZH please review.

jeffdaily · 2020-11-16T18:23:44Z

LGTM. Let's confirm with the ROCm CI results.

rohan-varma

For context, I think the distributed multi-gpu tests were disabled as part of #47703, we are currently working on re-enabling them after which we should get good additional signal for this PR. We are checking if we can re-enable after the earlier fix in #47639.

codecov · 2020-11-16T21:13:19Z

Codecov Report

Merging #48023 (15dfdcf) into master (9aaf7fb) will decrease coverage by 0.15%.
The diff coverage is n/a.

@@            Coverage Diff             @@
##           master   #48023      +/-   ##
==========================================
- Coverage   81.30%   81.14%   -0.16%     
==========================================
  Files        1839     1839              
  Lines      198444   198440       -4     
==========================================
- Hits       161342   161028     -314     
- Misses      37102    37412     +310

jaglinux · 2020-11-17T18:21:23Z

Sure, test_all_reduce_sum_cuda_async test was disabled as part of ROCm CI failure.
We can wait for #47639 to be merged.

jaglinux · 2020-12-04T20:31:33Z

@rohan-varma can you please approve the PR ?

rohan-varma

Thanks for the ping, LGTM

facebook-github-bot

@rohan-varma has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

facebook-github-bot · 2020-12-05T03:18:01Z

@rohan-varma merged this pull request in 03abd81.

[ROCm] Enable skipped distributed global tests

15dfdcf

The PR pytorch#47898 fixes the global tests. Hence enabling the tests. Signed-off-by: Jagadish Krishnamoorthy <[email protected]>

jaglinux requested review from mrshenli, pritamdamania87, rohan-varma and zhaojuanmao as code owners November 16, 2020 18:10

facebook-github-bot added cla signed oncall: distributed Add this issue/PR to distributed oncall triage queue labels Nov 16, 2020

pytorchbot added the open source label Nov 16, 2020

jeffdaily added the module: rocm AMD GPU support for Pytorch label Nov 16, 2020

rohan-varma reviewed Nov 16, 2020

View reviewed changes

gchanan added the triaged This issue has been looked at a team member, and triaged and prioritized into an appropriate module label Nov 17, 2020

rohan-varma self-requested a review December 4, 2020 23:09

rohan-varma approved these changes Dec 4, 2020

View reviewed changes

facebook-github-bot reviewed Dec 4, 2020

View reviewed changes

facebook-github-bot closed this in 03abd81 Dec 5, 2020

facebook-github-bot added the Merged label Dec 5, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[ROCm] Enable skipped distributed global tests #48023

[ROCm] Enable skipped distributed global tests #48023

Uh oh!

jaglinux commented Nov 16, 2020 •

edited by jeffdaily

Loading

Uh oh!

jaglinux commented Nov 16, 2020 •

edited

Loading

Uh oh!

jeffdaily commented Nov 16, 2020

Uh oh!

rohan-varma left a comment

Uh oh!

codecov bot commented Nov 16, 2020

Uh oh!

jaglinux commented Nov 17, 2020

Uh oh!

jaglinux commented Dec 4, 2020

Uh oh!

rohan-varma left a comment

Uh oh!

facebook-github-bot left a comment

Uh oh!

facebook-github-bot commented Dec 5, 2020

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants

[ROCm] Enable skipped distributed global tests #48023

[ROCm] Enable skipped distributed global tests #48023

Uh oh!

Conversation

jaglinux commented Nov 16, 2020 • edited by jeffdaily Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

jaglinux commented Nov 16, 2020 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

jeffdaily commented Nov 16, 2020

Uh oh!

rohan-varma left a comment

Choose a reason for hiding this comment

Uh oh!

codecov bot commented Nov 16, 2020

Codecov Report

Uh oh!

jaglinux commented Nov 17, 2020

Uh oh!

jaglinux commented Dec 4, 2020

Uh oh!

rohan-varma left a comment

Choose a reason for hiding this comment

Uh oh!

facebook-github-bot left a comment

Choose a reason for hiding this comment

Uh oh!

facebook-github-bot commented Dec 5, 2020

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants

jaglinux commented Nov 16, 2020 •

edited by jeffdaily

Loading

jaglinux commented Nov 16, 2020 •

edited

Loading