Skip to content

Conversation

@jeffdaily
Copy link
Collaborator

Revision of #22173 to address CI failure after merging.

@pytorchbot pytorchbot added module: cuda Related to torch.cuda, and CUDA support in general oncall: distributed Add this issue/PR to distributed oncall triage queue module: nccl Problems related to nccl support module: pybind Related to our Python bindings / interactions with other Python libraries labels Jul 18, 2019
Copy link
Contributor

@mrshenli mrshenli left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks @jeffdaily!!

Pre-approving as this is already reviewed in #22173, which misses noexcept(false) in AutoNcclGroup destructor. Let's wait for the CI to complete before landing.

Copy link
Contributor

@facebook-github-bot facebook-github-bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@mrshenli has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

@mrshenli
Copy link
Contributor

mrshenli commented Jul 19, 2019

@pytorchbot retest this please

@mrshenli
Copy link
Contributor

Hi @yf225 , I tried the failed ROCM CI test ([1] [2]) twice, but the following tests keep failing. I think it is irrelevant to this PR, as the same error also occurs in other PR (e.g.). Do you know if anyone is working on the fix? I would like to make sure all tests pass before landing this.

test_conv_backcompat (__main__.TestNN) ... ERROR
test_no_grad (__main__.TestNN) ... ERROR
test_noncontig_conv_grad_cuda (__main__.TestNN) ... ERROR

@pietern
Copy link
Contributor

pietern commented Jul 22, 2019

@pytorchbot retest this please

@facebook-github-bot
Copy link
Contributor

@mrshenli merged this pull request in 8bc28cc.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

Merged module: cuda Related to torch.cuda, and CUDA support in general module: nccl Problems related to nccl support module: pybind Related to our Python bindings / interactions with other Python libraries oncall: distributed Add this issue/PR to distributed oncall triage queue open source

Projects

None yet

Development

Successfully merging this pull request may close these issues.

7 participants