
Conversation

@colesbury
Member

PR #20685 incorrectly enabled P2P access only for non-contiguous copies.
This can make cudaMemcpy slow for inter-GPU copies, especially on ROCm
devices. I didn't notice a difference on CUDA 10, but @ngimel says it's
important for CUDA too.
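
For context, here is a minimal sketch of what enabling P2P before a contiguous inter-GPU copy looks like at the CUDA runtime level. This is illustrative only, not the PyTorch code touched by this PR; the device IDs and buffer size are assumed:

```cpp
// Illustrative sketch, not PyTorch's implementation: enable P2P access
// between two assumed devices before a contiguous inter-GPU copy, so
// cudaMemcpyPeer can take the direct peer path instead of staging
// through host memory.
#include <cuda_runtime.h>
#include <cstdio>

int main() {
  int src_dev = 0, dst_dev = 1;  // assumed device IDs
  int can_access = 0;
  cudaDeviceCanAccessPeer(&can_access, dst_dev, src_dev);
  if (can_access) {
    cudaSetDevice(dst_dev);
    // Enable once per (device, peer) pair; the flags argument must be 0.
    cudaError_t err = cudaDeviceEnablePeerAccess(src_dev, 0);
    if (err != cudaSuccess && err != cudaErrorPeerAccessAlreadyEnabled) {
      fprintf(stderr, "enable peer access: %s\n", cudaGetErrorString(err));
    }
  }

  const size_t nbytes = (1 << 20) * sizeof(float);  // assumed buffer size
  float *src = nullptr, *dst = nullptr;
  cudaSetDevice(src_dev);
  cudaMalloc(&src, nbytes);
  cudaSetDevice(dst_dev);
  cudaMalloc(&dst, nbytes);

  // Contiguous inter-GPU copy: takes the direct P2P path when peer
  // access is enabled; otherwise it may bounce through host memory.
  cudaMemcpyPeer(dst, dst_dev, src, src_dev, nbytes);
  cudaDeviceSynchronize();

  cudaFree(dst);
  cudaSetDevice(src_dev);
  cudaFree(src);
  return 0;
}
```

HIP mirrors these runtime calls (e.g. hipDeviceEnablePeerAccess, hipMemcpyPeer), which is consistent with the regression being especially visible on ROCm devices.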

@pytorchbot added the module: cuda and module: operators labels on Jun 17, 2019
@colesbury requested a review from soumith on Jun 17, 2019 21:36
Contributor

@facebook-github-bot left a comment


@colesbury has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

@colesbury
Member Author

I've run the ImageNet ResNet-18 example (without data loading) on four Vega 20 cards. Perf is ~0.113 ms per batch of 256, vs. ~0.175 ms/batch before this PR. (Before 320c385, perf was ~0.118 ms/batch.)

@facebook-github-bot
Contributor

@colesbury merged this pull request in cc4498a.

zdevito pushed a commit to zdevito/ATen that referenced this pull request Jun 18, 2019
Summary:
PR pytorch/pytorch#20685 incorrectly enabled P2P access only for non-contiguous copies.
This can make cudaMemcpy slow for inter-GPU copies, especially on ROCm
devices. I didn't notice a difference on CUDA 10, but ngimel says it's
important for CUDA too.
Pull Request resolved: pytorch/pytorch#21872

Differential Revision: D15863965

Pulled By: colesbury

fbshipit-source-id: 0a858f3c338fa2a5d05949d7f65fc05a70a9dfe1
