Use atomicAdd from cuda_fp16 header when building with CUDA 10 #12108

syed-ahmed · 2018-09-26T18:22:35Z

An efficient atomicAdd for halfs has been added in cuda_fp16.h in CUDA 10:
__CUDA_FP16_DECL__ __half atomicAdd(__half *address, __half val);

Through this change, PyTorch will be able to utilize efficient atomicAdd when building with CUDA 10.

facebook-github-bot

soumith has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

Summary: An efficient atomicAdd for halfs has been added in `cuda_fp16.h` in CUDA 10: ```__CUDA_FP16_DECL__ __half atomicAdd(__half *address, __half val);``` Through this change, PyTorch will be able to utilize efficient atomicAdd when building with CUDA 10. Pull Request resolved: pytorch/pytorch#12108 Differential Revision: D10053385 Pulled By: soumith fbshipit-source-id: 946c90691a8f6bdcf6d6e367a507ac3c9970b750

Calls atomicAdd from cuda_fp16 header when using at::Half with CUDA 10

4d0af20

syed-ahmed requested review from apaszke, colesbury, ezyang, gchanan, soumith and zdevito as code owners September 26, 2018 18:22

soumith approved these changes Sep 26, 2018

View reviewed changes

facebook-github-bot reviewed Sep 26, 2018

View reviewed changes

facebook-github-bot closed this in 1b45f68 Sep 26, 2018

syed-ahmed deleted the cuda-10-atomic-add branch September 26, 2018 23:17

bearpelican mentioned this pull request Dec 28, 2018

negligble performance gains and non convergence on DCGAN using apex (what to change?) NVIDIA/apex#82

Open

ezyang added open source merged labels Jun 24, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Use atomicAdd from cuda_fp16 header when building with CUDA 10 #12108

Use atomicAdd from cuda_fp16 header when building with CUDA 10 #12108

Uh oh!

syed-ahmed commented Sep 26, 2018

Uh oh!

facebook-github-bot left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Use atomicAdd from cuda_fp16 header when building with CUDA 10 #12108

Use atomicAdd from cuda_fp16 header when building with CUDA 10 #12108

Uh oh!

Conversation

syed-ahmed commented Sep 26, 2018

Uh oh!

facebook-github-bot left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants