Conversation

@vishwakftw (Contributor)

Changelog:

  • Migrate _dirichlet_grad implementation from TH to ATen
  • Add CUDA support for _dirichlet_grad

Closes #11030.
Closes #15773.

@pytorchbot added labels module: cpu, module: cuda, module: internals, module: operators on May 31, 2019
@vishwakftw (Contributor Author)

cc: @fritzo

@vishwakftw (Contributor Author)

@pytorchbot rebase this please

@vishwakftw vishwakftw force-pushed the dirichlet_grad-cuda-support branch from f971b43 to fd088b3 Compare May 31, 2019 12:25
@fritzo (Collaborator) commented May 31, 2019

Great. Can you confirm that

  • This PR moves dirichlet_grad math to a new file so that the code can be used by both CPU and CUDA codepaths, and therefore the CUDA implementation is covered by existing CPU tests.
  • The CUDA implementation uses double precision for the function approximations (which require higher precision).

@vishwakftw (Contributor Author)

_dirichlet_grad uses the _dirichlet_grad_one function defined in Distributions.h for both the CPU and CUDA paths, which addresses your first point.

Regarding your second point, I followed the same approach as standard_gamma_grad. On CUDA, I believe the acc_type (the data type used for the function approximations) is float for half and float inputs, and double for double inputs. On CPU, however, the acc_type is universally double.

@vishwakftw vishwakftw requested review from ezyang and gchanan and removed request for gchanan June 1, 2019 04:52
@ezyang (Contributor) left a comment

I'm assuming there were no substantive algorithmic changes.

Tensor _dirichlet_grad_cpu(const Tensor& x, const Tensor& alpha, const Tensor& total) {
  Tensor ret = at::empty(x.sizes(), x.options());
  AT_DISPATCH_FLOATING_TYPES(x.scalar_type(), "_dirichlet_grad_cpu", [&] {
    CPU_tensor_apply4<scalar_t, scalar_t, scalar_t, scalar_t>(ret, x, alpha, total,
@ezyang (Contributor)
Doesn't have to be this PR, but at some point it would be good to port this into using TensorIterator (which I think should be doable here.) CC @VitalyFedyunin

@vishwakftw (Contributor Author) commented Jun 3, 2019

There are a lot of dispatches in this file that would need to be ported to TensorIterator.

@ezyang (Contributor) commented Jun 3, 2019

Windows failures look related to this diff:

14:29:13          C:/Jenkins/workspace/caffe2-builds/py2-cuda9.0-cudnn7-windows-build/aten/src\ATen/native/Distributions.h(271): error : calling a __host__ function("isnan< ::c10::Half> ") from a __device__ function("_NV_ANON_NAMESPACE::_beta_grad_alpha_small< ::c10::Half, float> ") is not allowed [C:\Jenkins\workspace\caffe2-builds\py2-cuda9.0-cudnn7-windows-build\caffe2\caffe2_gpu.vcxproj]
14:29:13          C:/Jenkins/workspace/caffe2-builds/py2-cuda9.0-cudnn7-windows-build/aten/src\ATen/native/Distributions.h(271): error : identifier "isnan< ::c10::Half> " is undefined in device code [C:\Jenkins\workspace\caffe2-builds\py2-cuda9.0-cudnn7-windows-build\caffe2\caffe2_gpu.vcxproj]
14:29:13          C:/Jenkins/workspace/caffe2-builds/py2-cuda9.0-cudnn7-windows-build/aten/src\ATen/native/Distributions.h(288): error : calling a __host__ function("isnan< ::c10::Half> ") from a __device__ function("_NV_ANON_NAMESPACE::_beta_grad_beta_small< ::c10::Half, float> ") is not allowed [C:\Jenkins\workspace\caffe2-builds\py2-cuda9.0-cudnn7-windows-build\caffe2\caffe2_gpu.vcxproj]
14:29:13          C:/Jenkins/workspace/caffe2-builds/py2-cuda9.0-cudnn7-windows-build/aten/src\ATen/native/Distributions.h(288): error : identifier "isnan< ::c10::Half> " is undefined in device code [C:\Jenkins\workspace\caffe2-builds\py2-cuda9.0-cudnn7-windows-build\caffe2\caffe2_gpu.vcxproj]

@ezyang (Contributor) commented Jun 3, 2019

I guess you accidentally turned on half precision support for this operation?

@vishwakftw (Contributor Author) commented Jun 3, 2019

Yes, I'm sorry. I'll revert that change - it's in Distributions.cu.

@vishwakftw (Contributor Author)

Windows failure is unrelated to this PR.

@ezyang is this good to go?

@vishwakftw (Contributor Author)

@pytorchbot rebase this please

@vishwakftw (Contributor Author)

@pytorchbot merge this please

@pytorchbot added the merge-this-please label on Jun 5, 2019
@facebook-github-bot (Contributor) left a comment
@ezyang is landing this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

@vishwakftw vishwakftw deleted the dirichlet_grad-cuda-support branch June 5, 2019 13:41
zdevito pushed a commit to zdevito/ATen that referenced this pull request Jun 5, 2019
Summary:
Changelog:
- Migrate _dirichlet_grad implementation from TH to ATen
- Add CUDA support for _dirichlet_grad

Closes #11030.
Closes #15773.
Pull Request resolved: pytorch/pytorch#21191

Differential Revision: D15660330

Pulled By: ezyang

fbshipit-source-id: c8ad5b80366e5348139ce9be10400f22fc430344
@facebook-github-bot (Contributor)

@ezyang merged this pull request in 6251c56.



Development

Successfully merging this pull request may close these issues:

  • Need GPU implementation of dirichlet_grad (originally: Reparameterized gradient on GPU for beta / Dirichlet)
  • implement dirichlet / beta GPU grad
