Remove curandStateMTGP32 usage #21301

syed-ahmed · 2019-06-03T19:03:24Z

Stack from ghstack:

Remove curandStateMTGP32 usage #21301
Speedup bernoulli_scalar_cuda_kernel with grid-stride loop #21300
Move THCTensor_(lognormal) to ATen #21299
Move THCTensor_(geometric) to ATen #21298
Move THCTensor_(exponential) to ATen #21297

Resubmit of #20886

Summary:

This PR removes all uses of curandStateMTGP32, since it is not stream-safe.
The main changes are:

It modifies THCTensor_(getRNGState) and THCTensor_(setRNGState) so that they no longer read/write curandStateMTGP32.
It modifies RRelu.cu and the CUDA multinomial kernels to use curandStatePhilox.
It deletes new_state.clone() from torch.cuda.random.py to get a performance boost.
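
As a rough illustration of why a counter-based generator such as Philox fits concurrent streams, here is a toy sketch. It is not Philox and not PyTorch code: SHA-256 stands in for the mixing function, and the names `counter_random`, `seed`, `subsequence`, and `offset` are assumptions made for this example. The only point is that each output depends solely on its inputs, so no mutable generator state has to be shared between callers.

```python
import hashlib
import struct


def counter_random(seed: int, subsequence: int, offset: int) -> float:
    """Toy counter-based generator (SHA-256 stand-in, NOT Philox).

    Returns a float in [0, 1) that depends only on the three inputs.
    """
    # Pack the three unsigned 64-bit inputs into one key.
    key = struct.pack("<QQQ", seed, subsequence, offset)
    digest = hashlib.sha256(key).digest()
    # Take the first 8 bytes as an unsigned 64-bit integer and keep 53 bits.
    value = struct.unpack("<Q", digest[:8])[0] >> 11
    return value / float(1 << 53)


# Each hypothetical "thread" uses its own subsequence, so the order in
# which the values are computed does not change any of them.
forward = [counter_random(42, tid, 0) for tid in range(4)]
backward = [counter_random(42, tid, 0) for tid in reversed(range(4))]
print(forward == backward[::-1])  # prints True
```

A generator whose state is a single buffer updated in place would not have this property, because two callers advancing it at the same time would interfere with each other.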

Differential Revision: D15632929

Remove curandStateMTGP32 usage gh-metadata: pytorch pytorch 21301 gh/syed-ahmed/13/head

ryanchesler · 2019-06-04T04:39:19Z

Sorry if this is a noob question, but is there some way for me to start using these changes before they are fully tested? I am running into this error and hoping this fix will resolve it.


syed-ahmed · 2019-06-04T05:50:46Z

@ryanchesler The changes should be landing very soon if you could wait a bit. If not, you can find the full diff here: https://github.com/pytorch/pytorch/commits/gh/syed-ahmed/13/orig. It's the first 5 commits.


syed-ahmed · 2019-06-04T16:36:45Z

@ezyang all tests in the stack have passed. GitHub is just not updating the statuses.

ryanchesler · 2019-06-04T17:29:12Z

Awesome. Glad this is making it through. Hopefully it solves the issue blocking me.

ezyang · 2019-06-04T22:02:27Z

Hey @syed-ahmed this stack is conflicting with cauchy which I just landed. Can you rebase past that?


syed-ahmed · 2019-06-04T22:29:49Z

Rebased :)

Summary:
Pull Request resolved: pytorch/pytorch#21301

ghimport-source-id: d4516237a8fb46d1f74c47532e849e5926fc6a79
Differential Revision: D15632929
Pulled By: ezyang
fbshipit-source-id: b5147edb95dc3d71f87581aa2ab002e48c3fef30

facebook-github-bot · 2019-06-06T01:35:20Z

@ezyang merged this pull request in 0e3c4a0.

sbelharbi · 2019-06-15T13:36:46Z

Hi,
Sorry for this question.
I am facing some issues with rng states (see here).
It was suggested that this merge has fixed the issue.
Comparing the date of this merge with the release date of the latest PyTorch version, 1.1.0 (April 30), it seems that PyTorch 1.1.0 does not include this change yet.

Do you have an idea when this feature will be available in a stable version?

I want to publish my code and make it clear which version of PyTorch I used, so that it will be easy to install and reproduce.

Thanks!

ezyang · 2019-06-16T19:17:11Z

It will probably be included in 1.2 (assuming it doesn't get reverted before branch cut). You can also try using a nightly to get binaries with the fix earlier.

sbelharbi · 2019-06-16T20:23:12Z

Thanks!
The issue seems to be fixed in the nightly build (https://download.pytorch.org/whl/nightly/cu100/torch_nightly-1.2.0.dev20190616-cp37-cp37m-linux_x86_64.whl).

I hope 1.2.0 will be released soon!
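
For readers who want to check RNG state handling on their own install, a minimal sketch of a save/restore round trip follows. It assumes PyTorch is installed and a CUDA device is visible, and returns None when either is missing. It is not the reproduction from the linked issue, and the helper name `cuda_rng_roundtrip_ok` is hypothetical.

```python
def cuda_rng_roundtrip_ok():
    """Return True/False for the round trip, or None if it cannot run."""
    try:
        import torch
    except ImportError:
        return None  # PyTorch is not installed
    if not torch.cuda.is_available():
        return None  # no CUDA device visible

    torch.cuda.manual_seed(0)
    state = torch.cuda.get_rng_state()    # snapshot the current device's RNG state
    first = torch.rand(4, device="cuda")  # draw once
    torch.cuda.set_rng_state(state)       # rewind to the snapshot
    second = torch.rand(4, device="cuda") # draw again from the same state
    return bool(torch.equal(first, second))


print(cuda_rng_roundtrip_ok())
```

A result of True means the restored state reproduced the same samples; False would point at a state-handling problem worth reporting together with the exact PyTorch version.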

sbelharbi · 2019-08-09T01:40:09Z

Hi,
@ezyang @syed-ahmed
Sorry to bother you.
Was this commit integrated into v1.2.0, which was released today?
https://github.com/pytorch/pytorch/releases/tag/v1.2.0

I searched the release notes (keywords: rng, 21301) and did not find it.
Thanks

ezyang · 2019-08-09T13:43:34Z

Yes it's in 1.2.0; 0e3c4a0 is reachable from v1.2.0 branch. It does look like this is missing from the changelog notes.
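
One way to check reachability like this yourself is `git merge-base --is-ancestor`, sketched below from Python. It assumes a local clone of the repository with the relevant refs fetched; the helper names and the `repo_dir` parameter are hypothetical and not part of any PyTorch tooling.

```python
import subprocess


def ancestor_check_cmd(commit: str, ref: str) -> list:
    """Build the git command that tests whether `commit` is an ancestor of `ref`."""
    return ["git", "merge-base", "--is-ancestor", commit, ref]


def is_reachable(commit: str, ref: str, repo_dir: str = ".") -> bool:
    """True if `commit` is an ancestor of `ref` in the checkout at `repo_dir`.

    git exits with 0 when the commit is an ancestor and 1 when it is not;
    other exit codes (for example an unknown ref) are also reported as False.
    """
    result = subprocess.run(
        ancestor_check_cmd(commit, ref),
        cwd=repo_dir,
        stdout=subprocess.DEVNULL,
        stderr=subprocess.DEVNULL,
    )
    return result.returncode == 0
```

For example, `is_reachable("0e3c4a0", "v1.2.0", "path/to/pytorch")` would run the check against a clone at that placeholder path.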

sbelharbi · 2019-08-09T14:36:06Z

thanks!

This was referenced Jun 3, 2019

Move THCTensor_(exponential) to ATen #21297

Closed

Move THCTensor_(geometric) to ATen #21298

Closed

pytorchbot added the module: cuda, module: internals, and module: operators labels Jun 3, 2019

This was referenced Jun 3, 2019

Move THCTensor_(lognormal) to ATen #21299

Closed

Speedup bernoulli_scalar_cuda_kernel with grid-stride loop #21300

Closed

syed-ahmed added 2 commits June 3, 2019 12:03

Remove curandStateMTGP32 usage

50181e2

Update on "Remove curandStateMTGP32 usage"

3711bb8


Update on "Remove curandStateMTGP32 usage"

37263fc


syed-ahmed requested a review from ezyang June 4, 2019 05:53

Update on "Remove curandStateMTGP32 usage"

4366bf9


syed-ahmed mentioned this pull request Jun 4, 2019

[CPU] Refactor Random Number Generators in ATen #21364

Closed

ezyang approved these changes Jun 4, 2019


Update on "Remove curandStateMTGP32 usage"

acfcd47


ezyang added the open source label Jun 5, 2019

facebook-github-bot closed this in 0e3c4a0 Jun 5, 2019

zou3519 deleted the gh/syed-ahmed/13/head branch June 5, 2019 21:08

facebook-github-bot added the merged label Jun 6, 2019

mruberry added the Merged label Oct 28, 2020
