Creates device generic cuDNN decorators #26791
Conversation
Summary: Make cudnn rnn respect current stream. After this lands, non-default test stream can be reenabled in #26791

Pull Request resolved: #27026

Test Plan: default stream functionality is tested in existing tests; stream safety tests will be added in #26791

Differential Revision: D17656967

Pulled By: ngimel

fbshipit-source-id: 8b051aedd1df089b21f666ec553a5acefffdac88
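The pattern this fix enables can be sketched as follows: once the cuDNN RNN kernels are enqueued on the current stream rather than the default stream, running an RNN forward pass on a side stream becomes safe. This is a minimal illustration only; the helper name and tensor shapes are hypothetical, and on CPU it simply calls the module directly.

```python
import torch

def rnn_on_side_stream(rnn, x, h0):
    """Run an RNN forward pass under a non-default CUDA stream (sketch).

    With cuDNN RNN respecting the current stream, enqueueing the forward
    pass on a side stream is safe. Falls back to a plain call on CPU.
    """
    if x.is_cuda:
        side = torch.cuda.Stream()
        # Ensure the side stream sees prior work on the default stream.
        side.wait_stream(torch.cuda.current_stream())
        with torch.cuda.stream(side):
            out, hn = rnn(x, h0)
        # Make later default-stream work wait for the RNN to finish.
        torch.cuda.current_stream().wait_stream(side)
        return out, hn
    return rnn(x, h0)
```

On CPU the stream machinery is bypassed entirely, so the helper is safe to call in device-generic tests.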
@pytorchbot rebase this please.
facebook-github-bot left a comment
@mruberry has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.
Force-pushed from 96d3781 to b52e32c
@pytorchbot rebase this please
facebook-github-bot left a comment
@mruberry is landing this pull request. If you are a Facebook employee, you can view this diff on Phabricator.
Summary:
- Creates skipCUDAIfNoCudnn and skipCUDAIfCudnnVersionLessThan decorators
- Makes several test_nn.py tests generic

Many tests in test_nn.py test cuDNN. These tests are guarded by various conditionals using TEST_CUDNN and TEST_CUDNN_VERSION imported from common_cuda.py, with custom error messages like 'CUDNN not available' and 'needs cudnn.'

This PR suggests using the CUDA base test class instead of common_cuda.py to test cuDNN's availability, at least in generic tests. The CUDA base test class is preferable to common_cuda.py since it only creates a CUDA context if its tests are run; importing from common_cuda.py, on the other hand, always creates a CUDA context. Using the CUDA base test class is also consistent with how other generic tests are guarded and provides consistent skip messages.

One quirk of this approach is that it uses the self argument of the test functions to check for cuDNN availability during a test; see test_rnn_retain_variables. The self argument could also be used to check the device type instead of the more verbose torch.device(device).type == 'cuda'.

An alternative approach to making test_nn.py generic would be to continue using the common_cuda.py imports, try to keep their skip messages consistent, and not worry about creating unnecessary CUDA contexts. That would preclude writing generic tests that can only run on CUDA if cuDNN is available, however, so tests like "_test_RNN_cpu_vs_cudnn" would require additional changes to become device-generic precision tests like "_test_RNN_cpu_vs_xla."

For consistency, simplicity, and ease of use, I recommend we adopt the proposed decorators and make use of the self argument when productive.

Pull Request resolved: pytorch#26791

Differential Revision: D17678325

Pulled By: mruberry

fbshipit-source-id: 1794735ede9bc9f36856e72b3804b136ad3e0de2
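The decorator pattern described above can be sketched in plain Python. This is a simplified, hypothetical analogue (not the real skipCUDAIfNoCudnn implementation): the key property is that the availability check runs at call time through self, so merely importing the test file never creates a CUDA context, unlike a module-level TEST_CUDNN flag. The no_cudnn probe below is a stand-in for a real query of cuDNN availability.

```python
import functools
import unittest

def skip_cuda_if(condition, reason):
    """Sketch of a device-generic skip decorator.

    The condition is evaluated only when the test runs, and only when
    self.device_type is 'cuda', so CPU (or other-device) instances of
    the same generic test are unaffected.
    """
    def decorator(fn):
        @functools.wraps(fn)
        def wrapper(self, *args, **kwargs):
            if self.device_type == 'cuda' and condition():
                raise unittest.SkipTest(reason)
            return fn(self, *args, **kwargs)
        return wrapper
    return decorator

def no_cudnn():
    # Hypothetical probe; a real version would query the cuDNN backend.
    return True

class CUDATestBase:
    device_type = 'cuda'

    @skip_cuda_if(no_cudnn, 'cuDNN not available')
    def test_rnn(self):
        return 'ran'

class CPUTestBase(CUDATestBase):
    device_type = 'cpu'
```

Instantiated for CPU, test_rnn runs normally; instantiated for CUDA with cuDNN "unavailable," it raises SkipTest with a consistent message, which is the behavior the proposed decorators standardize.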