Make CUDA triu / tril support batches of size > 65535 #21067
Conversation
ngimel left a comment:
I'm OK with this PR if we only need functional support; if performance is important, some changes are necessary. The code is nicely simplified.
@ngimel This is not necessarily a performance-critical kernel - it is generally used in tandem with many linear algebra operations. However, since you mentioned that indexing based on int32 helps a lot for smaller inputs, I've incorporated it into the PR.
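For context, a minimal sketch of the kind of int32 dispatch being referred to here (hypothetical names, not code from this PR): use 32-bit indexing when every linear index fits in int32_t, and fall back to 64-bit indexing otherwise.

```cuda
// Hypothetical host-side dispatch: 32-bit index arithmetic tends to be
// cheaper on the GPU, so prefer it when the element count allows.
#include <cstdint>
#include <limits>

template <typename KernelLauncher>
void dispatch_index_type(int64_t numel, KernelLauncher launch) {
  if (numel <= static_cast<int64_t>(std::numeric_limits<int32_t>::max())) {
    launch(int32_t{0});   // instantiate the kernel with 32-bit indices
  } else {
    launch(int64_t{0});   // fall back to 64-bit indices for huge tensors
  }
}

// Possible usage (triu_kernel is a hypothetical kernel templated on the
// index type, as sketched further below):
//   dispatch_index_type(numel, [&](auto idx) {
//     using index_t = decltype(idx);
//     triu_kernel<scalar_t, index_t><<<blocks, threads>>>(/* ... */);
//   });
```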
@pytorchbot rebase this please
@pytorchbot merge this please
facebook-github-bot left a comment:
@ezyang is landing this pull request. If you are a Facebook employee, you can view this diff on Phabricator.
Summary: In the previous implementation of triu / tril, the batch size was passed in the 2nd dimension of the grid. This dimension is limited to 65535, which means that performing triu / tril on a tensor with batch size > 65535 would throw an error. This PR removes the dependence on the 2nd grid dimension, along with the corresponding contiguity constraints.

Changelog:
- Compute offset, row and col in the kernel
- Use the 1st dimension of the grid alone
- Remove unnecessary contiguity checks on tensors as a result of this change

Pull Request resolved: pytorch/pytorch#21067
Differential Revision: D15572501
Pulled By: ezyang
fbshipit-source-id: 93851cb661918ce794d43eeb12c8a38762e1358c
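As a rough illustration of the approach described in the summary (a hypothetical sketch, not the PR's actual kernel), each thread decodes its row and column from a flat linear index over the whole batched tensor, so the launch needs only gridDim.x and is no longer bound by the 65535 limit on the grid's 2nd dimension. The kernel name, parameters, and diagonal handling below are assumptions for illustration.

```cuda
#include <cuda_runtime.h>
#include <cstdint>

// Hypothetical triu kernel: one flat index per element, walked with a
// grid-stride loop. `IndexType` would be int32_t or int64_t depending on
// the element count (see the dispatch sketch above).
template <typename scalar_t, typename IndexType>
__global__ void triu_kernel(scalar_t* out, const scalar_t* in,
                            IndexType rows, IndexType cols,
                            IndexType total_elems, int64_t diagonal) {
  IndexType stride = (IndexType)gridDim.x * blockDim.x;
  for (IndexType linear = blockIdx.x * (IndexType)blockDim.x + threadIdx.x;
       linear < total_elems; linear += stride) {
    // Recover (row, col) from the flat index; the batch offset is already
    // folded into `linear`, so no blockIdx.y / blockIdx.z is needed.
    IndexType col = linear % cols;
    IndexType row = (linear / cols) % rows;
    // triu keeps elements on or above the chosen diagonal.
    out[linear] =
        (static_cast<int64_t>(col) - static_cast<int64_t>(row) >= diagonal)
            ? in[linear]
            : scalar_t(0);
  }
}
```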