lu: When not using pivoting, return the identity permutation instead of zeros #22242

bamos · 2019-06-25T23:05:29Z

Some of my qpth users have told me that updating to the latest version of PyTorch and replacing the btrifact/btrisolve calls with the LU ones wasn't working and I didn't believe them until I tried it myself :)

These updates have broken unpivoted LU factorizations/solves on CUDA. The LU factorization code used to return the identity permutation when pivoting wasn't used but now returns all zeros as the pivots. This PR reverts it back to return the identity permutation. I've not yet tested this code as I'm having some trouble compiling PyTorch with this and am hitting #21700 and am not sure how to disable that option.

Here's a MWE to reproduce the broken behavior, and my fix.

torch.manual_seed(0)

n = 4
L = torch.randn(n,n)
A = L.mm(L.t()).unsqueeze(0)
b = torch.randn(1, n)

A_lu_cpu = torch.lu(A)
A_lu_cuda_nopivot = torch.lu(A.cuda(), pivot=False)
A_lu_cuda_pivot = torch.lu(A.cuda(), pivot=True)
print('A_lu_cuda_nopivot\n', A_lu_cuda_nopivot)
print('-----\nA_lu_cuda_pivot\n', A_lu_cuda_nopivot)

x_cpu = b.lu_solve(*A_lu_cpu)
x_cuda_nopivot = b.cuda().lu_solve(*A_lu_cuda_nopivot)
x_cuda_nopivot_fixed = b.cuda().lu_solve(
    A_lu_cuda_nopivot[0], torch.arange(1, n+1, device='cuda:0').int())
x_cuda_pivot = b.cuda().lu_solve(*A_lu_cuda_pivot)

print(x_cpu, x_cuda_nopivot, x_cuda_nopivot_fixed, x_cuda_pivot)

Output:

A_lu_cuda_nopivot
 (tensor([[[ 2.8465, -0.7560,  0.8716, -1.7337],
         [-0.2656,  5.5724, -1.1316,  0.6678],
         [ 0.3062, -0.2031,  1.4206, -0.5438],
         [-0.6091,  0.1198, -0.3828,  1.5103]]], device='cuda:0'), tensor([[0, 0, 0, 0]], device='cuda:0', dtype=torch.int32))


-----


A_lu_cuda_pivot
 (tensor([[[ 2.8465, -0.7560,  0.8716, -1.7337],
         [-0.2656,  5.5724, -1.1316,  0.6678],
         [ 0.3062, -0.2031,  1.4206, -0.5438],
         [-0.6091,  0.1198, -0.3828,  1.5103]]], device='cuda:0'), tensor([[0, 0, 0, 0]], device='cuda:0', dtype=torch.int32))


(tensor([[-0.3121, -0.1673, -0.4450, -0.2483]]),
 tensor([[-0.1661, -0.1875, -0.5694, -0.4772]], device='cuda:0'),
 tensor([[-0.3121, -0.1673, -0.4450, -0.2483]], device='cuda:0'),
 tensor([[-0.3121, -0.1673, -0.4450, -0.2483]], device='cuda:0'))

…of all zeros.

aten/src/ATen/native/cuda/BatchLinearAlgebra.cu

vishwakftw

Thank you for fixing this. I’ll pre-approve this for now.

test/test_torch.py

Needs a fix

test/test_torch.py

vishwakftw

After investigating the test failures, I've localized the issues. I am 99% sure that the tests would pass now. I'm sorry about reviewing again and again and keeping the PR in a state of flux.

aten/src/ATen/native/cuda/BatchLinearAlgebra.cu

test/test_torch.py

vishwakftw

The failing Windows test has passed which most probably indicates that the patch is correct. Should be good to merge after CI finishes. Thank you @bamos for the PR!

vishwakftw · 2019-06-27T21:52:53Z

Failures seem to be unrelated.

@pytorchbot merge this please

facebook-github-bot

@ezyang is landing this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

vishwakftw · 2019-07-02T02:13:09Z

@ezyang is this good to go?

vishwakftw · 2019-07-07T06:13:14Z

@pytorchbot rebase this please

facebook-github-bot

@ezyang is landing this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

…of zeros (#22242) Summary: Some of my qpth users have told me that updating to the latest version of PyTorch and replacing the btrifact/btrisolve calls with the LU ones wasn't working and I didn't believe them until I tried it myself :) These updates have broken unpivoted LU factorizations/solves on CUDA. The LU factorization code used to return the identity permutation when pivoting wasn't used but now returns all zeros as the pivots. This PR reverts it back to return the identity permutation. I've not yet tested this code as I'm having some trouble compiling PyTorch with this and am hitting pytorch/pytorch#21700 and am not sure how to disable that option. Here's a MWE to reproduce the broken behavior, and my fix. ```python torch.manual_seed(0) n = 4 L = torch.randn(n,n) A = L.mm(L.t()).unsqueeze(0) b = torch.randn(1, n) A_lu_cpu = torch.lu(A) A_lu_cuda_nopivot = torch.lu(A.cuda(), pivot=False) A_lu_cuda_pivot = torch.lu(A.cuda(), pivot=True) print('A_lu_cuda_nopivot\n', A_lu_cuda_nopivot) print('-----\nA_lu_cuda_pivot\n', A_lu_cuda_nopivot) x_cpu = b.lu_solve(*A_lu_cpu) x_cuda_nopivot = b.cuda().lu_solve(*A_lu_cuda_nopivot) x_cuda_nopivot_fixed = b.cuda().lu_solve( A_lu_cuda_nopivot[0], torch.arange(1, n+1, device='cuda:0').int()) x_cuda_pivot = b.cuda().lu_solve(*A_lu_cuda_pivot) print(x_cpu, x_cuda_nopivot, x_cuda_nopivot_fixed, x_cuda_pivot) ``` Output: ``` A_lu_cuda_nopivot (tensor([[[ 2.8465, -0.7560, 0.8716, -1.7337], [-0.2656, 5.5724, -1.1316, 0.6678], [ 0.3062, -0.2031, 1.4206, -0.5438], [-0.6091, 0.1198, -0.3828, 1.5103]]], device='cuda:0'), tensor([[0, 0, 0, 0]], device='cuda:0', dtype=torch.int32)) ----- A_lu_cuda_pivot (tensor([[[ 2.8465, -0.7560, 0.8716, -1.7337], [-0.2656, 5.5724, -1.1316, 0.6678], [ 0.3062, -0.2031, 1.4206, -0.5438], [-0.6091, 0.1198, -0.3828, 1.5103]]], device='cuda:0'), tensor([[0, 0, 0, 0]], device='cuda:0', dtype=torch.int32)) (tensor([[-0.3121, -0.1673, -0.4450, -0.2483]]), tensor([[-0.1661, -0.1875, -0.5694, -0.4772]], device='cuda:0'), tensor([[-0.3121, -0.1673, -0.4450, -0.2483]], device='cuda:0'), tensor([[-0.3121, -0.1673, -0.4450, -0.2483]], device='cuda:0')) ``` Pull Request resolved: pytorch/pytorch#22242 Differential Revision: D16049334 Pulled By: ezyang fbshipit-source-id: 7eacae810d87ffbdf8e07159bbbc03866dd9979d

facebook-github-bot · 2019-07-09T20:36:49Z

@ezyang merged this pull request in 046c458.

lu: When not using pivoting, return the identity permutation instead …

ef76da6

…of all zeros.

pytorchbot added module: cuda Related to torch.cuda, and CUDA support in general module: operators labels Jun 25, 2019

soumith requested a review from vishwakftw June 26, 2019 02:56

vishwakftw suggested changes Jun 26, 2019

View reviewed changes

aten/src/ATen/native/cuda/BatchLinearAlgebra.cu Outdated Show resolved Hide resolved

bamos added 3 commits June 26, 2019 07:04

Add tests for unpivoted CUDA LU code.

4b8fe1b

Fix initial pivot.

6576b3d

expand_as -> expand

2c07787

vishwakftw previously approved these changes Jun 26, 2019

View reviewed changes

vishwakftw reviewed Jun 26, 2019

View reviewed changes

test/test_torch.py Outdated Show resolved Hide resolved

Fix nopiv check.

4012a96

vishwakftw reviewed Jun 26, 2019

View reviewed changes

test/test_torch.py Outdated Show resolved Hide resolved

bamos added 2 commits June 26, 2019 15:51

Fix size test and lint issue.

4863175

test_torch: Check all batch elements of nopiv

08bb363

vishwakftw reviewed Jun 27, 2019

View reviewed changes

aten/src/ATen/native/cuda/BatchLinearAlgebra.cu Outdated Show resolved Hide resolved

test/test_torch.py Outdated Show resolved Hide resolved

clone after expand and pivot=False in nopiv test

fec75e9

vishwakftw self-assigned this Jun 27, 2019

vishwakftw approved these changes Jun 27, 2019

View reviewed changes

pytorchbot added the merge-this-please Was marked for merge with @pytorchbot merge this please label Jun 27, 2019

facebook-github-bot reviewed Jun 28, 2019

View reviewed changes

Merge remote-tracking branch 'origin/master' into HEAD

7a97501

facebook-github-bot reviewed Jul 8, 2019

View reviewed changes

facebook-github-bot closed this in 046c458 Jul 9, 2019

facebook-github-bot added the merged label Jul 9, 2019

mruberry added the Merged label Oct 28, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

lu: When not using pivoting, return the identity permutation instead of zeros #22242

lu: When not using pivoting, return the identity permutation instead of zeros #22242

Uh oh!

bamos commented Jun 25, 2019

Uh oh!

Uh oh!

vishwakftw left a comment

Uh oh!

Uh oh!

Uh oh!

vishwakftw left a comment

Uh oh!

Uh oh!

Uh oh!

vishwakftw left a comment

Uh oh!

vishwakftw commented Jun 27, 2019 •

edited

Loading

Uh oh!

facebook-github-bot left a comment

Uh oh!

vishwakftw commented Jul 2, 2019

Uh oh!

vishwakftw commented Jul 7, 2019

Uh oh!

facebook-github-bot left a comment

Uh oh!

facebook-github-bot commented Jul 9, 2019

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

lu: When not using pivoting, return the identity permutation instead of zeros #22242

lu: When not using pivoting, return the identity permutation instead of zeros #22242

Uh oh!

Conversation

bamos commented Jun 25, 2019

Uh oh!

Uh oh!

vishwakftw left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

vishwakftw left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

vishwakftw left a comment

Choose a reason for hiding this comment

Uh oh!

vishwakftw commented Jun 27, 2019 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

facebook-github-bot left a comment

Choose a reason for hiding this comment

Uh oh!

vishwakftw commented Jul 2, 2019

Uh oh!

vishwakftw commented Jul 7, 2019

Uh oh!

facebook-github-bot left a comment

Choose a reason for hiding this comment

Uh oh!

facebook-github-bot commented Jul 9, 2019

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

vishwakftw commented Jun 27, 2019 •

edited

Loading