
Conversation

@vishwakftw
Contributor

Earlier, the workspace size query and allocation were placed inside the loop.
However, since the batch consists of matrices with the same number of rows and columns, performing the workspace size query and allocation for every matrix in the batch is redundant.

This PR moves the workspace size query and allocation outside the loop, saving (batch_size - 1) queries and allocations (and, consequently, deallocations).

This change yields a substantial speedup in batched inverse computation (roughly 4x on batches of 3 x 3 matrices; see the benchmark below).

Changelog:

  • Move workspace query and allocation outside the batch loop

Test Plan:

  • All existing tests for inverse should pass to verify that the change is correct
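The hoisting described above can be sketched in Python. ATen's actual implementation is C++, and `workspace_query` and `invert_in_place` below are hypothetical stand-ins for the LAPACK workspace-size query and the per-matrix inverse kernel, used only to show the loop-invariant pattern:

```python
# Minimal sketch of the loop-invariant hoisting, assuming a LAPACK-style
# workspace protocol. These functions are hypothetical stand-ins, not
# ATen's real API.

query_count = 0

def workspace_query(n):
    """Pretend LAPACK workspace-size query (counts its own invocations)."""
    global query_count
    query_count += 1
    return n * 64  # a typical optimal lwork is n * block_size

def invert_in_place(matrix, workspace):
    """Pretend per-matrix inverse kernel; body elided for the sketch."""
    pass

def batched_inverse(batch, n):
    # After this PR: the query and allocation run once, because every
    # matrix in the batch has the same number of rows and columns.
    lwork = workspace_query(n)
    workspace = [0.0] * lwork
    for matrix in batch:
        invert_in_place(matrix, workspace)

batch_size, n = 1000, 3
batched_inverse([None] * batch_size, n)
print(f"workspace queries issued: {query_count}")  # 1 instead of batch_size
```

Before the change, the `workspace_query` and workspace allocation lived inside the `for` loop, so they would have run `batch_size` times instead of once.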

@vishwakftw
Contributor Author

Speedup summary on batches of 3 x 3 matrices:

| Dimensions | Before this PR | After this PR | Improvement factor (higher is better) |
| --- | --- | --- | --- |
| 100 x 3 x 3 | 95 µs ± 539 ns per loop | 24.7 µs ± 326 ns per loop | 3.85 |
| 200 x 3 x 3 | 185 µs ± 1.21 µs per loop | 43.6 µs ± 227 ns per loop | 4.23 |
| 500 x 3 x 3 | 450 µs ± 4.6 µs per loop | 99.1 µs ± 189 ns per loop | 4.54 |
| 1000 x 3 x 3 | 889 µs ± 4.5 µs per loop | 193 µs ± 586 ns per loop | 4.61 |
| 2000 x 3 x 3 | 1.74 ms ± 2.83 µs per loop | 379 µs ± 1.43 µs per loop | 4.59 |
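Numbers along these lines can be reproduced with a short timing script. The sketch below uses NumPy's batched `np.linalg.inv` as a stand-in for measuring batched inverse throughput (the figures above were measured against PyTorch itself, which this sketch does not assume is installed):

```python
import numpy as np
import timeit

def bench(batch_size, n=3, repeats=100):
    # Add n*I so every matrix in the batch stays well conditioned.
    rng = np.random.default_rng(0)
    a = rng.standard_normal((batch_size, n, n)) + n * np.eye(n)
    per_call = timeit.timeit(lambda: np.linalg.inv(a), number=repeats) / repeats
    return a, per_call

for batch_size in (100, 200, 500, 1000, 2000):
    a, per_call = bench(batch_size)
    print(f"{batch_size} x 3 x 3: {per_call * 1e6:.1f} us per call")
```

Absolute times will differ by machine and library, but the scaling with batch size should be roughly linear, as in the table.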

Contributor

@ezyang ezyang left a comment


Nice!

Contributor

@facebook-github-bot facebook-github-bot left a comment


@ezyang is landing this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

@facebook-github-bot
Contributor

@ezyang merged this pull request in 8c4b2a8.

zdevito pushed a commit to zdevito/ATen that referenced this pull request May 24, 2019
Summary:
Earlier, the workspace size query and allocation was placed inside the loop.
However, since we have batches of matrices with the same number of rows and columns, the workspace size query and allocation for every matrix in the batch is redundant.

This PR moves the workspace size query and allocation outside the loop, effectively saving (batch_size - 1) number of queries and allocation (and consequently the deallocation).

There is a tremendous speedup in inverse computation as a result of this change.

Changelog:
- Move workspace query and allocation outside the batch loop
Pull Request resolved: pytorch/pytorch#20904

Differential Revision: D15495505

Pulled By: ezyang

fbshipit-source-id: 226729734465fcaf896f86e1b1a548a81440e082
@vishwakftw vishwakftw deleted the remove-extra-workspace-queries branch May 29, 2019 17:21
@vishwakftw vishwakftw added the module: performance Issues related to performance, either of kernel code or framework glue label Aug 3, 2019