Fix the slowness of mvn's log_prob #17294

fehiepsi · 2019-02-20T05:54:59Z

This PR addresses the slowness of MVN's log_prob as reported in #17206.

@t-vi I find it complicated to handle permutation dimensions if we squeeze singleton dimensions of bL, so I leave it as-is and keep the old approach. What do you think?

t-vi

Seems good. Thank you!

t-vi · 2019-02-20T09:07:39Z

torch/distributions/multivariate_normal.py

+
+    bx_batch_shape = bx.shape[:-1]
+    # Assume that bL.shape = (i, 1, n, n), bx.shape = (..., i, j, n),
+    # we are going to make bx have shape (..., i, 1, n) to apply _batch_trtrs_lower


Is this description still accurate?

t-vi · 2019-02-20T09:08:56Z

torch/distributions/multivariate_normal.py

+    # Reshape bx with the shape (..., 1, i, j, 1, n)
+    bx_new_shape = bx.shape[:outer_batch_dims]
+    for (sL, sx) in zip(bL.shape[:-2], bx.shape[outer_batch_dims:-1]):
+        bx_new_shape += (sx // sL, sL)


Clever with the two dims!

I'm not entirely convinced that amending the broadcasting semantics is a good idea, though, unless you have a specific use case. People will start to depend on it in obscure fashions and when we get a batch triangular solver, you won't be able to replace this code.

Here (+ the reshape) will cause stride 0 dimensions of L to be expanded, but I guess we're not too concerned about people having used expand beforehand.

I think that when we have batch version of triangular solver, replacing the method batch_trtrs_lower might be enough. If batch triangular solver also handles broadcasting, then we can remove these reshape+permute mechanism.

Here (+ the reshape) will cause stride 0 dimensions of L to be expanded, but I guess we're not too concerned about people having used expand beforehand.

Yeah, agree. In mvn, the math involving scale_tril mostly depends on unbroadcasted (unexpaned) version. So to get a better performance, users should not expand scale_tril/covariance_matrix before creating mvn distribution.

facebook-github-bot

@ezyang is landing this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

fehiepsi added 2 commits February 20, 2019 00:43

fix the slowness of mvn's log_prob

e3d5a18

remove unnecessary usage of broadcast_tensor

42e272c

t-vi reviewed Feb 20, 2019

View reviewed changes

fix the comment in batch_mahalanobis

74789d9

facebook-github-bot reviewed Feb 20, 2019

View reviewed changes

facebook-github-bot closed this in de81a27 Feb 21, 2019

t-vi mentioned this pull request Feb 22, 2019

multivariate_normal.log_prob is slow #17206

Closed

ezyang added open source merged labels Jun 24, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Fix the slowness of mvn's log_prob #17294

Fix the slowness of mvn's log_prob #17294

Uh oh!

fehiepsi commented Feb 20, 2019

Uh oh!

t-vi left a comment

Uh oh!

t-vi Feb 20, 2019

Uh oh!

t-vi Feb 20, 2019

Uh oh!

fehiepsi Feb 20, 2019

Uh oh!

facebook-github-bot left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Fix the slowness of mvn's log_prob #17294

Fix the slowness of mvn's log_prob #17294

Uh oh!

Conversation

fehiepsi commented Feb 20, 2019

Uh oh!

t-vi left a comment

Choose a reason for hiding this comment

Uh oh!

t-vi Feb 20, 2019

Choose a reason for hiding this comment

Uh oh!

t-vi Feb 20, 2019

Choose a reason for hiding this comment

Uh oh!

fehiepsi Feb 20, 2019

Choose a reason for hiding this comment

Uh oh!

facebook-github-bot left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants