
Conversation

@zuoxingdong (Contributor) commented Apr 26, 2019

Fix #18254: numerical instability of SigmoidTransform

@pytorchbot added the module: distributions (Related to torch.distributions) label on Apr 26, 2019
@zou3519 (Contributor) commented Apr 26, 2019

Needs a test

@zou3519 added the triaged (This issue has been looked at by a team member, and triaged and prioritized into an appropriate module) label on Apr 26, 2019
@zou3519 (Contributor) commented Apr 26, 2019

cc @fritzo @neerajprad

@neerajprad (Contributor) left a comment:

We should also add a test to test_distributions.py::TestTransforms to verify this behavior around the boundary.
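(For reference, a minimal sketch of what such a boundary test might look like; the test name and exact assertions here are hypothetical and not the test eventually added to test_distributions.py:)

    import torch
    from torch.distributions.transforms import SigmoidTransform

    def test_sigmoid_transform_boundary():
        # Large-magnitude inputs push sigmoid(x) to the 0/1 boundary in float32.
        t = SigmoidTransform()
        x = torch.tensor([-100.0, -20.0, 0.0, 20.0, 100.0])
        y = t(x)
        # With a stable formulation, the log-det-Jacobian should stay finite
        # even where sigmoid(x) has saturated to exactly 0 or 1.
        ldj = t.log_abs_det_jacobian(x, y)
        assert torch.isfinite(ldj).all()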

    def log_abs_det_jacobian(self, x, y):
-       return -(y.reciprocal() + (1 - y).reciprocal()).log()
+       return -torch.log(1e-6 + y.reciprocal() + (1 - y).reciprocal())
Contributor:

This will not address the case when y=1., where we will still see infinity. How about we clamp this using

clamped = torch.clamp(y.reciprocal() + (1 - y).reciprocal(), max=torch.finfo(y.dtype).max)
return -clamped.log()

@zuoxingdong (Contributor, Author) commented Apr 27, 2019:

@neerajprad It looks good! I've tried testing it, and it seems that when the std of the Normal distribution is large, there is still a possibility of getting inf, e.g.

d = TransformedDistribution(Normal(torch.tensor(0.0), torch.tensor(100.0)), [StableSigmoidTransform()])
x = d.sample()
print(x)
d.log_prob(x)
>>> tensor(1.)
>>> tensor(-inf)

@neerajprad (Contributor) commented Apr 27, 2019:

Yes, that would just fix the jacobian term, but in TransformedDistribution.log_prob we still evaluate base_dist.log_prob on the inverse-transformed sample, which is inf when the sample is exactly 1. I think what you really want is something like a ClippedSigmoidTransform which differs from SigmoidTransform in that it clips the output of _call to lie in (0, 1):

    def _call(self, x):
        finfo = torch.finfo(x.dtype)
        clamped = torch.clamp(torch.sigmoid(x), min=finfo.tiny, max=1. - finfo.eps)
        return clamped

Then you shouldn't see any infs in your example above.
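(For concreteness, a rough sketch of such a subclass — hypothetical, not part of this PR; it only overrides _call to clamp the forward output:)

    import torch
    from torch.distributions import Normal, TransformedDistribution
    from torch.distributions.transforms import SigmoidTransform

    class ClippedSigmoidTransform(SigmoidTransform):
        """SigmoidTransform whose forward output is clamped strictly inside (0, 1)."""

        def _call(self, x):
            finfo = torch.finfo(x.dtype)
            return torch.clamp(torch.sigmoid(x), min=finfo.tiny, max=1. - finfo.eps)

    # With the clamp, the sample can no longer be exactly 0 or 1, so neither the
    # jacobian term nor base_dist.log_prob of the inverted sample blows up.
    d = TransformedDistribution(Normal(torch.tensor(0.0), torch.tensor(100.0)),
                                [ClippedSigmoidTransform()])
    x = d.sample()
    print(x, d.log_prob(x))  # both should be finite now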

Contributor:

@alicanb, @fritzo - Do you think this is something we should do by default (tensorflow doesn't seem to do this either), or have a separate transform class ClippedSigmoidTransform?

@zuoxingdong (Contributor, Author) commented Apr 27, 2019:

@neerajprad @alicanb @fritzo Thanks a lot for the proposal. I think this is rather important for applications where we evaluate log_prob of a sample drawn from a transformed distribution that is conditioned on the input of a neural network. Without such a fix, it is easy to get NaN/Inf. An example use case is a tanh-transformed policy network in reinforcement learning, e.g. the SAC algorithm.

@zuoxingdong (Contributor, Author):

Or this could be provided via an optional flag in SigmoidTransform, without creating a new class.

Contributor:

I think that’s a good idea - we can have an optional keyword arg, clip=False.

Collaborator:

Sorry, I'm late to the party. @neerajprad, your suggestion sounds great. Do you think users would want to change the clip limits (@zuoxingdong, have you ever had the need)? If so, instead of a bool arg, we could keep the clamping symmetric (min=lim, max=1-lim) and have lim as the argument. If not, then a bool arg sounds great.

    def log_abs_det_jacobian(self, x, y):
-       return -(y.reciprocal() + (1 - y).reciprocal()).log()
+       return -F.softplus(-x) - F.softplus(x)
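(For reference: the two expressions agree mathematically. With y = sigmoid(x), 1/y + 1/(1-y) = 1/(y(1-y)), so the old return value equals log(y(1-y)) = log sigmoid(x) + log sigmoid(-x) = -softplus(-x) - softplus(x). The softplus form works directly on x and never materializes y or 1-y, so it stays finite when sigmoid saturates in floating point. A quick sanity check:)

    import torch
    import torch.nn.functional as F

    x = torch.tensor([-100., 0., 100.])
    y = torch.sigmoid(x)

    old = -(y.reciprocal() + (1 - y).reciprocal()).log()  # -inf at the saturated endpoints
    new = -F.softplus(-x) - F.softplus(x)                 # finite everywhere

    print(old)  # roughly tensor([-inf, -1.3863, -inf])
    print(new)  # roughly tensor([-100.0000, -1.3863, -100.0000])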
Contributor:

Nice! I think this is the best way to handle the numerical stability issue.

Contributor:

This change looks great, but just to confirm, this still doesn't handle the TransformedDistribution case you mentioned?

Contributor:

This is so nice! A great way to address the overflow/underflow while computing y.

@neerajprad (Contributor) commented:

@zuoxingdong - @fehiepsi has a PR out, #20288, which among other things also tackles the issue that you are addressing in this PR. We can get this PR merged first, or otherwise move the discussion to #20288. Feel free to review that PR too.

@fehiepsi (Contributor) commented:

This PR looks great to me! The improved precision of log_abs_det_jacobian is really helpful in preventing the unconstrained parameter x from drifting into very large or very small regions during inference (in those regions, a large-magnitude logdet implies a smaller log-likelihood). Otherwise, the algorithm has to rely on other factors to recognize that moving x into such regions is bad.

@zuoxingdong (Contributor, Author) commented:

Hi @neerajprad @fehiepsi, thank you for your feedback. #20288 looks nice to me. In that case, I propose that this PR contain only the log_abs_det_jacobian numerical-stability fix so it can be merged, and we can discuss the other issues together in #20288. What do you think?

@zuoxingdong (Contributor, Author) commented:

I've removed the clipping part from this PR, as it is already included in #20288; this PR now only contains the fix for log_abs_det_jacobian.

@neerajprad (Contributor) left a comment:

LGTM. Let's discuss the other changes in #20288.

@neerajprad (Contributor) commented:

The build failures are unrelated.

@pytorchbot merge this please

@pytorchbot added the merge-this-please label on May 20, 2019
@fehiepsi (Contributor) commented Jun 4, 2019:

@pytorchbot retest this please

@zou3519 (Contributor) commented Jun 4, 2019:

I think this might need a rebase so that the CI can run

@ezyang (Contributor) commented Jun 6, 2019:

@pytorchbot rebase this please

@facebook-github-bot (Contributor) left a comment:

@ezyang is landing this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

@facebook-github-bot (Contributor) commented:

@ezyang merged this pull request in c5d5d45.


Labels: merge-this-please, module: distributions, open source, triaged

Successfully merging this pull request may close this issue: RelaxedBernoulli produces samples on the boundary with NaN log_prob

8 participants