reformulate bce_with_logits to not use abs #2195

alykhantejani · 2017-07-24T19:58:04Z

This is a fix for the bug reported here in which binary_cross_entropy_with_logits gives the wrong gradient with input and target are 0.

After some investigation, this is because the gradient of abs at 0 is 0 (see here)

In this PR I have reformulated the numerically stable binary_cross_entropy_with_logits to not use abs (originally I used what tensorflow does, which is use abs).

I guess in general we should think about whether we want the grad of abs(0) to be 0?

fmassa · 2017-07-24T20:12:06Z

About grad(abs(0)) = 0, well, it is not differentiable in x=0, and it's sub-derivative can be anything between -1 and 1.
I had a quick look at what TF does, and I have the impression that the gradient at 0 is defined in the same way as in PyTorch, compare TF abs and PyTorch abs, and the underlying sign function has the same derivative, TF sign and PyTorch sign

soumith · 2017-07-24T22:16:31Z

thanks Aly!

…#2195) * Add support for a symbolic output_shape for broadcast_in_dim in the Python Frontend * Fixed compilation of BroadcastInDimOpRecord with template specialization for defining expand sizes. Added SymbolicSizesRecord. Added python binding for symbolic_sizes(). * Fix up code with some suggestions from Ivan. Changed symbolic_sizes() to tensor_sizes(). * Add some tests and fix up python based Tensor and Scalar printing. * Added testing for tensor_sizes(). Made some minor changes to faciliate string captured definition testing. * Add comments and fix lint issues. * Add an output broadcast test with tensor_sizes(). * Added a test for tensor_size usage when each operand of a binary op has a broadcast. * Fix tensor_sizes() to reflect an expand in extent. Add appropriate test.

- Changed to support new Hipblas 3.0.0 which is part of ROCm 7.0 is done. CP of ROCm@ec0c539 Co-authored-by: Pruthvi Madugundu <[email protected]>

reformulate bce_with_logits to not use abs

ac2db8d

alykhantejani mentioned this pull request Jul 24, 2017

Add numerically stable BCELoss which takes logits as input #1792

Merged

flake8 fixes

f03f369

soumith approved these changes Jul 24, 2017

View reviewed changes

soumith merged commit 112728c into pytorch:master Jul 24, 2017

alykhantejani mentioned this pull request Jul 26, 2017

[bugfix] in bce_with_logits logsumexp calculation #2221

Merged

alykhantejani mentioned this pull request Aug 5, 2017

BCEWithLogitsLoss computes wrong gradient? #2298

Closed

fehiepsi mentioned this pull request May 18, 2019

Fix Binomimal overflow when logits is large #20679

Closed

ezyang added the open source label Jun 24, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

reformulate bce_with_logits to not use abs #2195

reformulate bce_with_logits to not use abs #2195

Uh oh!

alykhantejani commented Jul 24, 2017

Uh oh!

fmassa commented Jul 24, 2017

Uh oh!

soumith commented Jul 24, 2017

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

reformulate bce_with_logits to not use abs #2195

reformulate bce_with_logits to not use abs #2195

Uh oh!

Conversation

alykhantejani commented Jul 24, 2017

Uh oh!

fmassa commented Jul 24, 2017

Uh oh!

soumith commented Jul 24, 2017

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants