Add high order gradient support for activation function #1496
Conversation
@pytorchbot test this please
@pytorchbot test this please
Are there tests for double backprop that could be easily added?
Per-op double backprop tests are unnecessary. We only use first-order Jacobian-vector-product functions to compute grads of any order, so as long as the first-order gradients are correct, everything should be fine (assuming the autograd code is correct, but we have separate tests for that).
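For illustration, a minimal sketch of this idea using the modern `torch.autograd.grad` API (not part of the original thread, and not the Variable-era code this PR modifies): a higher-order gradient is obtained by differentiating the first-order backward pass again with `create_graph=True`.

```python
import torch

# Sketch only: higher-order grads come from differentiating the
# first-order backward pass again, not from per-order formulas.
x = torch.randn(5, dtype=torch.double, requires_grad=True)
y = torch.sigmoid(x).sum()

# First-order gradient; create_graph=True records the backward pass itself.
(grad_x,) = torch.autograd.grad(y, x, create_graph=True)

# Second-order gradient: differentiate the first-order gradient once more.
(grad2_x,) = torch.autograd.grad(grad_x.sum(), x)

# Analytic check: sigmoid' = s(1 - s), sigmoid'' = s(1 - s)(1 - 2s).
s = torch.sigmoid(x)
assert torch.allclose(grad_x, s * (1 - s))
assert torch.allclose(grad2_x, s * (1 - s) * (1 - 2 * s))
```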
That's not true for code that behaves differently when the grad is volatile, as in this PR.
I think we should add a gradgradcheck, just like gradcheck. We don't know out of the box whether new-style functions have been written correctly for grad of grad (for example, a user may have rewrapped a Variable somewhere and thought it was okay).
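A hedged sketch of what such a check could look like, assuming the `gradgradcheck` utility as it exists in current `torch.autograd` (not the codebase as it was at the time of this PR):

```python
import torch
from torch.autograd import gradcheck, gradgradcheck

# Double-precision inputs keep the finite-difference comparison stable.
x = torch.randn(4, 3, dtype=torch.double, requires_grad=True)

# gradcheck compares analytic first-order grads against finite differences;
# gradgradcheck does the same for the gradient of the backward pass, so a
# backward that silently detaches from the graph would fail here.
assert gradcheck(torch.sigmoid, (x,))
assert gradgradcheck(torch.sigmoid, (x,))
```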
I think that instead of computing a full Hessian of each op (these tests would be soooo slooow) we could just add some simple clauses to gradcheck that make sure that there exists a path from …
* master:
  * Add F.normalize (pytorch#1467)
  * Expose custom attributes from C++ functions (pytorch#1430)
  * Add high order gradient support for Sigmoid (pytorch#1496)
* Minor fix for trivial reductions. Co-authored-by: Naoya Maruyama <[email protected]>
[WIP] Add high order gradient support for the sigmoid function, resolving issue #1483
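To illustrate what "high order gradient support" for an activation means, here is a small, hypothetical sketch using today's `torch.autograd.Function` API (the actual PR changes the Variable-era function definitions, not this code): the backward must be written as differentiable tensor ops on tensors still connected to the graph, so autograd can differentiate the backward pass itself.

```python
import torch

class MySigmoid(torch.autograd.Function):
    """Hypothetical sigmoid whose backward is itself differentiable."""

    @staticmethod
    def forward(ctx, x):
        ctx.save_for_backward(x)  # save the input, which keeps its graph connectivity
        return torch.sigmoid(x)

    @staticmethod
    def backward(ctx, grad_output):
        (x,) = ctx.saved_tensors
        s = torch.sigmoid(x)  # recomputed here, so it is tracked when create_graph=True
        # Ordinary differentiable ops, so this backward can be backpropagated through.
        return grad_output * s * (1 - s)

x = torch.randn(3, dtype=torch.double, requires_grad=True)
(g,) = torch.autograd.grad(MySigmoid.apply(x).sum(), x, create_graph=True)
(gg,) = torch.autograd.grad(g.sum(), x)

# Second derivative of sigmoid: s(1 - s)(1 - 2s).
s = torch.sigmoid(x)
assert torch.allclose(gg, s * (1 - s) * (1 - 2 * s))
```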