
Conversation

@xiaomengy (Contributor)

Summary: Add gelu gradient for pytorch

Differential Revision: D15589816

@pytorchbot added labels Jun 1, 2019: module: cpu (CPU specific problem, e.g., perf, algorithm), module: cuda (Related to torch.cuda, and CUDA support in general), module: internals (Related to internal abstractions in c10 and ATen), module: nn (Related to torch.nn), module: operators
@soumith (Contributor) left a comment

Please check the inline comment and provide a resolution on what's up with the tolerance adjustment. Once you get clarity on that and verify it's not a bug, do land.

Things reviewed in the diff:

  • MKL and non-MKL implementations match in formula
  • CUDA and CPU implementation match in formula

Things not reviewed in the diff:

  • gradient formula is correct (relying on gradcheck to say it's right; a reference form of the gradient is sketched below)
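
For reference, the formula being matched across the MKL, non-MKL, and CUDA paths is the exact (erf-based) GELU, gelu(x) = x * Φ(x), whose closed-form gradient is Φ(x) + x * φ(x). Below is a minimal sketch of that gradient checked against autograd; it is standard calculus written for illustration, not code taken from this diff:

```python
import math

import torch
import torch.nn.functional as F

def gelu_grad_reference(x):
    # d/dx [x * Phi(x)] = Phi(x) + x * phi(x), where Phi/phi are the
    # standard normal CDF/PDF (exact, erf-based GELU variant).
    cdf = 0.5 * (1.0 + torch.erf(x / math.sqrt(2.0)))
    pdf = torch.exp(-0.5 * x * x) / math.sqrt(2.0 * math.pi)
    return cdf + x * pdf

x = torch.randn(8, dtype=torch.double, requires_grad=True)
F.gelu(x).sum().backward()
# The autograd gradient should agree with the closed form.
assert torch.allclose(x.grad, gelu_grad_reference(x.detach()))
```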

@soumith (Contributor) commented on test/test_nn.py (outdated):

This looks suspicious: gradcheck runs in double precision, so a tolerance of 1e-3 looks really high, and a custom eps is usually not needed. Any idea what is going on? Can you inspect some sample inputs?

@xiaomengy (Contributor, Author) replied:

This came out of testing numerical stability for gradcheck. The custom atol has been removed; the test now uses the default value.
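
With the override gone, the check amounts to something like the sketch below; this is an illustration with a made-up input shape, not the actual test from test/test_nn.py:

```python
import torch
import torch.nn.functional as F
from torch.autograd import gradcheck

# gradcheck runs in double precision; with the custom atol removed,
# the defaults (eps=1e-6, atol=1e-5) apply.
inp = torch.randn(4, 6, dtype=torch.double, requires_grad=True)
assert gradcheck(F.gelu, (inp,))
```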

@pytorchbot added the module: docs label (Related to our documentation, both in docs/ and docblocks) Jun 2, 2019
xiaomengy added 2 commits on June 2, 2019 at 00:15
Summary: Add gelu activation forward on CPU in pytorch

Differential Revision: D15400974

fbshipit-source-id: 1c59104bea69cbe26ab96921e242131890db657e
Summary:
Pull Request resolved: pytorch#21237

Add gelu gradient for pytorch

Reviewed By: zheng-xq

Differential Revision: D15589816

fbshipit-source-id: 2feb4ed779cda1dec3fe03fcfba29861b4a86d12
@xiaomengy deleted the export-D15589816 branch June 2, 2019 16:46
zdevito pushed a commit to zdevito/ATen that referenced this pull request Jun 2, 2019
Summary:
Pull Request resolved: pytorch/pytorch#21237

Add gelu gradient for pytorch

Reviewed By: zheng-xq

Differential Revision: D15589816

fbshipit-source-id: 76fda7c413afed5b6cc3abe3a26c258d393a53ce
@facebook-github-bot (Contributor):

This pull request has been merged in 31c79b7.
