Add torch.nn.GELU for GELU activation #28944
Conversation
This pull request was exported from Phabricator. Differential Revision: D18240946

Force-pushed from 253e4b3 to 1450498

This pull request was exported from Phabricator. Differential Revision: D18240946
Link to #28947
Force-pushed from 1450498 to 1a0bd8b

This pull request was exported from Phabricator. Differential Revision: D18240946

Force-pushed from 1a0bd8b to 20d2c33

This pull request was exported from Phabricator. Differential Revision: D18240946

Force-pushed from 20d2c33 to 93c9246

This pull request was exported from Phabricator. Differential Revision: D18240946

Force-pushed from 93c9246 to dc2112f

This pull request was exported from Phabricator. Differential Revision: D18240946

CircleCI build failures summary, as of commit dc2112f: here are the reasons each build failed. This comment was automatically generated by Dr. CI. Please report bugs/suggestions on the GitHub issue tracker.

Force-pushed from dc2112f to ae1e7c8

This pull request was exported from Phabricator. Differential Revision: D18240946
I'm happy to see these lines indicating that another kernel might get implemented, too, but I wonder how you are going to expose this in the interface. I've had great results with that implementation as a custom module. Will it be a separate function/module, or rather a parameter setting?
Maybe we will add it as a parameter to gelu later. The reason we didn't do that now is that the approximation does not actually perform better than the current implementation based on MKL functions. In my testing, the tanh approximation's performance relies on the performance of tanh itself. Eigen, which is the backend of TensorFlow, provides a fast approximation of tanh. Without such a fast tanh implementation, I don't think it is necessary to add the approximation for gelu here.
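For readers following along, here is a minimal sketch of the two formulations being discussed: the exact GELU and the tanh approximation from the original GELU paper. The function names are just for illustration and are not part of this PR.

```python
import math
import torch

def gelu_exact(x):
    # Exact GELU: x * Phi(x), with the Gaussian CDF written via erf.
    return 0.5 * x * (1.0 + torch.erf(x / math.sqrt(2.0)))

def gelu_tanh_approx(x):
    # Tanh approximation from Hendrycks & Gimpel; its speed depends largely
    # on how fast the underlying tanh kernel is, which is the point above.
    return 0.5 * x * (1.0 + torch.tanh(
        math.sqrt(2.0 / math.pi) * (x + 0.044715 * x.pow(3))))

x = torch.randn(8)
# The two agree to a few decimal places but are not identical.
print(torch.allclose(gelu_exact(x), gelu_tanh_approx(x), atol=1e-3))
```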
Force-pushed from 53c2d01 to f303f02

This pull request was exported from Phabricator. Differential Revision: D18240946

Force-pushed from f303f02 to 74b6d8a

This pull request was exported from Phabricator. Differential Revision: D18240946
1 similar comment
This pull request was exported from Phabricator. Differential Revision: D18240946

Force-pushed from 74b6d8a to c97bc85

Force-pushed from c97bc85 to 49f826b

This pull request was exported from Phabricator. Differential Revision: D18240946
Summary:
Pull Request resolved: pytorch#28944

Add torch.nn.GELU for GELU activation

Test Plan: buck test mode/dev-nosan //caffe2/test:nn -- "GELU"

Reviewed By: hl475, houseroad

Differential Revision: D18240946

fbshipit-source-id: 708c41d2f328bdf137fb8b0c533a977725daab41
Force-pushed from 49f826b to 5a7e331

This pull request was exported from Phabricator. Differential Revision: D18240946

This pull request has been merged in 2460dce.
Summary:
Pull Request resolved: pytorch/pytorch#28944

Add torch.nn.GELU for GELU activation

Test Plan: buck test mode/dev-nosan //caffe2/test:nn -- "GELU"

Reviewed By: hl475, houseroad

Differential Revision: D18240946

fbshipit-source-id: 6284b30def9bd4c12bf7fb2ed08b1b2f0310bb78
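For anyone landing here from search, a minimal usage sketch of the module added by this PR, assuming a PyTorch build that includes it (1.4.0 or later):

```python
import torch
import torch.nn as nn

# Module form added by this PR.
act = nn.GELU()
x = torch.randn(2, 3)
y = act(x)

# Equivalent functional form.
y_fn = torch.nn.functional.gelu(x)
print(torch.allclose(y, y_fn))  # True
```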
Is this released in torch 1.3.1? It doesn't seem to be included in that version.
@calclavia This PR is not in PyTorch 1.3.1. It will be in our upcoming release 1.4.0. |
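A quick way to check whether an installed build already ships the new module (a hypothetical snippet for illustration, not part of the PR):

```python
import torch

print(torch.__version__)          # e.g. '1.4.0'
print(hasattr(torch.nn, "GELU"))  # True once this PR is in the installed release
```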