Skip to content

[Feature Request] Swish Activation Function #3169

@isaykatsman

Description

@isaykatsman

Swish (arxiv) is an activation function that has been shown to empirically outperform ReLU and several other popular activation functions on Inception-ResNet-v2 and MobileNet. On models with more layers Swish typically outperforms ReLU. Implementation is simple:

swish

Sigma is just sigmoid.

Worth a PR?

cc @albanD @mruberry

Metadata

Metadata

Labels

enhancementNot as big of a feature, but technically not a bug. Should be easy to fixmodule: nnRelated to torch.nntriagedThis issue has been looked at a team member, and triaged and prioritized into an appropriate module

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions