
Set Batchnorm weight scalar initialization to unit (not random uniform) #12259

@cstein06

Description


🐛 Feature suggestion

PyTorch batch normalization currently initializes its scale (weight) parameter with random uniform values (init.uniform_(self.weight)). This initialization has no theoretical basis, whereas a fixed value of 1 does make sense.

Part of the usefulness of BatchNorm is that it initializes the network easily, with inputs to hidden units normalized to zero mean and unit variance (original paper: https://arxiv.org/abs/1502.03167), and this is also how recent studies use it (e.g. https://arxiv.org/abs/1706.02677).
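
As a quick illustration of why initializing the scale to 1 matters: with weight = 1 and bias = 0, a BatchNorm layer's output at initialization is just the normalized activation (roughly zero mean and unit variance per channel), while a random-uniform weight rescales each channel by an arbitrary factor. A minimal sketch (layer size and input statistics are arbitrary, chosen only for the example):

import torch
import torch.nn as nn

torch.manual_seed(0)
x = torch.randn(256, 8) * 3.0 + 5.0        # input with arbitrary mean/variance

bn = nn.BatchNorm1d(8)
nn.init.ones_(bn.weight)                   # the proposed initialization
nn.init.zeros_(bn.bias)

y = bn(x)                                  # training mode: uses batch statistics
print(y.mean(dim=0))                       # approximately 0 per channel
print(y.std(dim=0, unbiased=False))        # approximately 1 per channel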

Proposed change

From
init.uniform_(self.weight)

to
init.ones_(self.weight)
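
Until the default changes, a possible workaround (a hypothetical helper, not an existing PyTorch API) is to re-initialize the affine parameters after building the model:

import torch.nn as nn

def init_batchnorm_to_ones(module):
    # Hypothetical helper: set BatchNorm scale to 1 and shift to 0,
    # mirroring the proposed default initialization.
    if isinstance(module, (nn.BatchNorm1d, nn.BatchNorm2d, nn.BatchNorm3d)) and module.affine:
        nn.init.ones_(module.weight)
        nn.init.zeros_(module.bias)

model = nn.Sequential(nn.Linear(16, 32), nn.BatchNorm1d(32), nn.ReLU())
model.apply(init_batchnorm_to_ones)        # applies the helper to every submodule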
