
Gradient of gradient explodes (NaN) when training WGAN-GP on MNIST #2534

@MeJn

Description

I was training a WGAN-GP to generate handwritten digit images on the MNIST dataset. Thanks to caogang's code, I modified his network structure and applied a DCGAN architecture to my WGAN-GP.
During training, the gradients suddenly exploded to NaN. The explosion occurred right after the backward pass through the gradient penalty loss.

Before the explosion, the gradients and parameters were always normal and showed no tendency to gradually increase, so I suspect the problem lies in computing the gradient of the gradient (the double backward required by the gradient penalty).
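For reference, below is a minimal sketch of how the gradient penalty's double backward is typically computed in PyTorch. The function name, the `lambda_gp` value, and the `eps` stabilization are assumptions for illustration, not taken from the issue's actual code. A common source of NaN here is the derivative of the gradient norm when the norm is exactly zero, which the small `eps` inside the square root guards against.

```python
import torch
from torch import autograd

def gradient_penalty(critic, real, fake, device="cpu", lambda_gp=10.0, eps=1e-12):
    """Hypothetical WGAN-GP gradient-penalty sketch (not the exact code from this issue)."""
    batch_size = real.size(0)

    # Random interpolation between real and fake samples.
    alpha = torch.rand(batch_size, 1, 1, 1, device=device)
    interpolates = (alpha * real + (1 - alpha) * fake).requires_grad_(True)

    critic_out = critic(interpolates)

    # First backward: gradients of the critic output w.r.t. the interpolates.
    # create_graph=True keeps the graph so the penalty itself can be backpropagated,
    # i.e. the "gradient of the gradient" mentioned above.
    grads = autograd.grad(
        outputs=critic_out,
        inputs=interpolates,
        grad_outputs=torch.ones_like(critic_out),
        create_graph=True,
        retain_graph=True,
    )[0]

    grads = grads.reshape(batch_size, -1)
    # grads.norm(2, dim=1) has an undefined (NaN) derivative at norm == 0;
    # adding eps inside the sqrt keeps the double backward finite.
    grad_norm = torch.sqrt(torch.sum(grads ** 2, dim=1) + eps)
    return lambda_gp * ((grad_norm - 1) ** 2).mean()
```

If the NaN appears only occasionally and without any gradual growth beforehand, checking whether the unstabilized norm (or a division by it) ever hits zero is a reasonable first step under these assumptions.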

Metadata

Labels

todo: Not as important as medium- or high-priority tasks, but we will work on these.
