Skip to content

Conversation

@suo
Copy link
Member

@suo suo commented May 21, 2019

Stack from ghstack:

as title

Differential Revision: D15486511

@pytorchbot pytorchbot added the oncall: jit Add this issue/PR to JIT oncall triage queue label May 21, 2019
suo added 2 commits May 21, 2019 09:49
…flag"

[jit] dropout symbolic_script should respect the training flag

as title

gh-metadata: pytorch pytorch 20760 gh/suo/41/head
…flag"

[jit] dropout symbolic_script should respect the training flag

as title

gh-metadata: pytorch pytorch 20760 gh/suo/41/head
@ailzhang
Copy link
Contributor

is it possible to add a test to prevent regression in the future?

res = mask * input / p1m
p1m = 1.
res = input
mask = torch.zeros(0)
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The definition of backwards when is_training = False is not correct.

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

oops this got merged out somehow

…flag"

[jit] dropout symbolic_script should respect the training flag

as title

gh-metadata: pytorch pytorch 20760 gh/suo/41/head
suo added 2 commits May 23, 2019 16:24
…flag"

[jit] dropout symbolic_script should respect the training flag

as title. This unfortunately means that the forward for dropout doesn't
fuse completely anymore, but the "important" parts are fused, and all
we're adding is the is_training check overhead. The only time we're
doing "extra" stuff is if we 1) require_grad and 2) are not training,
which seems like uncommon things.

gh-metadata: pytorch pytorch 20760 gh/suo/41/head
…flag"

[jit] dropout symbolic_script should respect the training flag

as title. This unfortunately means that the forward for dropout doesn't
fuse completely anymore, but the "important" parts are fused, and all
we're adding is the is_training check overhead. The only time we're
doing "extra" stuff is if we 1) require_grad and 2) are not training,
which seems like uncommon things.

gh-metadata: pytorch pytorch 20760 gh/suo/41/head
@zou3519 zou3519 deleted the gh/suo/41/head branch May 24, 2019 01:19
@facebook-github-bot
Copy link
Contributor

@suo merged this pull request in 62af37a.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

Merged oncall: jit Add this issue/PR to JIT oncall triage queue

Projects

None yet

Development

Successfully merging this pull request may close these issues.

7 participants