add mkldnn batch_norm backward #37147
Conversation
albanD left a comment:
Thanks for the PR!
Just some comments on the test and missing checks.
ideep::batch_normalization_backward::compute(
    x, m, v, grady, w, gradx, gradw, gradb, eps);

if (weight.is_mkldnn()) {
There seem to be a lot of assumptions about the input types here. Can we have the corresponding checks for both the forward and backward functions?
Now it just uses the second path.
    affine=affine,
    track_running_stats=track_running_stats).float().train(train)
if (train or not track_running_stats):
    mkldnn_bn = copy.deepcopy(bn)
Could you explain why in this case you don't send the module to mkldnn?
For the training case, the module's parameters are always dense tensors, so there is no need to call mkldnn_utils.to_mkldnn.
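A minimal sketch of that distinction (not the actual test code; `torch.utils.mkldnn.to_mkldnn` is assumed as the conversion helper, and `train` is a hypothetical flag mirroring the test's parameter):

```python
import copy
import torch
import torch.utils.mkldnn as mkldnn_utils

bn = torch.nn.BatchNorm2d(3).float()
train = True  # hypothetical flag mirroring the test's `train` parameter

if train:
    # Training path: parameters stay dense, so a plain deepcopy is enough.
    mkldnn_bn = copy.deepcopy(bn)
else:
    # Inference path: convert the module so its parameters become MKLDNN tensors.
    mkldnn_bn = mkldnn_utils.to_mkldnn(copy.deepcopy(bn).eval())
```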
Oh, ok. But why is track_running_stats here?
test/test_mkldnn.py (outdated)
loss2.backward()
self.assertEqual(x1.grad, x2.grad.to_dense())
np.testing.assert_allclose(
    bn.weight.grad, mkldnn_bn.weight.grad, rtol=1e-3, atol=1e-3)
Why is the tolerance so high here, given that you assert that y1 and y2 are exactly equal below?
diff_weight is computed as a sum over MB*H*W of diff_dst[i] * x_normalized[i]. The summation order of floating-point numbers can change the result: MKLDNN splits the whole job into pieces for multi-threading, so its summation order can differ from the native path's. I also tested the CUDA case, which has the same problem. For y1 and y2 there are only element-wise operations, so there is no big difference between the MKLDNN and native paths.
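As a toy illustration of the ordering effect (not code from this PR): summing the same float32 values sequentially versus in chunks, the way a multi-threaded reduction would, gives slightly different results.

```python
import numpy as np

rng = np.random.default_rng(0)
vals = rng.standard_normal(100000).astype(np.float32)

# Sequential left-to-right accumulation in float32.
seq = np.float32(0.0)
for v in vals:
    seq = seq + v

# Chunked reduction: per-chunk partial sums combined at the end,
# mimicking a different (multi-threaded) summation order.
chunked = np.float32(sum(chunk.sum() for chunk in np.array_split(vals, 8)))

print(seq, chunked)  # typically differ in the last few bits
```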
But given that you work with small Tensors and the only problem is ordering, a tolerance of 1e-5 should be enough, no?
ghstack-source-id: a122475
Pull Request resolved: pytorch#37147
Stack from ghstack:
Differential Revision: D22440965