
Conversation

@XiaobingSuper
Collaborator

@XiaobingSuper XiaobingSuper commented Apr 7, 2020

Stack from ghstack:

Differential Revision: D22440969

@XiaobingSuper XiaobingSuper requested review from VitalyFedyunin and removed request for albanD and apaszke April 7, 2020 03:14
@dr-ci

dr-ci bot commented Apr 7, 2020

💊 CI failures summary and remediations

As of commit 81fc4cc (more details on the Dr. CI page):


💚 💚 Looks good so far! There are no failures yet. 💚 💚



@XiaobingSuper
Collaborator Author

@VitalyFedyunin, those PRs #20567 #20570 #20571 #20572 are too old; I will re-pull them. Thanks!

@XiaobingSuper
Collaborator Author

XiaobingSuper commented Apr 7, 2020

@gchanan gchanan added the triaged (This issue has been looked at by a team member, and triaged and prioritized into an appropriate module) label Apr 8, 2020
@XiaobingSuper
Collaborator Author

@VitalyFedyunin , could you help review this code?

@VitalyFedyunin
Contributor

Are we fine with BC-incompatible changes in mkldnn_convolution_backward_weights operator?

What is the purpose of ideep update?

@XiaobingSuper
Collaborator Author

XiaobingSuper commented Apr 26, 2020

The purposes of the ideep update:

  1. support conv3d forward and backward
  2. fix a group convolution backward performance degradation
  3. fix a batchnorm backward issue when the input is nchw4c

Why did I change mkldnn_convolution_backward_weights? For the bf16 backward path, the weight can be an fp32 tensor (the input is just converted to a float32 MKLDNN tensor) or a bf16 tensor (the input is converted to a bf16 MKLDNN tensor by calling model.to_mkldnn(torch.bfloat16)), so I need to get the data_type from the weight parameter.
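The data-type selection described above can be sketched as follows. This is a minimal illustration only; the function name `pick_data_type` is hypothetical, and the real dispatch lives in ATen's mkldnn Conv.cpp in C++:

```python
def pick_data_type(weight_dtype: str) -> str:
    # The backward-weights kernel must produce the weight gradient in the
    # same data type as the weight: fp32 when the module was converted with
    # plain to_mkldnn(), bf16 when converted with to_mkldnn(torch.bfloat16).
    mapping = {"float32": "f32", "bfloat16": "bf16"}
    if weight_dtype not in mapping:
        raise ValueError(f"unsupported weight dtype: {weight_dtype}")
    return mapping[weight_dtype]
```

This is why the operator's schema had to change: without the weight argument, the backward-weights kernel cannot tell which of the two data types to use.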

@XiaobingSuper XiaobingSuper requested a review from albanD April 26, 2020 02:12
@VitalyFedyunin
Contributor

Test errors seem to be mkldnn-related:

Traceback (most recent call last):
  File "test_mkldnn.py", line 149, in test_conv2d
    rtol=1e-5)
  File "C:\Users\circleci\project\build\win_tmp\build\torch\testing\_internal\common_utils.py", line 928, in assertEqual
    assertTensorsEqual(x, y)
  File "C:\Users\circleci\project\build\win_tmp\build\torch\testing\_internal\common_utils.py", line 890, in assertTensorsEqual
    torch.testing.assert_allclose(a, b, atol=atol, rtol=rtol, equal_nan=True, msg=message)
  File "C:\Users\circleci\project\build\win_tmp\build\torch\testing\__init__.py", line 60, in assert_allclose
    raise AssertionError(msg)
AssertionError: Not within tolerance rtol=1e-05 atol=1e-05 at input[0, 1, 1, 2] (-1572.8182373046875 vs. -1572.835693359375) and 0 other locations (5.00%)
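For reference, the tolerance check that fails here follows the usual allclose rule. This is a simplified sketch of that rule, not torch's actual implementation:

```python
def within_tolerance(a: float, b: float, rtol: float = 1e-5, atol: float = 1e-5) -> bool:
    # allclose-style criterion: |a - b| <= atol + rtol * |b|
    return abs(a - b) <= atol + rtol * abs(b)
```

The reported pair differs by about 0.017, while the allowed slack at this magnitude (atol + rtol * 1572.8) is only about 0.016, hence the failure.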

@VitalyFedyunin
Contributor

Please whitelist the changed operator in

mkldnn_conv2d = mkldnn_utils.to_mkldnn(copy.deepcopy(conv2d))
for train in [True, False]:
    for bias in [True, False]:
        conv2d = torch.nn.Conv2d(in_channels=C,
Contributor


This looks like an indentation problem to me.

Contributor

@VitalyFedyunin VitalyFedyunin left a comment


Overall looks good; needs some test alterations.

('aten::__or__', datetime.date(2020, 6, 30)),
('aten::__xor__', datetime.date(2020, 6, 30)),
('aten::split', datetime.date(2020, 6, 30)),
('aten::mkldnn_convolution_backward_weights', datetime.date(2020, 6, 30)),
Contributor


Sorry, you need to update this date before landing.
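For context, entries in this allowlist pair an operator name with an expiry date, and a BC-breaking schema change is only tolerated until that date passes. A minimal sketch of the idea, using a hypothetical helper rather than PyTorch's actual BC-check script:

```python
import datetime

# Allowlist entry taken from the quoted diff above.
ALLOW_LIST = [
    ("aten::mkldnn_convolution_backward_weights", datetime.date(2020, 6, 30)),
]

def is_change_allowed(op: str, today: datetime.date) -> bool:
    # A schema change for `op` is accepted only while its entry has not
    # expired, which is why the reviewer asks for the date to be bumped
    # before landing.
    return any(name == op and today <= expiry for name, expiry in ALLOW_LIST)
```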

Collaborator Author


changed.

@VitalyFedyunin
Contributor

Land fails with:

Command failed with exit code 1.

stderr: caffe2/aten/src/ATen/native/mkldnn/Conv.cpp:159:5: error: no matching function for call to 'compute'
    ideep::convolution_backward_weights::compute(
    ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
 [removed]/build/ideep/include/ideep/operators/conv.hpp:476:15: note: candidate function not viable: no known conversion from 'dnnl::memory::data_type' to 'ideep::algorithm' (aka 'dnnl::algorithm') for 11th argument
  static void compute(const tensor& src,
              ^
[removed]/build/ideep/include/ideep/operators/conv.hpp:493:15: note: candidate function not viable: no known conversion from 'ideep::tensor' to 'const ideep::dims' (aka 'const vector<long>') for 5th argument
  static void compute(const tensor& src,
              ^
caffe2/aten/src/ATen/native/mkldnn/Conv.cpp:172:5: error: no matching function for call to 'compute'
    ideep::convolution_backward_weights::compute(
    ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
[removed]/build/ideep/include/ideep/operators/conv.hpp:493:15: note: candidate function not viable: no known conversion from 'dnnl::memory::data_type' to 'ideep::algorithm' (aka 'dnnl::algorithm') for 10th argument
  static void compute(const tensor& src,
              ^
[removed]/build/ideep/include/ideep/operators/conv.hpp:476:15: note: candidate function not viable: no known conversion from 'std::vector<long>' to 'ideep::tensor &' for 5th argument
  static void compute(const tensor& src,
              ^
2 errors generated.

@XiaobingSuper
Collaborator Author

@VitalyFedyunin, this error means the corresponding compute overload could not be found. I can reproduce it by checking out an old ideep commit (ca7b718); could you check which ideep commit id the failed build used? PyTorch master uses 938cc68 now. Thanks!

qiuxin2012 pushed a commit to qiuxin2012/pytorch that referenced this pull request Jul 27, 2020
@ZhuJewel

@VitalyFedyunin, please let us know if you need any help with the ideep-related errors. Thanks.

@facebook-github-bot
Contributor

Hi @XiaobingSuper!

Thank you for your pull request. We require contributors to sign our Contributor License Agreement, and yours needs attention.

You currently have a record in our system, but we do not have a signature on file.

In order for us to review and merge your code, please sign at https://code.facebook.com/cla. If you are contributing on behalf of someone else (eg your employer), the individual CLA may not be sufficient and your employer may need to sign the corporate CLA.

If you have received this in error or have any questions, please contact us at [email protected]. Thanks!

@VitalyFedyunin
Contributor

Hi! I'm ready to land it. Can you please rebase to make sure we avoid merge conflicts?

@jgong5
Collaborator

jgong5 commented Dec 12, 2020

Hi! I'm ready to land it. Can you please rebase to make sure we avoid merge conflicts.

@VitalyFedyunin Thank you. We created a new PR #48994 to ease the rebase. I copied you there. This old one is supposed to be closed. Thanks.

@VitalyFedyunin
Contributor

Are you planning to port the remaining 4 PRs?

@Jianhui-Li

Are you planning to port the remaining 4 PRs?

I think so. @XiaobingSuper

@facebook-github-bot facebook-github-bot deleted the gh/xiaobingsuper/12/head branch January 15, 2021 15:17

Labels

cla signed, open source, triaged (This issue has been looked at by a team member, and triaged and prioritized into an appropriate module)
