
add BFloat16 support for LayerNorm CPU #55210

Closed
mingfeima wants to merge 15 commits into gh/mingfeima/17/base from gh/mingfeima/17/head

Conversation

@mingfeima
Collaborator

@mingfeima mingfeima commented Apr 2, 2021

Stack from ghstack:

Differential Revision: D28836793

@facebook-github-bot
Contributor

facebook-github-bot commented Apr 2, 2021

💊 CI failures summary and remediations

As of commit 5ef2baf (more details on the Dr. CI page and at hud.pytorch.org/pr/55210):


  • 2/2 failures possibly* introduced in this PR
    • 1/2 non-scanned failure(s)

1 failure not recognized by patterns:

Job:    GitHub Actions Windows CI (pytorch-win-vs2019-cuda10-cudnn7-py3) / test (2)
Step:   Install Visual Studio 2019 toolchain
Action: 🔁 rerun

ci.pytorch.org: 1 failed



This comment was automatically generated by Dr. CI.

mingfeima added a commit that referenced this pull request Apr 2, 2021
ghstack-source-id: f26d56d
Pull Request resolved: #55210
@mingfeima
Collaborator Author

Since this PR is not related to the parallelization feature, only single-core performance is tested.
NB: LayerNorm on CPU did not previously support BFloat16; the "before" numbers refer to a naive implementation enabled as follows:

-  AT_DISPATCH_FLOATING_TYPES(X.scalar_type(), "LayerNormKernelImpl", [&]() {
+  AT_DISPATCH_FLOATING_TYPES_AND(kBFloat16, X.scalar_type(), "LayerNormKernelImpl", [&]() {
  • performance update on an AVX-512 machine: Xeon(R) Gold 6248 CPU @ 2.50GHz
    before: LayerNorm 32x128x1024: fp32: 2.806 ms; bf16: 9.901 ms
    after:  LayerNorm 32x128x1024: fp32: 2.813 ms; bf16: 2.306 ms
  • performance update on an AVX2 machine: Xeon(R) CPU E5-2680 v3 @ 2.50GHz
    before: LayerNorm 32x128x1024: fp32: 5.286 ms; bf16: 15.186 ms
    after:  LayerNorm 32x128x1024: fp32: 5.258 ms; bf16: 3.469 ms

mingfeima added a commit to mingfeima/pytorch that referenced this pull request Apr 28, 2021
ghstack-source-id: 88be540
Pull Request resolved: pytorch#55210
dgl-intel pushed a commit to dgl-intel/pytorch that referenced this pull request May 14, 2021
ghstack-source-id: 18d4100
Pull Request resolved: pytorch#55210
@VitalyFedyunin
Contributor

@VitalyFedyunin has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

@mingfeima
Collaborator Author

Rebased and cleared the test-case failures from test_ops.py.

@VitalyFedyunin
Contributor

@VitalyFedyunin has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

@VitalyFedyunin
Contributor

Please rebase

@mingfeima
Collaborator Author

@VitalyFedyunin rebased, please check!

@VitalyFedyunin
Contributor

@VitalyFedyunin has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

@facebook-github-bot
Contributor

@VitalyFedyunin merged this pull request in 652d911.


4 participants