
Conversation

@iotamudelta
Contributor

@iotamudelta iotamudelta commented Oct 10, 2019

This adds backend support for GEMM-style matrix multiplications with data and output in bf16 to PyTorch on ROCm (i.e., bgemm).

It also enables the operators that depend on bgemm.

With this change, bf16 matrices can be multiplied on the GPU on ROCm.
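
For illustration, a minimal usage sketch (the shapes are arbitrary; on ROCm builds the GPU is still addressed through the "cuda" device name):

    import torch

    # Create bf16 operands on the GPU (ROCm exposes the device as "cuda").
    a = torch.randn(64, 128, device="cuda").to(torch.bfloat16)
    b = torch.randn(128, 32, device="cuda").to(torch.bfloat16)

    # With this change, the product is computed on the GPU via bgemm.
    c = torch.matmul(a, b)
    print(c.dtype, c.shape)  # torch.bfloat16 torch.Size([64, 32])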

@iotamudelta iotamudelta added the module: rocm (AMD GPU support for PyTorch) and open source labels Oct 10, 2019
@iotamudelta iotamudelta requested a review from bddppq October 10, 2019 22:28
@pytorchbot pytorchbot added the module: cublas (problem related to cuBLAS support), module: cuda (related to torch.cuda and CUDA support in general), module: internals (related to internal abstractions in c10 and ATen), and module: operators labels Oct 10, 2019
@cpuhrsch cpuhrsch added the triaged label (this issue has been looked at by a team member, and triaged and prioritized into an appropriate module) Oct 11, 2019
@iotamudelta
Contributor Author

@gottbrath this is the BLAS bringup for bf16

@iotamudelta iotamudelta requested a review from ezyang November 15, 2019 21:44
@ezyang
Contributor

ezyang commented Nov 18, 2019

This patch is described as ROCm-specific, but the contents of the diff suggest to me that it is turning on bfloat16 on regular CUDA as well. What's going on here?

Additionally, I didn't see any test modifications.

@iotamudelta
Contributor Author

@ezyang thanks for looking at it! I don't think we can discriminate between CUDA and ROCm in the Declarations, can we?

Test cases: that's a good point. We've tested with actual scripts, but let me see if we can also enable some unit tests here.

@ezyang
Contributor

ezyang commented Nov 19, 2019

I guess what I'm mostly wondering is: does this PR also accidentally add support for CUDA at the same time? Or will the CUDA paths just error?

@rohithkrn
Contributor

@ezyang CUDA paths will just error if the bfloat16 type is used.
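
For illustration, a sketch of that expected behavior on a plain CUDA build (the exact exception type and message here are assumptions, not taken from this PR):

    import torch

    a = torch.randn(4, 4, device="cuda").to(torch.bfloat16)
    b = torch.randn(4, 4, device="cuda").to(torch.bfloat16)

    try:
        torch.matmul(a, b)
    except RuntimeError as e:
        # Without a bf16 gemm backend, dispatch is expected to raise
        # rather than silently compute a wrong result.
        print("bf16 gemm rejected on this build:", e)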

@ezyang
Contributor

ezyang commented Nov 20, 2019

Thanks. This looks good to go; it just needs the merge conflict resolved.

Contributor

@bddppq bddppq left a comment

Could you add some tests? Maybe refer to #27259 to see how to add bfloat16 tests.

@bddppq bddppq requested a review from izdeby November 21, 2019 07:10
THCTensor_(freeCopyTo)(state, cr, r_);
}
-#elif defined(THC_REAL_IS_HALF)
+#elif defined(THC_REAL_IS_HALF)  || defined(THC_REAL_IS_BFLOAT16)
Contributor

minor: double space

@izdeby
Contributor

izdeby commented Nov 21, 2019

Can you please add some description of what these changes are for and why? Also, please add tests.

@iotamudelta
Contributor Author

@bddppq @izdeby tests are incoming.

@izdeby I thought the title was pretty self-explanatory, but I've added more words to the description now. OK?

@rohithkrn
Contributor

@bddppq @izdeby @ezyang the tests for GEMMs are under tensor_op_tests. Added a bfloat16_precision argument (defaults to 1e-5) to the argument list and enabled bfloat16 tests for GEMM ops on ROCm.
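
A minimal sketch of what such a check does (illustrative only, not the actual tensor_op_tests harness; the helper name check_bf16_gemm and the loosened 1e-2 tolerance for bf16 are assumptions):

    import torch

    def check_bf16_gemm(m=32, n=32, k=64, bfloat16_precision=1e-2):
        # Reference product in float32.
        a = torch.randn(m, k, device="cuda")
        b = torch.randn(k, n, device="cuda")
        ref = torch.matmul(a, b)
        # Same product through the bf16 path, upcast for comparison.
        out = torch.matmul(a.to(torch.bfloat16), b.to(torch.bfloat16)).float()
        # bf16 keeps only ~8 mantissa bits, so the tolerance must be far
        # looser than the 1e-5 default used for full-precision dtypes.
        assert torch.allclose(out, ref, rtol=bfloat16_precision,
                              atol=bfloat16_precision)

    check_bf16_gemm()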

Contributor

@facebook-github-bot facebook-github-bot left a comment

@bddppq has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.
