[MPS] Fix batchnorm for mixed types by malfet · Pull Request #96208 · pytorch/pytorch

malfet · 2023-03-07T18:31:46Z

By up/down casting weights to input types

Extend unittests to support float16 input

Fixes #96113

By up/down casting weights to input types Extend unittests to support float16 input Fixes #96113

pytorch-bot · 2023-03-07T18:31:50Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/96208

📄 Preview Python docs built from this PR
📄 Preview C++ docs built from this PR
❓ Need help or want to give feedback on the CI? Visit the bot commands wiki or our office hours

Note: Links to docs will display an error until the docs builds have been completed.

❌ 1 Failures

As of commit 2cc290c:

NEW FAILURES - The following jobs have failed:

linux-bionic-cuda11.7-py3.10-gcc7-sm86 / test (slow, 2, 2, linux.g5.4xlarge.nvidia.gpu) (gh)

This comment was automatically generated by Dr. CI and updates every 15 minutes.

kulinseth · 2023-03-08T17:34:21Z

aten/src/ATen/native/mps/operations/Normalization.mm

                                                                            secondaryTensor:momentumTensor
                                                                                       name:nil];
-                MPSGraphTensor* scaledRunningMean = [mpsGraph multiplicationWithPrimaryTensor:runningMeanTensor
+                MPSGraphTensor* scaledRunningMean = [mpsGraph multiplicationWithPrimaryTensor:castMPSTensor(mpsGraph, runningMeanTensor, input_mps_dtype)


All these casts shouldn't be necessary. If the initial Ranked placeholder is created with correct data type. We shouldn't be adding any casts here in the code.

kulinseth · 2023-03-08T17:38:41Z

aten/src/ATen/native/mps/operations/Normalization.mm

-                                                                      name:nil];
+          const auto inputTensorType = [inputTensor dataType];
+          MPSGraphTensor* outputTensor = [mpsGraph normalizationWithTensor: inputTensor
+                                                                meanTensor: castMPSTensor(mpsGraph, saveMeanTensor, inputTensorType)


The normalization shouldn't need casts here. The computation graph for normalization should be performed in the type which is requested by the user. Even if it's a pass through, it can add spurious casts which should have been fixed earlier in the Graph to add casts at proper places. My main concern is that it may lead to actual casts in future and will be hidden from the next person modifying the code. I am curious do you need the casts here for this crash ?

…atch norm error

malfet · 2023-03-09T18:10:11Z

As we've discussed, let's split it into a smaller change, and then land a bigger one

Only for forward pass Subset of #96208 Create constant with scalar using `input_mps_dtype` and use `reciprocalWithTensor` instead of `divisionWithPrimaryTensor:1.0 secondaryTensor:` Fixes #96113

Only for forward pass Subset of #96208 Create constant with scalar using `input_mps_dtype` and use `reciprocalWithTensor` instead of `divisionWithPrimaryTensor:1.0 secondaryTensor:` Fixes #96113 Pull Request resolved: #96430 Approved by: https://github.com/kulinseth

Only for forward pass Subset of pytorch/pytorch#96208 Create constant with scalar using `input_mps_dtype` and use `reciprocalWithTensor` instead of `divisionWithPrimaryTensor:1.0 secondaryTensor:` Fixes pytorch/pytorch#96113 Pull Request resolved: pytorch/pytorch#96430 Approved by: https://github.com/kulinseth

Only for forward pass Subset of pytorch#96208 Create constant with scalar using `input_mps_dtype` and use `reciprocalWithTensor` instead of `divisionWithPrimaryTensor:1.0 secondaryTensor:` Fixes pytorch#96113 Pull Request resolved: pytorch#96430 Approved by: https://github.com/kulinseth

Only for forward pass Subset of #96208 Create constant with scalar using `input_mps_dtype` and use `reciprocalWithTensor` instead of `divisionWithPrimaryTensor:1.0 secondaryTensor:` Fixes #96113 Pull Request resolved: #96430 Approved by: https://github.com/kulinseth (cherry picked from commit 075a494)

github-actions · 2023-06-16T02:01:47Z

Looks like this PR hasn't been updated in a while so we're going to go ahead and mark this as Stale.
Feel free to remove the Stale label if you feel this was a mistake.
If you are unable to remove the Stale label please contact a maintainer in order to do so.
If you want the bot to never mark this PR stale again, add the no-stale label.
Stale pull requests will automatically be closed after 30 days of inactivity.

[MPS] Fix batchnorm for mixed types

2cc290c

By up/down casting weights to input types Extend unittests to support float16 input Fixes #96113

malfet requested a review from kulinseth as a code owner March 7, 2023 18:31

pytorch-bot bot added ciflow/mps Run MPS tests (subset of trunk) release notes: mps Release notes category labels Mar 7, 2023

malfet added the topic: bug fixes topic category label Mar 7, 2023

kulinseth reviewed Mar 8, 2023

View reviewed changes

kulinseth requested changes Mar 8, 2023

View reviewed changes

ethanliu1206 pushed a commit to kulinseth/pytorch that referenced this pull request Mar 9, 2023

Apply the change in Normalization.mm from pytorch#96208 to fix FP16 b…

6abf716

…atch norm error

malfet mentioned this pull request Mar 9, 2023

[MPS] Allow float16 input to float32 LayerNorm #96430

Closed

malfet mentioned this pull request Apr 18, 2023

[MPS] Allow float16 input to float32 LayerNorm (#96430) #99454

Closed

github-actions bot added the Stale label Jun 16, 2023

github-actions bot closed this Jul 16, 2023

github-actions bot deleted the malfet/mps-fix-batchnorm-mixed-types branch September 6, 2024 02:01

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[MPS] Fix batchnorm for mixed types#96208

[MPS] Fix batchnorm for mixed types#96208
malfet wants to merge 1 commit intomainfrom
malfet/mps-fix-batchnorm-mixed-types

malfet commented Mar 7, 2023 •

edited

Loading

Uh oh!

pytorch-bot bot commented Mar 7, 2023 •

edited

Loading

Uh oh!

kulinseth Mar 8, 2023

Uh oh!

kulinseth Mar 8, 2023

Uh oh!

malfet commented Mar 9, 2023

Uh oh!

github-actions bot commented Jun 16, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

malfet commented Mar 7, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

pytorch-bot bot commented Mar 7, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/96208

❌ 1 Failures

Uh oh!

kulinseth Mar 8, 2023

Choose a reason for hiding this comment

Uh oh!

kulinseth Mar 8, 2023

Choose a reason for hiding this comment

Uh oh!

malfet commented Mar 9, 2023

Uh oh!

github-actions bot commented Jun 16, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

malfet commented Mar 7, 2023 •

edited

Loading

pytorch-bot bot commented Mar 7, 2023 •

edited

Loading