[release/2.6] [SWDEV-529824] Fix Float16 CooperativeReduction Test Failure #2204

pmaybank · 2025-05-28T15:05:34Z

Previously expected values were calculated on GPU using same dtype as result values
Now expected values are calculated on CPU using Float32 dtype
This fixes a test failure that was observed on Navi48 where difference between Eager mode (expected) and Inductor / Triton (result) did not meet the error tolerance when sum was evaluated on an array of Float16 values

Cherry-picked to release/2.7 branch via #2211

Cherry-picked to rocm6.5_internal_testing branch via #2212

Cherry-picked to rocm7.0_internal_testing branch via #2252

…t failure

pmaybank · 2025-05-30T10:23:57Z

!cherry-pick --onto release/2.7

…ilure (#2204) - Previously expected values were calculated on GPU using same dtype as result values - Now expected values are calculated on CPU using Float32 dtype - This fixes a test failure that was observed on Navi48 where difference between Eager mode (expected) and Inductor / Triton (result) did not meet the error tolerance when sum was evaluated on an array of Float16 values Co-authored-by: pnikolic-amd <[email protected]>

okakarpa · 2025-05-30T10:26:11Z

Created branch autogenerated/release/2.7_cherry-pick_pr-2204 and #2211

pmaybank · 2025-05-30T10:29:36Z

!cherry-pick --onto rocm6.5_internal_testing

…ilure (#2204) - Previously expected values were calculated on GPU using same dtype as result values - Now expected values are calculated on CPU using Float32 dtype - This fixes a test failure that was observed on Navi48 where difference between Eager mode (expected) and Inductor / Triton (result) did not meet the error tolerance when sum was evaluated on an array of Float16 values Co-authored-by: pnikolic-amd <[email protected]>

okakarpa · 2025-05-30T10:31:15Z

Created branch autogenerated/rocm6.5_internal_testing_cherry-pick_pr-2204 and #2212

…ilure (#2204) - Previously expected values were calculated on GPU using same dtype as result values - Now expected values are calculated on CPU using Float32 dtype - This fixes a test failure that was observed on Navi48 where difference between Eager mode (expected) and Inductor / Triton (result) did not meet the error tolerance when sum was evaluated on an array of Float16 values Co-authored-by: pnikolic-amd <[email protected]> (cherry picked from commit 8fe3cdd)

…6 CooperativeReduction Test Failure (#2211) Cherry-pick of #2204 Co-authored-by: pmaybank <[email protected]> Co-authored-by: pnikolic-amd <[email protected]>

…4] Fix Float16 CooperativeReduction Test Failure (#2212) Cherry-pick of #2204 Co-authored-by: pmaybank <[email protected]> Co-authored-by: pnikolic-amd <[email protected]>

jithunnair-amd · 2025-06-07T03:15:15Z

!cherry-pick --onto rocm7.0_internal_testing

…ilure (#2204) - Previously expected values were calculated on GPU using same dtype as result values - Now expected values are calculated on CPU using Float32 dtype - This fixes a test failure that was observed on Navi48 where difference between Eager mode (expected) and Inductor / Triton (result) did not meet the error tolerance when sum was evaluated on an array of Float16 values Co-authored-by: pnikolic-amd <[email protected]>

okakarpa · 2025-06-07T03:21:16Z

Created branch autogenerated/rocm7.0_internal_testing_cherry-pick_pr-2204 and #2252

…ilure (#2204) - Previously expected values were calculated on GPU using same dtype as result values - Now expected values are calculated on CPU using Float32 dtype - This fixes a test failure that was observed on Navi48 where difference between Eager mode (expected) and Inductor / Triton (result) did not meet the error tolerance when sum was evaluated on an array of Float16 values Co-authored-by: pnikolic-amd <[email protected]>

…ilure (#2204) - Previously expected values were calculated on GPU using same dtype as result values - Now expected values are calculated on CPU using Float32 dtype - This fixes a test failure that was observed on Navi48 where difference between Eager mode (expected) and Inductor / Triton (result) did not meet the error tolerance when sum was evaluated on an array of Float16 values Co-authored-by: pnikolic-amd <[email protected]> (cherry picked from commit 8fe3cdd) (cherry picked from commit 34f3b3e)

…ilure (ROCm#2204) - Previously expected values were calculated on GPU using same dtype as result values - Now expected values are calculated on CPU using Float32 dtype - This fixes a test failure that was observed on Navi48 where difference between Eager mode (expected) and Inductor / Triton (result) did not meet the error tolerance when sum was evaluated on an array of Float16 values Co-authored-by: pnikolic-amd <[email protected]> (cherry picked from commit 8fe3cdd) (cherry picked from commit 34f3b3e)

…ilure (#2204) - Previously expected values were calculated on GPU using same dtype as result values - Now expected values are calculated on CPU using Float32 dtype - This fixes a test failure that was observed on Navi48 where difference between Eager mode (expected) and Inductor / Triton (result) did not meet the error tolerance when sum was evaluated on an array of Float16 values Co-authored-by: pnikolic-amd <[email protected]> (cherry picked from commit 8fe3cdd) (cherry picked from commit 34f3b3e)

Fix CooperativeReductionTests.test_reduction_fns_name_sum_float16 tes…

3495de3

…t failure

pmaybank requested review from jataylo and jeffdaily May 28, 2025 15:05

pmaybank changed the title ~~Fix Float16 CooperativeReduction Test Failure~~ [release/2.6] Fix Float16 CooperativeReduction Test Failure May 29, 2025

pmaybank changed the title ~~[release/2.6] Fix Float16 CooperativeReduction Test Failure~~ [release/2.6] [SWDEV-529824] Fix Float16 CooperativeReduction Test Failure May 29, 2025

jeffdaily approved these changes May 29, 2025

View reviewed changes

jeffdaily merged commit 8fe3cdd into release/2.6 May 29, 2025

jeffdaily deleted the pmaybank/inductor-reductions-1 branch May 29, 2025 15:41

okakarpa mentioned this pull request May 30, 2025

[AUTOGENERATED] [release/2.7] [release/2.6] [SWDEV-529824] Fix Float16 CooperativeReduction Test Failure #2211

Merged

okakarpa mentioned this pull request May 30, 2025

[AUTOGENERATED] [rocm6.5_internal_testing] [release/2.6] [SWDEV-529824] Fix Float16 CooperativeReduction Test Failure #2212

Merged

okakarpa mentioned this pull request Jun 7, 2025

[AUTOGENERATED] [rocm7.0_internal_testing] [release/2.6] [SWDEV-529824] Fix Float16 CooperativeReduction Test Failure #2252

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[release/2.6] [SWDEV-529824] Fix Float16 CooperativeReduction Test Failure #2204

[release/2.6] [SWDEV-529824] Fix Float16 CooperativeReduction Test Failure #2204

Uh oh!

pmaybank commented May 28, 2025 •

edited by okakarpa

Loading

Uh oh!

pmaybank commented May 30, 2025

Uh oh!

okakarpa commented May 30, 2025

Uh oh!

pmaybank commented May 30, 2025

Uh oh!

okakarpa commented May 30, 2025

Uh oh!

jithunnair-amd commented Jun 7, 2025

Uh oh!

okakarpa commented Jun 7, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants

[release/2.6] [SWDEV-529824] Fix Float16 CooperativeReduction Test Failure #2204

[release/2.6] [SWDEV-529824] Fix Float16 CooperativeReduction Test Failure #2204

Uh oh!

Conversation

pmaybank commented May 28, 2025 • edited by okakarpa Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

pmaybank commented May 30, 2025

Uh oh!

okakarpa commented May 30, 2025

Uh oh!

pmaybank commented May 30, 2025

Uh oh!

okakarpa commented May 30, 2025

Uh oh!

jithunnair-amd commented Jun 7, 2025

Uh oh!

okakarpa commented Jun 7, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants

pmaybank commented May 28, 2025 •

edited by okakarpa

Loading