[ROCm] Implement float32 copy kernel #163869

jerrymannil · 2025-09-25T16:23:28Z

Add float32_copy_kernel for vectorizing float16/bfloat16 to float32 conversion

cc @jeffdaily @sunway513 @jithunnair-amd @pruthvistony @ROCmSupport @dllehr-amd @jataylo @hongxiayang @naromero77amd

* Add `float32_copy_kernel` for vectorizing float16/bfloat16 to float32 conversion
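
For readers unfamiliar with the conversion being vectorized, the following is a minimal host-side C++ sketch of widening bfloat16 to float32 in fixed-size chunks with a scalar tail loop. It is not the kernel added by this PR, whose source does not appear in this thread. The names `bf16_bits_to_float`, `bf16_to_float32_copy`, and `kVecSize` are hypothetical, and the chunk width of 4 is an assumption chosen for illustration. Only bfloat16 is covered here; float16 has a different exponent layout and needs a real conversion routine rather than a bit shift.

```cpp
#include <cstddef>
#include <cstdint>
#include <cstring>
#include <vector>

// Widen one bfloat16 value (given as its raw 16 bits) to float32.
// bfloat16 is the upper 16 bits of an IEEE-754 binary32 value, so the
// conversion is a 16-bit left shift followed by a bit reinterpretation.
inline float bf16_bits_to_float(std::uint16_t bits) {
    std::uint32_t wide = static_cast<std::uint32_t>(bits) << 16;
    float out;
    std::memcpy(&out, &wide, sizeof(out));  // well-defined type punning
    return out;
}

// Hypothetical chunk width (an assumption for this sketch): the number of
// elements loaded and stored together in the main loop.
constexpr std::size_t kVecSize = 4;

// Convert a buffer of raw bfloat16 bit patterns to float32.
std::vector<float> bf16_to_float32_copy(const std::vector<std::uint16_t>& src) {
    const std::size_t n = src.size();
    std::vector<float> dst(n);

    // Main loop: handle kVecSize elements per iteration, mimicking one
    // wide load, a per-lane conversion, and one wide store.
    const std::size_t vec_end = n - (n % kVecSize);
    for (std::size_t i = 0; i < vec_end; i += kVecSize) {
        std::uint16_t chunk[kVecSize];
        std::memcpy(chunk, src.data() + i, sizeof(chunk));

        float widened[kVecSize];
        for (std::size_t j = 0; j < kVecSize; ++j) {
            widened[j] = bf16_bits_to_float(chunk[j]);
        }
        std::memcpy(dst.data() + i, widened, sizeof(widened));
    }

    // Tail loop: the remaining n % kVecSize elements, one at a time.
    for (std::size_t i = vec_end; i < n; ++i) {
        dst[i] = bf16_bits_to_float(src[i]);
    }
    return dst;
}
```

Calling `bf16_to_float32_copy` on a five-element buffer exercises both paths: the first four elements go through the chunked loop and the last one through the tail loop. On a GPU the chunked path would typically map to one wide memory transaction per thread, which is the usual motivation for vectorizing a copy like this.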

pytorch-bot · 2025-09-25T16:23:31Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/163869


❗ 1 Active SEVs

There is 1 currently active SEV. If your PR is affected, please view it below:

[Maintenance] MacOS runners update

✅ No Failures

As of commit d55cae5 with merge base c8e75c4:
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

* Added float16 support

jeffdaily · 2025-09-25T21:42:34Z

@pytorchbot merge

pytorchmergebot · 2025-09-25T21:45:10Z

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

cherry-pick of pytorch#163869

* Add `float32_copy_kernel` for vectorizing float16/bfloat16 to float32 conversion

Pull Request resolved: #163869
Approved by: https://github.com/jeffdaily

cherry-pick of pytorch#163869 (cherry picked from commit dfd386f)

[ROCm] Implement float32 copy kernel

b6646dd

* Add `float32_copy_kernel` for vectorizing float16/bfloat16 to float32 conversion

jerrymannil requested review from eqy and syed-ahmed as code owners September 25, 2025 16:23

pytorch-bot bot added the `module: rocm` and `release notes: cuda` labels Sep 25, 2025

pytorchbot added the open source label Sep 25, 2025

jeffdaily approved these changes Sep 25, 2025


jeffdaily added the `release notes: rocm`, `ciflow/rocm`, and `ciflow/rocm-mi300` labels and removed the `release notes: cuda` label Sep 25, 2025

Fix UT failures

d55cae5

* Added float16 support

pytorch-bot bot removed the `ciflow/rocm` and `ciflow/rocm-mi300` labels Sep 25, 2025

jeffdaily added the `ciflow/rocm` and `ciflow/rocm-mi300` labels Sep 25, 2025

pytorch-bot bot added the `ciflow/trunk` label Sep 25, 2025

pytorchmergebot added the merging label Sep 25, 2025

jerrymannil added a commit to ROCm/pytorch that referenced this pull request Sep 25, 2025

[ROCm] Implement float32 copy kernel

d03ca07

cherry-pick of pytorch#163869

jerrymannil mentioned this pull request Sep 25, 2025

[ROCm] Implement float32 copy kernel ROCm/pytorch#2682

Merged

jerrymannil added a commit to ROCm/pytorch that referenced this pull request Sep 25, 2025

[ROCm] Implement float32 copy kernel (#2682)

167f7e2

cherry-pick of pytorch#163869

jerrymannil added a commit to ROCm/pytorch that referenced this pull request Sep 25, 2025

[ROCm] Implement float32 copy kernel

8e3b5e5

cherry-pick of pytorch#163869

jerrymannil mentioned this pull request Sep 25, 2025

[ROCm] Implement float32 copy kernel ROCm/pytorch#2683

Merged

jerrymannil added a commit to ROCm/pytorch that referenced this pull request Sep 25, 2025

[ROCm] Implement float32 copy kernel (#2683)

dfd386f

cherry-pick of pytorch#163869

jerrymannil mentioned this pull request Sep 26, 2025

[ROCm] Implement float32 copy kernel ROCm/pytorch#2684

Merged

jerrymannil added a commit to ROCm/pytorch that referenced this pull request Sep 26, 2025

[ROCm] Implement float32 copy kernel (#2684)

069985e

cherry-pick of pytorch#163869

pytorchmergebot added the Merged label Sep 26, 2025

pytorchmergebot closed this in b4be380 Sep 26, 2025

pytorchmergebot removed the merging label Sep 26, 2025

pragupta pushed a commit to ROCm/pytorch that referenced this pull request Oct 8, 2025

[ROCm] Implement float32 copy kernel (#2683)

86692dd

cherry-pick of pytorch#163869 (cherry picked from commit dfd386f)

jithunnair-amd pushed a commit to ROCm/pytorch that referenced this pull request Oct 10, 2025

[ROCm] Implement float32 copy kernel (#2683)

55b2445

cherry-pick of pytorch#163869 (cherry picked from commit dfd386f)

jeffdaily pushed a commit to ROCm/pytorch that referenced this pull request Nov 17, 2025

[ROCm] Implement float32 copy kernel (#2683)

90b73a3

cherry-pick of pytorch#163869 (cherry picked from commit dfd386f)
