[release/2.4] [ROCm] Enable vector size for 8 for half precision types in elementwise kernels (#1671) #1738
Jenkins build for commit 1626a877c9703e7ad341e8710b7bcf62e0ece7d7 finished as FAILURE. Detected error during PyTorch building:
@jerrymannil
!cherry-pick --onto release/2.5

Created branch release/2.5_cherry-pick_pr-1738 and #1831
…f precision types in elementwise kernels (#1831)
Cherry-pick of #1738
Co-authored-by: Jerry Mannil <[email protected]>
Enable the *_load_dwordx4 ISA instructions for BFloat16 and Half by using a vector size of 8.
Co-author: @akadutta
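
The arithmetic behind the description can be sketched in plain C++. A dwordx4 load moves four 32-bit dwords (16 bytes), so a 2-byte type needs eight elements per vector to fill one such load. The names below (`half_bits`, `AlignedVector`, `dwords_per_load`, `copy_vectorized`) are hypothetical and are not taken from this PR; `std::uint16_t` stands in for Half/BFloat16, and the vector size of 4 appears only for comparison. This is a host-side illustration of the sizes involved, not the kernel code itself.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <cstring>

// Hypothetical stand-in for a 16-bit type such as Half or BFloat16.
using half_bits = std::uint16_t;

// Hypothetical aligned vector: VecSize elements of T, aligned to its full
// size so the whole vector can be moved as a single wide access.
template <typename T, int VecSize>
struct alignas(sizeof(T) * VecSize) AlignedVector {
  T val[VecSize];
};

// Number of 32-bit dwords covered by one vector of VecSize elements.
template <typename T, int VecSize>
constexpr std::size_t dwords_per_load() {
  return sizeof(AlignedVector<T, VecSize>) / sizeof(std::uint32_t);
}

// 4 x 2 bytes = 8 bytes = 2 dwords.
static_assert(dwords_per_load<half_bits, 4>() == 2, "half x4 spans 2 dwords");
// 8 x 2 bytes = 16 bytes = 4 dwords, the width of a dwordx4 access.
static_assert(dwords_per_load<half_bits, 8>() == 4, "half x8 spans 4 dwords");
// 4 x 4 bytes = 16 bytes = 4 dwords: float reaches that width at size 4.
static_assert(dwords_per_load<float, 4>() == 4, "float x4 spans 4 dwords");

// Copy n elements in chunks of VecSize; n must be a multiple of VecSize.
// memcpy keeps the sketch valid for unaligned host pointers.
template <typename T, int VecSize>
void copy_vectorized(const T* src, T* dst, std::size_t n) {
  assert(n % VecSize == 0);
  using Vec = AlignedVector<T, VecSize>;
  for (std::size_t i = 0; i < n; i += VecSize) {
    Vec v;
    std::memcpy(&v, src + i, sizeof(Vec));  // one vector-wide load
    std::memcpy(dst + i, &v, sizeof(Vec));  // one vector-wide store
  }
}
```

Calling `copy_vectorized<half_bits, 8>(src, dst, n)` moves 16 bytes per iteration, whereas the same call with a vector size of 4 moves only 8 bytes per iteration.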