-
Notifications
You must be signed in to change notification settings - Fork 26.3k
[PyTorch] add NEON half2float fmadd/fmsub #137723
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
NEON supports this (FMLAL/FMLAL2)and our FP16 GEMV fast path uses it. Add it as a supported Vectorized interface. Differential Revision: [D64197048](https://our.internmc.facebook.com/intern/diff/D64197048/) [ghstack-poisoned]
🔗 Helpful Links🧪 See artifacts and rendered test results at hud.pytorch.org/pr/137723
Note: Links to docs will display an error until the docs builds have been completed. ❌ 2 New FailuresAs of commit de8a7a9 with merge base de4c2a3 ( NEW FAILURES - The following jobs have failed:
This comment was automatically generated by Dr. CI and updates every 15 minutes. |
|
This pull request was exported from Phabricator. Differential Revision: D64197048 |
NEON supports this (FMLAL/FMLAL2)and our FP16 GEMV fast path uses it. Add it as a supported Vectorized interface. Differential Revision: [D64197048](https://our.internmc.facebook.com/intern/diff/D64197048/) ghstack-source-id: 247361421 Pull Request resolved: #137723
NEON supports this (FMLAL/FMLAL2)and our FP16 GEMV fast path uses it. Add it as a supported Vectorized interface. Differential Revision: [D64197048](https://our.internmc.facebook.com/intern/diff/D64197048/) [ghstack-poisoned]
|
This pull request was exported from Phabricator. Differential Revision: D64197048 |
Pull Request resolved: #137723 NEON supports this (FMLAL/FMLAL2)and our FP16 GEMV fast path uses it. Add it as a supported Vectorized interface. ghstack-source-id: 247364835 @exported-using-ghexport Differential Revision: [D64197048](https://our.internmc.facebook.com/intern/diff/D64197048/)
NEON supports this (FMLAL/FMLAL2)and our FP16 GEMV fast path uses it. Add it as a supported Vectorized interface. Differential Revision: [D64197048](https://our.internmc.facebook.com/intern/diff/D64197048/) [ghstack-poisoned]
|
This pull request was exported from Phabricator. Differential Revision: D64197048 |
Pull Request resolved: #137723 NEON supports this (FMLAL/FMLAL2)and our FP16 GEMV fast path uses it. Add it as a supported Vectorized interface. ghstack-source-id: 247383137 @exported-using-ghexport Differential Revision: [D64197048](https://our.internmc.facebook.com/intern/diff/D64197048/)
…d/fmsub" NEON supports this (FMLAL/FMLAL2)and our FP16 GEMV fast path uses it. Add it as a supported Vectorized interface. Differential Revision: [D64197048](https://our.internmc.facebook.com/intern/diff/D64197048/) [ghstack-poisoned]
|
This pull request was exported from Phabricator. Differential Revision: D64197048 |
Pull Request resolved: #137723 NEON supports this (FMLAL/FMLAL2)and our FP16 GEMV fast path uses it. Add it as a supported Vectorized interface. ghstack-source-id: 247393404 @exported-using-ghexport Differential Revision: [D64197048](https://our.internmc.facebook.com/intern/diff/D64197048/)
|
this is getting folded into a forthcoming PR |
Stack from ghstack (oldest at bottom):
defined(__aarch64__) && !defined(CPU_CAPABILITY_SVE256)instead ofdefined(CPU_CAPABILITY_NEON)#137722NEON supports this (FMLAL/FMLAL2)and our FP16 GEMV fast path uses it. Add it as a supported Vectorized interface.
Differential Revision: D64197048
cc @jgong5 @mingfeima @XiaobingSuper @sanchitintel @ashokei @jingxu10