arrayfire
Add f16 support for modulus and norm
Add f16 support for modulus and norm. This PR also adds tests for the norm function.
Description
- Adds support for the modulus operation for the half type. This operation was missing in the CUDA backend because the CUDA math library does not support it for half. It is now implemented in software.
- Add support for norm for half type.
- Add tests for norm
- We now use the f*f functions that accept float types as inputs in CUDA kernels.
- Add tests for additional binary operations
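The list above says the half modulus is now done in software, but not how. One plausible approach, shown here only as a host-side sketch and not as the code in this PR, is to promote both half operands to float, call the float modulus, and convert the result back. The same promotion idea applies to norm, where accumulating in float avoids overflowing the narrow half range. All names below (`half_to_float`, `float_to_half`, `half_mod`, `half_norm2`) are hypothetical, and half values are passed as raw IEEE 754 binary16 bit patterns so the example needs only the standard library.

```cpp
#include <cmath>
#include <cstddef>
#include <cstdint>

// Decode an IEEE 754 binary16 bit pattern to float (exact for every input).
float half_to_float(std::uint16_t h) {
    const int sign = (h >> 15) & 1;
    const int exp  = (h >> 10) & 0x1F;
    const int man  = h & 0x3FF;
    float v;
    if (exp == 0) {
        v = std::ldexp(static_cast<float>(man), -24);  // zero or subnormal
    } else if (exp == 31) {
        v = man ? NAN : INFINITY;
    } else {
        v = std::ldexp(static_cast<float>(man | 0x400), exp - 25);  // normal
    }
    return sign ? -v : v;
}

// Encode a float as binary16, rounding to nearest (ties to even under the
// default floating-point rounding mode).
std::uint16_t float_to_half(float f) {
    const unsigned sign = std::signbit(f) ? 0x8000u : 0u;
    const float a = std::fabs(f);
    if (std::isnan(a)) return static_cast<std::uint16_t>(sign | 0x7E00u);
    if (a >= 65520.0f) return static_cast<std::uint16_t>(sign | 0x7C00u);  // rounds to inf
    if (a < std::ldexp(1.0f, -14)) {
        // Zero or subnormal: the stored value counts units of 2^-24.
        const unsigned q = static_cast<unsigned>(std::nearbyint(std::ldexp(a, 24)));
        return static_cast<std::uint16_t>(sign | q);
    }
    int e = 0;
    const float m = std::frexp(a, &e);  // a = m * 2^e with m in [0.5, 1)
    const unsigned q = static_cast<unsigned>(std::nearbyint(std::ldexp(m, 11)));
    // q is in [1024, 2048]; q == 2048 carries into the exponent field.
    const unsigned bits = (static_cast<unsigned>(e + 14) << 10) + (q - 1024u);
    return static_cast<std::uint16_t>(sign | bits);
}

// Hypothetical software modulus for half: promote, fmod in float, convert back.
std::uint16_t half_mod(std::uint16_t a, std::uint16_t b) {
    return float_to_half(std::fmod(half_to_float(a), half_to_float(b)));
}

// Hypothetical Euclidean norm of a half vector, accumulated in float.
float half_norm2(const std::uint16_t* v, std::size_t n) {
    float sum = 0.0f;
    for (std::size_t i = 0; i < n; ++i) {
        const float x = half_to_float(v[i]);
        sum += x * x;
    }
    return std::sqrt(sum);
}
```

For example, `half_mod(0x4780, 0x4000)` computes 7.5 mod 2.0 and yields the bit pattern for 1.5. In a CUDA kernel the conversions would more likely use the `__half2float` and `__float2half` intrinsics from `cuda_fp16.h` together with `fmodf`, rather than manual bit manipulation.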
Fixes: #3256
Changes to Users
- Users can now call modulus and norm on f16 (half) arrays from user code
Checklist
- [x] Rebased on latest master
- [x] Code compiles
- [x] Tests pass
- ~[ ] Functions added to unified API~
- ~[ ] Functions documented~