arrayfire

Add f16 support for modulus and norm

Open umar456 opened this issue 3 years ago • 0 comments

Add f16 support for modulus and norm. This PR also adds tests for the norm function.

Description

Adds support for the modulus operation for the half type. This operation was missing in the CUDA backend because the CUDA math library does not support it for half, so it is now implemented in software.
Adds support for norm for the half type.
Adds tests for norm.
We now use ff functions that accept float types as inputs in CUDA kernels.
Adds tests for additional binary operations.
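
As a rough illustration of what "implementing modulus in software" can look like, here is a minimal host-side C++ sketch. It is not the code from this PR. The name `soft_mod` is hypothetical, and plain `float` stands in for the half type, on the assumption that half operands are promoted to float before the computation (in a CUDA kernel that promotion would typically go through `__half2float` and `__float2half` from `cuda_fp16.h`).

```cpp
#include <cmath>

// Hypothetical helper, not part of ArrayFire: a software modulus that
// follows the fmod convention (the result takes the sign of the dividend).
// Assumes the half inputs were already promoted to float by the caller.
inline float soft_mod(float x, float y) {
    // Truncate the quotient toward zero, then subtract the whole multiples of y.
    float q = std::trunc(x / y);
    return x - q * y;
}
```

For example, `soft_mod(5.5f, 2.0f)` evaluates to `1.5f` and `soft_mod(-5.5f, 2.0f)` to `-1.5f`. The sketch ignores edge cases such as an infinite divisor or very large quotients, where this formula loses precision or diverges from `std::fmod`.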

Fixes: #3256

Changes to Users

Users can now call modulus and norm on f16 (half) arrays from user code.

Checklist

[x] Rebased on latest master
[x] Code compiles
[x] Tests pass
~[ ] Functions added to unified API~
~[ ] Functions documented~

Jun 10 '22 20:06 umar456