arrayfire icon indicating copy to clipboard operation
arrayfire copied to clipboard

Add f16 support for modulus and norm

Open umar456 opened this issue 3 years ago • 0 comments

Add support for f16 for modulus and norm. This PR also adds tests for the norm function

Description

  • Adds support for modulus operation for half type. This operation is missing in the CUDA backend because it is not supported by the CUDA math library. We are now implementing this function in software
  • Add support for norm for half type.
  • Add tests for norm
  • We now use ff functions that accept float types as inputs in CUDA kernels.
  • Add tests for additional binary operations

Fixes: #3256

Changes to Users

  • Users can now call modulus and norm from user code

Checklist

  • [x] Rebased on latest master
  • [x] Code compiles
  • [x] Tests pass
  • ~[ ] Functions added to unified API~
  • ~[ ] Functions documented~

umar456 avatar Jun 10 '22 20:06 umar456