Skip to content
This repository was archived by the owner on Dec 22, 2021. It is now read-only.

Include floating-point rounding instructions #7

Closed
stoklund opened this issue Apr 21, 2017 · 3 comments
Closed

Include floating-point rounding instructions #7

stoklund opened this issue Apr 21, 2017 · 3 comments

Comments

@stoklund
Copy link
Contributor

The proposal in #1 doesn't include vectorized versions of the floating-point round-to-integer instructions:

  • f32x4.ceil(x: v128) -> v128
  • f32x4.floor(x: v128) -> v128
  • f32x4.trunc(x: v128) -> v128
  • f32x4.nearest(x: v128) -> v128
  • f64x2.ceil(x: v128) -> v128
  • f64x2.floor(x: v128) -> v128
  • f64x2.trunc(x: v128) -> v128
  • f64x2.nearest(x: v128) -> v128

It seems these instructions would be just as useful in vectorized floating-point code as they are in scalar code.

Are these instructions widely available in SIMD instruction sets we care about?

@simicicd
Copy link

On MIPS architecture (with MSA extension) there is FRINT.df instruction (df is element size 32,64b) with four rounding modes which can be set in MSACSR register (round to nearest, round towards zero (trunc), round towards positive/+inf (ceil), round towards negative/-inf(floor)).

@billbudge
Copy link

ARM has vector convert float to integer, round towards zero (truncate). That's the most natural operation to implement since it matches C behavior.

I believe there are instructions that use the rounding mode in the FP control register (FPSCR) but setting this register causes FP execution to stall until all instructions in progress complete. Therefore it can be slow. I believe most ISAs work like this.

I'd be concerned that exposing other rounding modes would be slow. There are clever tricks to simulate the other rounding modes efficiently.

@ngzhian
Copy link
Member

ngzhian commented Feb 18, 2021

We have these now in #232.

@ngzhian ngzhian closed this as completed Feb 18, 2021
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Projects
None yet
Development

No branches or pull requests

5 participants