Winch: Add `trunc_sat` instructions for x64 with AVX #10226

jeffcharles · 2025-02-12T20:56:53Z

Part of #8093. Adds implementations for the following instructions:

i32x4.trunc_sat_f32x4_s
i32x4.trunc_sat_f32x4_u
i32x4.trunc_sat_f64x2_s_zero
i32x4.trunc_sat_f64x2_u_zero

github-actions · 2025-02-12T22:44:59Z

Subscribe to Label Action

cc @saulecabrera

This issue or pull request has been labeled: "winch"

Thus the following users have been cc'd because of the following labels:

saulecabrera: winch

To subscribe or unsubscribe from this label, edit the .github/subscribe-to-label.json configuration file.

Learn more.

saulecabrera · 2025-02-13T12:44:44Z

winch/codegen/src/isa/x64/masm.rs

+                    .xmm_vex_rr(AvxOpcode::Vpxor, reg.to_reg(), scratch.to_reg(), reg);
+            }
+            V128TruncSatKind::F32x4U => {
+                let reg2 = writable!(context.any_fpr(self)?);


I'm confused about the lifetime of this register, I was expecting a context.free_reg given that it seems to be temporary?

Good catch!

saulecabrera · 2025-02-13T12:48:16Z

winch/codegen/src/isa/x64/masm.rs

@@ -2543,6 +2543,183 @@ impl Masm for MacroAssembler {
        Ok(())
    }

+    fn v128_trunc_sat(
+        &mut self,
+        context: &mut CodeGenContext<Emission>,


Usually CodeGenContext is passed when there's a special ISA needs for lowering, but I don't think that the case here? If I'm reading the code correctly, it seems that the only reason why we need the context is to pop a temporary register when kind == F32x4U?

If that's the case, could we instead avoid passing the context and pass an Option<Reg> at each call site where appropriate?

Would we still want to allocate the extra register on AArch64 for F32x4U when it isn't necessary on that ISA?

Otherwise I'm not sure how to handle only allocating the register on x64 if we're doing the allocation outside the x64 macroassembler.

Would we still want to allocate the extra register on AArch64 for F32x4U when it isn't necessary on that ISA?

I was under the impression that we'd still need an extra reg for aarch64, is that not the case? More generally, maybe it's fine to leave it as is and we can refactor this once we add SIMD support for other backends.

AFAICT, we could implement this instruction with just a single emission of FCVTZU on AArch64 and we would only need the one register.

saulecabrera · 2025-02-13T12:50:06Z

winch/codegen/src/isa/x64/masm.rs

+        let reg = writable!(context.pop_to_reg(self, None)?.reg);
+        let scratch = writable!(regs::scratch_xmm());
+
+        match kind {


Each arm here is considerably long, could we extract each into it's own helper? I had a bit of a hard time following along, having each arm in its own function will make it easier (at least personally) to reason about each invariant.

saulecabrera

LGTM, thanks!

Winch: Add trunc_sat instructions for x64 with AVX

d646e00

jeffcharles requested review from a team as code owners February 12, 2025 20:56

jeffcharles requested review from cfallin and pchickey and removed request for a team February 12, 2025 20:56

github-actions bot added the winch Winch issues or pull requests label Feb 12, 2025

saulecabrera reviewed Feb 13, 2025

View reviewed changes

jeffcharles added 4 commits February 13, 2025 20:25

Free temp register

37be0c8

Move implementations into helper methods

4d53986

Merge branch 'main' into winch-simd-trunc-sat

c0cd909

Remove duplicate Wast test entries

210d91f

jeffcharles requested a review from saulecabrera February 13, 2025 21:53

saulecabrera approved these changes Feb 13, 2025

View reviewed changes

saulecabrera added this pull request to the merge queue Feb 13, 2025

Merged via the queue into bytecodealliance:main with commit 7f93c1e Feb 13, 2025
39 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Winch: Add `trunc_sat` instructions for x64 with AVX #10226

Winch: Add `trunc_sat` instructions for x64 with AVX #10226

jeffcharles commented Feb 12, 2025

github-actions bot commented Feb 12, 2025

saulecabrera Feb 13, 2025

jeffcharles Feb 13, 2025

saulecabrera Feb 13, 2025

jeffcharles Feb 13, 2025

saulecabrera Feb 13, 2025

jeffcharles Feb 13, 2025

saulecabrera Feb 13, 2025

saulecabrera left a comment

Winch: Add trunc_sat instructions for x64 with AVX #10226

Winch: Add trunc_sat instructions for x64 with AVX #10226

Conversation

jeffcharles commented Feb 12, 2025

github-actions bot commented Feb 12, 2025

Subscribe to Label Action

saulecabrera Feb 13, 2025

Choose a reason for hiding this comment

jeffcharles Feb 13, 2025

Choose a reason for hiding this comment

saulecabrera Feb 13, 2025

Choose a reason for hiding this comment

jeffcharles Feb 13, 2025

Choose a reason for hiding this comment

saulecabrera Feb 13, 2025

Choose a reason for hiding this comment

jeffcharles Feb 13, 2025

Choose a reason for hiding this comment

saulecabrera Feb 13, 2025

Choose a reason for hiding this comment

saulecabrera left a comment

Choose a reason for hiding this comment

Winch: Add `trunc_sat` instructions for x64 with AVX #10226

Winch: Add `trunc_sat` instructions for x64 with AVX #10226