Enable simd_extmul_* for AArch64 #3070

sparker-arm · 2021-07-08T15:42:19Z

Lower simd_extmul_[low/high][signed/unsigned] to [s|u]widen inputs to
an imul node.

This also reorganises the aarch64 'long mul' operations and their assembly.

cfallin · 2021-07-14T19:57:13Z

@sparker-arm @akirilov-arm I'm currently out on vacation until Mon Jul 26; I can review this when I'm back if you prefer, or perhaps one of @abrown or @alexcrichton could review?

akirilov-arm · 2021-07-14T20:30:06Z

@cfallin I pinged you because historically you had done most of the AArch64-related reviews (and Sam was actually unable to request a review from anybody), but another reviewer would be absolutely fine.

alexcrichton

Looks pretty reasonable to me!

alexcrichton · 2021-07-14T20:46:57Z

cranelift/codegen/src/isa/aarch64/inst/mod.rs

+    Umull32,
+    /// Unsigned multiply add long
+    Umlal8,
+    Umlal16,


It seems like this and Umlal8 aren't actually generated in any lowerings (unless I'm missing something), but do you want to leave these in for completeness with possible future lowerings?

Yes, exactly - thought it would be weird to just port the 32-bit version.

alexcrichton · 2021-07-14T20:48:36Z

cranelift/codegen/src/isa/aarch64/lower_inst.rs

+                    ra: zero_reg(),
+                });
+            } else if ty.is_vector() {
+                for ext_op in &[


One possible way to structure this is to perhaps check for is_vector and i64x2 early on at the top of this block with early-returns if they match, and if that fails all the remaining cases share the logic of

let rn = put_input_in_reg(ctx, inputs[0], NarrowValueMode::None); let rm = put_input_in_reg(ctx, inputs[1], NarrowValueMode::None); let rd = get_output_reg(ctx, outputs[0]).only_reg().unwrap();

as a prefix.

Just a thought though, happy to defer to you who work on this much more than I!

So even though I don't like the duplication, or how much conditional stuff is going on here, we would only be able to factor out one collection of these calls as the I128 case uses, ever so slightly, different calls to handle multiple registers. So I think I prefer the clarity.

cfallin

LGTM, thanks! Just one request for a doc-comment below.

cfallin · 2021-07-26T21:23:09Z

cranelift/codegen/src/isa/aarch64/lower.rs

@@ -1243,6 +1243,150 @@ pub(crate) fn maybe_input_insn_via_conv<C: LowerCtx<I = Inst>>(
    None
 }

+pub(crate) fn match_vec_long_mul<C: LowerCtx<I = Inst>>(


Can we add a doc-comment here describing the return tuple? E.g. it's not clear to me reading at this point what the bool denotes, until I notice the low/high pattern below in the implementation.

cfallin

Thanks!

github-actions bot added cranelift Issues related to the Cranelift code generator cranelift:area:aarch64 Issues related to AArch64 backend. cranelift:wasm labels Jul 8, 2021

sparker-arm force-pushed the simd-extmul-aarch64 branch from 00b264c to ff2cd0f Compare July 9, 2021 09:28

akirilov-arm requested a review from cfallin July 14, 2021 17:44

akirilov-arm mentioned this pull request Jul 14, 2021

Add simd_extmul_* support for x64 #3084

Merged

alexcrichton approved these changes Jul 14, 2021

View reviewed changes

akirilov-arm mentioned this pull request Jul 15, 2021

Incorrect codegen for i16x8.extmul_high_i8x16_s on x64 #3089

Closed

cfallin approved these changes Jul 26, 2021

View reviewed changes

sparker-arm added 3 commits July 28, 2021 13:14

Enable simd_extmul_* for AArch64

541a4ee

Lower simd_extmul_[low/high][signed/unsigned] to [s|u]widen inputs to an imul node. Copyright (c) 2021, Arm Limited.

Added doc comment

5eb2dca

And removed an accidental code move. Copyright (c) 2021, Arm Limited.

sparker-arm force-pushed the simd-extmul-aarch64 branch from ff2cd0f to 5eb2dca Compare July 28, 2021 12:18

cfallin approved these changes Jul 28, 2021

View reviewed changes

cfallin merged commit 323197e into bytecodealliance:main Jul 28, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Enable simd_extmul_* for AArch64 #3070

Enable simd_extmul_* for AArch64 #3070

sparker-arm commented Jul 8, 2021

cfallin commented Jul 14, 2021

akirilov-arm commented Jul 14, 2021

alexcrichton left a comment

alexcrichton Jul 14, 2021

sparker-arm Jul 15, 2021

alexcrichton Jul 14, 2021

sparker-arm Jul 15, 2021

cfallin left a comment

cfallin Jul 26, 2021

cfallin left a comment

Enable simd_extmul_* for AArch64 #3070

Enable simd_extmul_* for AArch64 #3070

Conversation

sparker-arm commented Jul 8, 2021

cfallin commented Jul 14, 2021

akirilov-arm commented Jul 14, 2021

alexcrichton left a comment

Choose a reason for hiding this comment

alexcrichton Jul 14, 2021

Choose a reason for hiding this comment

sparker-arm Jul 15, 2021

Choose a reason for hiding this comment

alexcrichton Jul 14, 2021

Choose a reason for hiding this comment

sparker-arm Jul 15, 2021

Choose a reason for hiding this comment

cfallin left a comment

Choose a reason for hiding this comment

cfallin Jul 26, 2021

Choose a reason for hiding this comment

cfallin left a comment

Choose a reason for hiding this comment