i64x2.abs instruction #413

Maratyszcza · 2020-12-23T22:59:53Z

Introduction

This is proposal to add 64-bit variant of existing abs instruction. ARM64 and x86 with AVX512 natively support this instruction, and on earlier instruction sets it can be emulated with 3-5 instructions.

Applications

Mapping to Common Instruction Sets

This section illustrates how the new WebAssembly instructions can be lowered on common instruction sets. However, these patterns are provided only for convenience, compliant WebAssembly implementations do not have to follow the same code generation patterns.

x86/x86-64 processors with AVX512F and AVX512VL instruction sets

i64x2.abs
- y = i64x2.abs(x) is lowered to VPABSQ xmm_y, xmm_x

x86/x86-64 processors with AVX instruction set

i64x2.abs
- y = i64x2.abs(x) (x is not y) is lowered to:
  - VPXOR xmm_y, xmm_y, xmm_y
  - VPSUBQ xmm_y, xmm_y, xmm_x
  - VBLENDVPD xmm_y, xmm_x, xmm_y, xmm_x

x86/x86-64 processors with SSE4.1 instruction set

i64x2.abs
- y = i64x2.abs(x) (x is not y and x/y is not in xmm0) is lowered to:
  - PXOR xmm0, xmm0, xmm0
  - PSUBQ xmm0, xmm_x
  - MOVDQA xmm_y, xmm0
  - BLENDVPD xmm_y, xmm_x

x86/x86-64 processors with SSE2 instruction set

i64x2.abs
- y = i64x2.abs(x) is lowered to:
  - PSHUFD xmm_tmp, xmm_x, 0xF5
  - MOVDQA xmm_y, xmm_x
  - PSRAD xmm_tmp, 31
  - PXOR xmm_y, xmm_tmp
  - PSUBQ xmm_y, xmm_tmp
- x = i64x2.abs(x) is lowered to:
  - PSHUFD xmm_tmp, xmm_x, 0xF5
  - PSRAD xmm_tmp, 31
  - PXOR xmm_x, xmm_tmp
  - PSUBQ xmm_x, xmm_tmp

ARM64 processors

i64x2.abs
- y = i64x2.abs(x) is lowered to ABS Vy.2D, Vx.2D

ARMv7 processors with NEON instruction set

i64x2.abs
- y = i64x2.abs(x) is lowered to:
  - VSHR.S64 Qtmp, Qx, #63
  - VEOR Qy, Qy, Qtmp
  - VSUB.I64 Qy, Qx, Qtmp

jan-wassenberg · 2021-01-25T07:59:51Z

Strong support, I'm adding this to Highway as well. It would be much harder for users to emulate this, especially if we do not add sign select nor i64 gt_s.

dtig · 2021-01-25T19:31:33Z

Adding a preliminary vote for the inclusion of i64x2.abs operation to the SIMD proposal below. Please vote with -

👍 For including i64x2.abs
👎 Against including i64x2.abs

penzn · 2021-01-25T22:55:55Z

I do have an issue with examples here - they seem to be all wrapper libraries. It isn't surprising that wrapper libraries would ave all sorts of operations, but this isn't the same as an app somebody could run.

Maratyszcza · 2021-02-09T21:02:10Z

Fixed a bug in suggested lowering on SSE2 and ARM NEON (thanks @ngzhian for reporting).

This was merged in WebAssembly#413.

This was merged in #413.

tlively mentioned this pull request Jan 8, 2021

Agenda for sync meeting 1/22/21 #419

Closed

Maratyszcza force-pushed the abs-64bit branch from 1523bf1 to 53b82d8 Compare January 19, 2021 20:52

tlively mentioned this pull request Jan 23, 2021

Agenda for sync meeting 1/29/21 #429

Closed

ngzhian added the 2021-01-29 Agenda for sync meeting 1/29/21 label Jan 26, 2021

tlively mentioned this pull request Jan 31, 2021

Agenda for sync meeting 2/5/21 #436

Closed

Maratyszcza force-pushed the abs-64bit branch from 53b82d8 to 10706d4 Compare February 1, 2021 17:02

dtig added needs discussion Proposal with an unclear resolution and removed 2021-01-29 Agenda for sync meeting 1/29/21 labels Feb 2, 2021

Maratyszcza force-pushed the abs-64bit branch 3 times, most recently from f89840d to c7b0168 Compare February 5, 2021 16:43

i64x2.abs instruction

7384f2a

Maratyszcza force-pushed the abs-64bit branch from c7b0168 to 7384f2a Compare February 5, 2021 16:44

tlively merged commit 961edc4 into WebAssembly:master Feb 5, 2021

ngzhian added a commit to ngzhian/simd that referenced this pull request Feb 9, 2021

[spec-text] Add i64x2.abs

0b47357

This was merged in WebAssembly#413.

ngzhian mentioned this pull request Feb 9, 2021

[spectext] Add i64x2.abs #457

Merged

ngzhian added a commit to ngzhian/simd that referenced this pull request Feb 10, 2021

[interpreter] Implement i64x2.abs

befe99e

This was merged in WebAssembly#413.

ngzhian mentioned this pull request Feb 10, 2021

[interpreter] Implement i64x2.abs #462

Merged

ngzhian added a commit that referenced this pull request Feb 10, 2021

[interpreter] Implement i64x2.abs

fe63095

This was merged in #413.

ngzhian added a commit that referenced this pull request Feb 10, 2021

[spec-text] Add i64x2.abs

ab6a361

This was merged in #413.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

i64x2.abs instruction #413

i64x2.abs instruction #413

Maratyszcza commented Dec 23, 2020 •

edited

Loading

jan-wassenberg commented Jan 25, 2021

dtig commented Jan 25, 2021

penzn commented Jan 25, 2021

Maratyszcza commented Feb 9, 2021

i64x2.abs instruction #413

i64x2.abs instruction #413

Conversation

Maratyszcza commented Dec 23, 2020 • edited Loading

Introduction

Applications

Mapping to Common Instruction Sets

x86/x86-64 processors with AVX512F and AVX512VL instruction sets

x86/x86-64 processors with AVX instruction set

x86/x86-64 processors with SSE4.1 instruction set

x86/x86-64 processors with SSE2 instruction set

ARM64 processors

ARMv7 processors with NEON instruction set

jan-wassenberg commented Jan 25, 2021

dtig commented Jan 25, 2021

penzn commented Jan 25, 2021

Maratyszcza commented Feb 9, 2021

Maratyszcza commented Dec 23, 2020 •

edited

Loading