Add Quasi-Fused Multiply-Add/Subtract instructions

Maratyszcza · Maratyszcza · commit 5f5e20c1b377 · 2019-05-21T16:57:14.000-07:00
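
The semantics this commit specifies for `f64x2.qfma` and `f64x2.qfms` can be sketched in Python as follows. This is an illustrative model only, not part of the proposal: the names `make_f64x2_ops`, `_fused`, and `_unfused` are hypothetical, a `v128` is modeled as a 2-tuple of Python floats, and the fused path is emulated with exact rational arithmetic followed by a single rounding, which assumes finite inputs and does not preserve the sign of zero.

```python
from fractions import Fraction


def _fused(a, b, c, sign):
    # Exact a + sign * (b * c) as a rational, then ONE rounding to binary64.
    # Assumption: finite inputs only (Fraction cannot represent NaN or infinity).
    return float(Fraction(a) + sign * Fraction(b) * Fraction(c))


def _unfused(a, b, c, sign):
    # The product is rounded to binary64 first, then the sum is rounded again.
    product = b * c
    return a + product if sign > 0 else a - product


def make_f64x2_ops(fused):
    """Return (qfma, qfms) for a hypothetical module.

    The mode is chosen once per module, modeling the rule that all
    quasi-fused instructions in a module are either fused or unfused.
    """
    lane_op = _fused if fused else _unfused

    def qfma(a, b, c):
        # Lane-wise a + b * c
        return tuple(lane_op(x, y, z, +1) for x, y, z in zip(a, b, c))

    def qfms(a, b, c):
        # Lane-wise a - b * c
        return tuple(lane_op(x, y, z, -1) for x, y, z in zip(a, b, c))

    return qfma, qfms
```

With `b = c = 1 + 2**-27`, the exact product is `1 + 2**-26 + 2**-54`, and its last term is lost when the product is rounded to binary64. Adding `a = -(1 + 2**-26)` therefore gives `0.0` in the unfused mode and `2**-54` in the fused mode, which is the kind of observable difference the "quasi-fused" wording permits.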
diff --git a/proposals/simd/BinarySIMD.md b/proposals/simd/BinarySIMD.md
@@ -143,6 +143,8 @@ The `v8x16.shuffle2_imm` instruction has 16 bytes after `simdop`.
 | `f32x4.abs`               |    `0x95`| -                  |
 | `f32x4.neg`               |    `0x96`| -                  |
 | `f32x4.sqrt`              |    `0x97`| -                  |
+| `f32x4.qfma`              |    `0x98`| -                  |
+| `f32x4.qfms`              |    `0x99`| -                  |
 | `f32x4.add`               |    `0x9a`| -                  |
 | `f32x4.sub`               |    `0x9b`| -                  |
 | `f32x4.mul`               |    `0x9c`| -                  |
@@ -152,6 +154,8 @@ The `v8x16.shuffle2_imm` instruction has 16 bytes after `simdop`.
 | `f64x2.abs`               |    `0xa0`| -                  |
 | `f64x2.neg`               |    `0xa1`| -                  |
 | `f64x2.sqrt`              |    `0xa2`| -                  |
+| `f64x2.qfma`              |    `0xa3`| -                  |
+| `f64x2.qfms`              |    `0xa4`| -                  |
 | `f64x2.add`               |    `0xa5`| -                  |
 | `f64x2.sub`               |    `0xa6`| -                  |
 | `f64x2.mul`               |    `0xa7`| -                  |
diff --git a/proposals/simd/SIMD.md b/proposals/simd/SIMD.md
@@ -754,6 +754,18 @@ Lane-wise IEEE `multiplication`.
 
 Lane-wise IEEE `squareRoot`.
 
+### Quasi-Fused Multiply-Add
+* `f32x4.qfma(a: v128, b: v128, c: v128) -> v128`
+* `f64x2.qfma(a: v128, b: v128, c: v128) -> v128`
+
+Lane-wise multiplication and addition (`a + b * c`), computed either with or without intermediate rounding of the product. A WebAssembly implementation may execute this instruction either as an IEEE Fused-Multiply-Add (FMA) or as a combination of IEEE `multiplication` and IEEE `addition` operations, depending on the availability and performance of an FMA instruction on the target native platform. All `qfma` instructions in a WebAssembly module must execute the same way: either all as fused operations or all as unfused operations.
+
+### Quasi-Fused Multiply-Subtract
+* `f32x4.qfms(a: v128, b: v128, c: v128) -> v128`
+* `f64x2.qfms(a: v128, b: v128, c: v128) -> v128`
+
+Lane-wise multiplication and subtraction (`a - b * c`), computed either with or without intermediate rounding of the product. A WebAssembly implementation may execute this instruction either as an IEEE Fused-Multiply-Subtract (FMS) or as a combination of IEEE `multiplication` and IEEE `subtraction` operations, depending on the availability and performance of an FMS instruction on the target native platform. All `qfms` instructions in a WebAssembly module must execute the same way: either all as fused operations or all as unfused operations.
+
 ## Conversions
 ### Integer to floating point
 * `f32x4.convert_s/i32x4(a: v128) -> v128`