[0030][SM6.9] Add list of ops with vector overload (#481)

llvm-beanz · web-flow · commit 731c6cbc0cb8 · 2025-04-22T16:47:56.000-05:00
* [0030][SM6.9] Add list of ops with vector overload

This updates the spec to explicitly list the DXIL operations wtih vector
overloads being added by this feature.

* Add missing intrinsics based on #7120
diff --git a/proposals/0030-dxil-vectors.md b/proposals/0030-dxil-vectors.md
@@ -100,6 +100,7 @@ Previously usage of `extractelement` and `insertelement` in DXIL didn't allow dy
 #### Elementwise intrinsics
 
 A selection of elementwise intrinsics are given additional native vector forms.
+The full list of intrinsics with elementwise overloads is listed in [Appendix 1](#appendix-1-new-elementwise-overloads).
 Elementwise intrinsics are those that perform their calculations irrespective of the location of the element
  in the vector or matrix arguments except insofar as that position corresponds to those of the other elements
  that might be used in the individual element calculations.
@@ -183,6 +184,71 @@ Calculations should produce the correct results in all cases for a range of vect
 In practice, this testing will largely represent verifying correct intrinsic output
  with the new shader model.
 
+## Appendix 1: New Elementwise Overloads
+
+| Opcode |  Name              | Class              |
+| ------ | --------------     | --------           |
+| 6      | FAbs               | Unary              |
+| 7      | Saturate           | Unary              |
+| 8      | IsNaN              | IsSpecialFloat     |
+| 9      | IsInf              | IsSpecialFloat     |
+| 10     | IsFinite           | IsSpecialFloat     |
+| 11     | IsNormal           | IsSpecialFloat     |
+| 12     | Cos                | Unary              |
+| 13     | Sin                | Unary              |
+| 14     | Tan                | Unary              |
+| 15     | Acos               | Unary              |
+| 16     | Asin               | Unary              |
+| 17     | Atan               | Unary              |
+| 18     | Hcos               | Unary              |
+| 19     | Hsin               | Unary              |
+| 20     | Htan               | Unary              |
+| 21     | Exp                | Unary              |
+| 22     | Frc                | Unary              |
+| 23     | Log                | Unary              |
+| 24     | Sqrt               | Unary              |
+| 25     | Rsqrt              | Unary              |
+| 26     | Round_ne           | Unary              |
+| 27     | Round_ni           | Unary              |
+| 28     | Round_pi           | Unary              |
+| 29     | Round_z            | Unary              |
+| 30     | Bfrev              | Unary              |
+| 31     | Countbits          | UnaryBits          |
+| 32     | FirstBitLo         | UnaryBits          |
+| 33     | FirstBitHi         | UnaryBits          |
+| 34     | FirstBitSHi        | UnaryBits          |
+| 35     | FMax               | Binary             |
+| 36     | FMin               | Binary             |
+| 37     | IMax               | Binary             |
+| 38     | IMin               | Binary             |
+| 39     | UMax               | Binary             |
+| 40     | UMin               | Binary             |
+| 46     | FMad               | Tertiary           |
+| 47     | Fma                | Tertiary           |
+| 48     | IMad               | Tertiary           |
+| 49     | UMad               | Tertiary           |
+| 83     | DerivCoarseX       | Unary              |
+| 84     | DerivCoarseY       | Unary              |
+| 85     | DerivFineX         | Unary              |
+| 86     | DerivFineY         | Unary              |
+| 115    | WaveActiveAllEqual | WaveActiveAllEqual |
+| 117    | WaveReadLaneAt     | WaveReadLaneAt     |
+| 118    | WaveReadLaneFirst  | WaveReadLaneFirst  |
+| 119    | WaveActiveOp       | WaveActiveOp       |
+| 120    | WaveActiveBit      | WaveActiveBit      |
+| 121    | WavePrefixOp       | WavePrefixOp       |
+| 122    | QuadReadLaneAt     | QuadReadLaneAt     |
+| 123    | QuadOp             | QuadOp             |
+| 124    | BitcastI16toF16    | BitcastI16toF16    |
+| 125    | BitcastF16toI16    | BitcastF16toI16    |
+| 126    | BitcastI32toF32    | BitcastI32toF32    |
+| 127    | BitcastF32toI32    | BitcastF32toI32    |
+| 128    | BitcastI64toF64    | BitcastI64toF64    |
+| 129    | BitcastF64toI64    | BitcastF64toI64    |
+| 165    | WaveMatch          | WaveMatch          |
+
+
+
 ## Acknowledgments
 
 * [Anupama Chandrasekhar](https://github.com/anupamachandra) and [Tex Riddell](https://github.com/tex3d) for foundational contributions to the design.