[SPIR-V] Allow intrinsics with aggregate return type to reach GlobalISel #108893

VyacheslavLevytskyy · 2024-09-16T22:17:49Z

Two main goals of this PR are:

to support "Arithmetic with Overflow" intrinsics, including the special case when those intrinsics are being generated by the CodeGenPrepare pass during translations with optimization;
to redirect intrinsics with aggregate return type to be lowered via GlobalISel operations instead of SPIRV-specific unfolding/lowering (see [SPIR-V] Lower llvm.x.with.overflow intrinsics #95012).

There is a new test case llvm/test/CodeGen/SPIRV/passes/translate-aggregate-uaddo.ll that describes and checks the general logics of the translation.

This PR continues a series of PRs aimed to identify and fix flaws in code emission, to improve pass rates for the mode with expensive checks set on (see #101732, #104104, #106966), having in mind the ultimate goal of proceeding towards the non-experimental status of SPIR-V Backend.

The reproducers are:

consider llc -O3 -mtriple=spirv64-unknown-unknown ... with:

define spir_func i32 @foo(i32 %a, ptr addrspace(4) %p) {
entry:
  br label %l1

l1:
  %e = phi i32 [ %a, %entry ], [ %i, %body ]
  %i = add nsw i32 %e, 1
  %fl = icmp eq i32 %i, 0
  br i1 %fl, label %exit, label %body

body:
  store i8 42, ptr addrspace(4) %p
  br label %l1

exit:
  ret i32 %i
}

consider llc -O0 -mtriple=spirv64-unknown-unknown ... with:

define spir_func i32 @foo(i32 %a, ptr addrspace(4) %p) {
entry:
  br label %l1

l1:                                               ; preds = %body, %entry
  %e = phi i32 [ %a, %entry ], [ %math, %body ]
  %0 = call { i32, i1 } @llvm.uadd.with.overflow.i32(i32 %e, i32 1)
  %math = extractvalue { i32, i1 } %0, 0
  %ov = extractvalue { i32, i1 } %0, 1
  br i1 %ov, label %exit, label %body

body:                                             ; preds = %l1
  store i8 42, ptr addrspace(4) %p, align 1
  br label %l1

exit:                                             ; preds = %l1
  ret i32 %math
}

github-actions · 2024-09-17T12:01:45Z

✅ With the latest revision this PR passed the C/C++ code formatter.

VyacheslavLevytskyy · 2024-09-18T09:42:32Z

FYI @efriedma-quic

s-perron

Looks okay with my limited knowledge, but I did not understand one part.

llvm/lib/Target/SPIRV/SPIRVInstructionSelector.cpp

llvm/lib/Target/SPIRV/SPIRVEmitIntrinsics.cpp

llvm/lib/Target/SPIRV/SPIRVInstructionSelector.cpp

Keenuts · 2024-09-19T12:39:52Z

llvm/lib/Target/SPIRV/SPIRVEmitIntrinsics.cpp

+
+bool isInternalNonVoidIntrinsic(const Value *I) {
+  if (const auto *II = dyn_cast<IntrinsicInst>(I))
+    switch (II->getIntrinsicID()) {


This is supposed to be the list of internal intrinsics not returning void?

Almost, but not exactly. This list doesn't include int_spv_gep, for example, because it returns ptr type. This list is to include internal intrinsics that returns a non-ptr value and so may potentially (but should not actually) participate in fake_use emission.

In that case, isn't the return type fetchable from the CallBase parent class? Or am I missing something? (Seems like maintaining a list like that is bound to end up wrong)

I think you are right. My motivation was to explore the topic further and widen applicability of this PR's way to pass info through passes, but we may extend/address this later if needed. Another motivation was that internal intrinsics are intended for quite different things, and it's hard to separate them as a class to use in conditions, but this again doesn't matter in case of this PR specifically.

So for goals of this PR it may be better indeed to be more general and don't list intrinsics explicitly, and I've changed the explicit list to F->getName().starts_with("llvm.spv.") to exclude internal SPIR-V backend intrinsics.

llvm/test/CodeGen/SPIRV/optimizations/add-check-overflow.ll

llvm/lib/Target/SPIRV/SPIRVPreLegalizer.cpp

llvm/lib/Target/SPIRV/SPIRVInstructionSelector.cpp

llvm/lib/Target/SPIRV/SPIRVPreLegalizer.cpp

… passes; implement selection of Arithmetic with Overflow Intrinsics

…om an optimization pass; (2) key translation steps with checks of LLVM IR / gMIR / SPIRV code patterns

@foo

…Sel (llvm#108893) Two main goals of this PR are: * to support "Arithmetic with Overflow" intrinsics, including the special case when those intrinsics are being generated by the CodeGenPrepare pass during translations with optimization; * to redirect intrinsics with aggregate return type to be lowered via GlobalISel operations instead of SPIRV-specific unfolding/lowering (see llvm#95012). There is a new test case `llvm/test/CodeGen/SPIRV/passes/translate-aggregate-uaddo.ll` that describes and checks the general logics of the translation. This PR continues a series of PRs aimed to identify and fix flaws in code emission, to improve pass rates for the mode with expensive checks set on (see llvm#101732, llvm#104104, llvm#106966), having in mind the ultimate goal of proceeding towards the non-experimental status of SPIR-V Backend. The reproducers are: 1) consider `llc -O3 -mtriple=spirv64-unknown-unknown ...` with: ``` define spir_func i32 @foo(i32 %a, ptr addrspace(4) %p) { entry: br label %l1 l1: %e = phi i32 [ %a, %entry ], [ %i, %body ] %i = add nsw i32 %e, 1 %fl = icmp eq i32 %i, 0 br i1 %fl, label %exit, label %body body: store i8 42, ptr addrspace(4) %p br label %l1 exit: ret i32 %i } ``` 2) consider `llc -O0 -mtriple=spirv64-unknown-unknown ...` with: ``` define spir_func i32 @foo(i32 %a, ptr addrspace(4) %p) { entry: br label %l1 l1: ; preds = %body, %entry %e = phi i32 [ %a, %entry ], [ %math, %body ] %0 = call { i32, i1 } @llvm.uadd.with.overflow.i32(i32 %e, i32 1) %math = extractvalue { i32, i1 } %0, 0 %ov = extractvalue { i32, i1 } %0, 1 br i1 %ov, label %exit, label %body body: ; preds = %l1 store i8 42, ptr addrspace(4) %p, align 1 br label %l1 exit: ; preds = %l1 ret i32 %math } ```

VyacheslavLevytskyy force-pushed the intrinsics_with_aggregate_ret_1 branch from 616fece to ae5fe0e Compare September 18, 2024 09:25

VyacheslavLevytskyy marked this pull request as ready for review September 18, 2024 09:38

VyacheslavLevytskyy requested review from michalpaszkowski and Keenuts September 18, 2024 09:38

VyacheslavLevytskyy requested review from s-perron and Keenuts and removed request for Keenuts September 18, 2024 09:45

s-perron reviewed Sep 18, 2024

View reviewed changes

llvm/lib/Target/SPIRV/SPIRVInstructionSelector.cpp Outdated Show resolved Hide resolved

llvm/lib/Target/SPIRV/SPIRVEmitIntrinsics.cpp Show resolved Hide resolved

efriedma-quic reviewed Sep 18, 2024

View reviewed changes

llvm/lib/Target/SPIRV/SPIRVInstructionSelector.cpp Outdated Show resolved Hide resolved

Keenuts reviewed Sep 19, 2024

View reviewed changes

michalpaszkowski reviewed Sep 19, 2024

View reviewed changes

llvm/lib/Target/SPIRV/SPIRVPreLegalizer.cpp Outdated Show resolved Hide resolved

llvm/lib/Target/SPIRV/SPIRVInstructionSelector.cpp Outdated Show resolved Hide resolved

llvm/lib/Target/SPIRV/SPIRVPreLegalizer.cpp Show resolved Hide resolved

VyacheslavLevytskyy force-pushed the intrinsics_with_aggregate_ret_1 branch from 97e333c to 54cc904 Compare September 20, 2024 09:44

VyacheslavLevytskyy added 16 commits September 23, 2024 06:27

allow intrinsics with aggregate return type to reach GlobalISel

067e404

implement keeping and referencing aggregate result attributes between…

416eafe

… passes; implement selection of Arithmetic with Overflow Intrinsics

fix passing info and access to metadata

45ea3fd

fixes

063030d

implement 'Arithmetic with Overflow' intrinsics

abcf8e6

clang-format

ff8e18e

add a test case

a65fd2e

add scalar/vector versions of OpIAddCarry/OpISubBorrow; add test cases

4e365d5

update docs

de45e6e

add specific test cases: (1) example of llvm intrinsics originated fr…

f3e0304

…om an optimization pass; (2) key translation steps with checks of LLVM IR / gMIR / SPIRV code patterns

correct insertion points for new instructions

0e47611

apply code review suggestions

e93b7d7

apply code review suggestions

3758ff9

apply code review suggestions

ee044e3

apply code review suggestions

f26041e

fix weird merge issue

c7a8975

VyacheslavLevytskyy force-pushed the intrinsics_with_aggregate_ret_1 branch from d49e9a3 to c7a8975 Compare September 23, 2024 13:27

michalpaszkowski approved these changes Sep 25, 2024

View reviewed changes

VyacheslavLevytskyy merged commit a059b29 into llvm:main Sep 26, 2024
10 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[SPIR-V] Allow intrinsics with aggregate return type to reach GlobalISel #108893

[SPIR-V] Allow intrinsics with aggregate return type to reach GlobalISel #108893

VyacheslavLevytskyy commented Sep 16, 2024 •

edited

Loading

github-actions bot commented Sep 17, 2024 •

edited

Loading

VyacheslavLevytskyy commented Sep 18, 2024

s-perron left a comment

Keenuts Sep 19, 2024

VyacheslavLevytskyy Sep 19, 2024 •

edited

Loading

Keenuts Sep 19, 2024

VyacheslavLevytskyy Sep 20, 2024

[SPIR-V] Allow intrinsics with aggregate return type to reach GlobalISel #108893

[SPIR-V] Allow intrinsics with aggregate return type to reach GlobalISel #108893

Conversation

VyacheslavLevytskyy commented Sep 16, 2024 • edited Loading

github-actions bot commented Sep 17, 2024 • edited Loading

VyacheslavLevytskyy commented Sep 18, 2024

s-perron left a comment

Choose a reason for hiding this comment

Keenuts Sep 19, 2024

Choose a reason for hiding this comment

VyacheslavLevytskyy Sep 19, 2024 • edited Loading

Choose a reason for hiding this comment

Keenuts Sep 19, 2024

Choose a reason for hiding this comment

VyacheslavLevytskyy Sep 20, 2024

Choose a reason for hiding this comment

VyacheslavLevytskyy commented Sep 16, 2024 •

edited

Loading

github-actions bot commented Sep 17, 2024 •

edited

Loading

VyacheslavLevytskyy Sep 19, 2024 •

edited

Loading