This repository was archived by the owner on Mar 28, 2023. It is now read-only.

[SYCL] Enable parallel indexers for AMD, after adding global offset #941

Merged (1 commit) on May 5, 2022

vladimirlaz

Must be reapplied after intel/llvm#5855

pvchupin pushed a commit to intel/llvm that referenced this pull request May 5, 2022
The purpose of this patch is to generalize the SYCL global offset pass and enable it for AMDGPU.

* enable global offset in AMD's HIP
* decorate SYCL kernels with a dedicated MDNode; this removes the need for the command-line options added by the SYCL driver, as discussed here: [SYCL] Generalize local accessor to shared mem pass #5149 (comment)
* extract common helpers for the local accessor and global offset passes
* generalize the pass
* introduce __builtin_amdgcn_implicit_offset and enable the pass for AMDGPU
* implement __spirv_GlobalOffset_[x,y,z]
* update the docs
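For readers unfamiliar with the term, here is a minimal host-side sketch of what a global offset means for a work-item's global id. It is plain C++, not SYCL or LLVM code; `Id3` and `global_id` are hypothetical names introduced only for illustration, and the formula is the usual per-dimension "offset + group * local range + local id" relation rather than anything taken from this patch.

```cpp
#include <array>
#include <cassert>
#include <cstddef>

// Hypothetical illustration type: one value per dimension (x, y, z).
using Id3 = std::array<std::size_t, 3>;

// Computes the global id of a work-item in each dimension.
// The offset is simply added on top of group * local_range + local_id.
// This is an assumption-level model, not the pass or runtime implementation.
Id3 global_id(const Id3& group, const Id3& local_range,
              const Id3& local_id, const Id3& offset) {
  Id3 out{};
  for (std::size_t d = 0; d < 3; ++d) {
    // A local id must lie inside the work-group.
    assert(local_id[d] < local_range[d]);
    out[d] = offset[d] + group[d] * local_range[d] + local_id[d];
  }
  return out;
}
```

With a zero offset the function reduces to the ordinary global id, which is why a kernel launched without an offset can be served by passing zeros.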

The main deviation from the NVPTX implementation is the need to support address spaces. On AMD, kernel arguments reside in the constant address space, so the kernel variant that takes an offset must copy it to the private address space. This keeps the call-graph interface consistent between the two variants, since the variant without an offset cannot allocate in the constant address space.
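The address-space point can be pictured with a rough plain C++ analogy. Standard C++ has no address spaces, so `PrivateOffset` merely stands in for private storage and `constant_arg` for a kernel argument in the constant address space; all names here are hypothetical and this is not the code the pass generates.

```cpp
#include <array>
#include <cassert>
#include <cstdint>

// Stand-in for a three-element offset held in private (per-work-item) storage.
using PrivateOffset = std::array<std::uint32_t, 3>;

// Shared callee: it always receives the offset through private storage,
// so both kernel variants can call it with the same signature.
std::uint32_t global_id_x(std::uint32_t base_id_x, const PrivateOffset* offset) {
  assert(offset != nullptr);
  return base_id_x + (*offset)[0];
}

// Variant without an offset: allocates a zero-filled private array locally.
std::uint32_t kernel_without_offset(std::uint32_t base_id_x) {
  PrivateOffset zero{};  // value-initialised to {0, 0, 0}
  return global_id_x(base_id_x, &zero);
}

// Variant with an offset: the argument is modelled as read-only "constant"
// memory and is copied into private storage before the shared callee runs.
std::uint32_t kernel_with_offset(std::uint32_t base_id_x,
                                 const std::uint32_t* constant_arg) {
  PrivateOffset priv{constant_arg[0], constant_arg[1], constant_arg[2]};
  return global_id_x(base_id_x, &priv);
}
```

The copy in `kernel_with_offset` is the price of keeping a single callee signature: the zero array in the other variant can only live in private storage, so the callee has to accept private storage in both cases.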

Corresponding test-suite PR: intel/llvm-test-suite#941
pvchupin merged commit b40d1e3 into intel May 5, 2022
pvchupin deleted the revert-940-revert-934-jakub/global_offset branch May 5, 2022 21:59

jchlanda commented May 6, 2022

This can now be closed, with b40d1e3 in.

myler pushed a commit to myler/llvm-test-suite that referenced this pull request Jun 17, 2022
…_delete

[ESIMD] Delete the old versions of LIT tests for LSC load/stores
aelovikov-intel pushed a commit to aelovikov-intel/llvm that referenced this pull request Mar 27, 2023