Skip to content

[ESIMD] Rename slm_load4/slm_store4 to slm_load_rgba/slm_store_rgba #4158

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged
merged 6 commits into from
Sep 23, 2021

Conversation

DenisBakhvalov
Copy link
Contributor

@DenisBakhvalov DenisBakhvalov commented Jul 21, 2021

Corresponding E2E test here: intel/llvm-test-suite#372

@DenisBakhvalov DenisBakhvalov requested a review from kbobrovs July 21, 2021 18:26
ESIMD_INLINE ESIMD_NODEBUG typename sycl::detail::enable_if_t<
(N == 8 || N == 16 || N == 32) && (sizeof(T) == 4),
simd<T, N * get_num_channels_enabled(Mask)>>
slm_load_rgba(simd<uint32_t, N> offsets, simd<uint16_t, N> pred = 1) {
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

let's rename it to match non-SLM variant - slm_gather_rgba.
@rolandschulz - I recall that what we discussed.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I agree. I can do renaming in a separate commit since slm_load/store also needs to be renamed.

  • slm_load -> slm_gather
  • slm_load_rgba -> slm_gather_rgba
  • slm_store -> slm_scatter
  • slm_store_rgba -> slm_scatter_rgba

@rolandschulz, do you agree?

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I can do renaming in a separate commit since slm_load/store also needs to be renamed.

I don't see a point in doing this in a separate commit, frankly. First commit would be slm_load4->slm_load_rgba, second - slm_load_rgba->slm_gather_rgba. Why not slm_load4 -> slm_gather_rgba in one shot?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Renamed. E2E test for slm_load/store will be provided in intel/llvm-test-suite#380

This commit also replaces sycl::detail::enable_if_t with std::enable_if_t

Signed-off-by: Vyacheslav N Klochkov <[email protected]>
v-klochkov
v-klochkov previously approved these changes Sep 20, 2021
@v-klochkov
Copy link
Contributor

@kychendev Please review the patch, PR requires your review/approval.

kychendev
kychendev previously approved these changes Sep 20, 2021
@bader bader requested a review from kbobrovs September 20, 2021 12:01
@bader bader changed the title [ESIMD] rename slm_load4/slm_store4 to slm_load_rgba/slm_store_rgba [ESIMD] Rename slm_load4/slm_store4 to slm_load_rgba/slm_store_rgba Sep 20, 2021
@v-klochkov v-klochkov dismissed stale reviews from kychendev and themself via 9d1b437 September 23, 2021 17:10
Signed-off-by: Vyacheslav N Klochkov <[email protected]>
@v-klochkov
Copy link
Contributor

I have just resolved the conflicts with @kbobrovs changes replacing simd<uint16_t, N> with simd_mask

@kbobrovs kbobrovs merged commit 4c911b5 into intel:sycl Sep 23, 2021
alexbatashev added a commit to alexbatashev/llvm that referenced this pull request Sep 24, 2021
* upstream/sycl: (2344 commits)
  [ESIMD] Rename slm_load4/slm_store4 to slm_load_rgba/slm_store_rgba (intel#4158)
  [SYCL] Avoid re-computing group_range in nd_item::get_group_range() (intel#4621)
  [clang-offload-extract] Ignore zero padding in .tgting section (intel#4622)
  [Driver][SYCL] Fix -fsycl-help output when redirected (intel#4619)
  [Driver][SYCL][FPGA] Do not unbundle aoco as an archive for hardware (intel#4477)
  [Driver][SYCL] Fix offload-bundler and offload-deps triples (intel#4616)
  [SYCL] Fix bit_cast for half type (intel#4603)
  [SYCL] Fix a typo in accessor::get_range method (intel#4556)
  [SYCL] Detach allocas from their dependencies regardless of linked alloca presence (intel#4573)
  [SYCL][L0] Make sure that we only query/sync host-visible events from the host. (intel#4613)
  Fix tests with wrong alias metadata
  [Driver][SYCL] Fixup arguments to llc call for PIC and code-model (intel#4614)
  [SYCL][L0] Add ownership control for LeveL-Zero kernel_bundle interop. (intel#4576)
  [SYCL][Driver] Expose llvm-foreach --jobs functionality through a driver option (intel#4543)
  [SYCL] Prevent stream buffer leak on constructor exception (intel#4594)
  [ESIMD] Replace mask_type_t with simd_mask to represent Gen predicates. (intel#4230)
  Fix for a bunch of fixed point integer SPIR-V instructions (intel#1213)
  Amend SingleElementVectorINTEL decoration use cases according to spec update (intel#1192)
  Enforce UserSemantic decoration if no FPGA decorations found
  [SYCL][CUDA] Fix context scope in kernel launch (intel#4606)
  ...
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants