[SYCL][ESIMD] Add support for lsc mem access APIs #5512

sndmitriev · 2022-02-08T15:03:50Z

Signed-off-by: Sergey Dmitriev [email protected]

Signed-off-by: Sergey Dmitriev <[email protected]>

kbobrovs

I think this one is good to go, given that it has been internally reviewed as well.
But please follow up with another PR addressing some documentation-related comments.

kbobrovs · 2022-02-08T16:58:16Z

sycl/include/sycl/ext/intel/experimental/esimd/common.hpp

@@ -176,11 +176,279 @@ enum class atomic_op : uint8_t {
  /// Compare and exchange (floating point).
  /// <code>if (*addr == src0) *addr = src1;</code>
  fcmpwr = 0x12,
+  fadd = 0x13,


Please follow up with a PR adding comments to these new operations.

kbobrovs · 2022-02-08T17:55:30Z

sycl/include/sycl/ext/intel/experimental/esimd/common.hpp

+/// Data size or format to read or store
+enum class lsc_data_size : uint8_t {
+  default_size = 0,
+  u8 = 1,


please add documentation comments to undocumented elements in the follow-up PR.

sycl/include/sycl/ext/intel/experimental/esimd/common.hpp

sycl/include/sycl/ext/intel/experimental/esimd/memory.hpp

kbobrovs · 2022-02-08T18:03:32Z

sycl/include/sycl/ext/intel/experimental/esimd/memory.hpp

+                                              vals.data());
+}
+
+/// 2D flat-address block load.


Some usage examples are highly desirable for this and few other lsc APIs. Please add a TODO marker in the follow-up PR.

kbobrovs · 2022-02-09T16:57:06Z

@bader, @vladimirlaz - CUDA testing stalled. This patch does can't affect CUDA - could you please merge?

sycl/include/sycl/ext/intel/experimental/esimd/common.hpp

vmustya · 2022-02-09T17:14:51Z

sycl/include/sycl/ext/intel/experimental/esimd/common.hpp

+  check_lsc_data_size<T, DS>();
+  if (DS != lsc_data_size::default_size)
+    return DS;
+  else if (sizeof(T) == 1)


Actually U8 and U16 data size values are only supported by 2d block messages. D32U8 and D32U16 should be used instead

I think this has been fixed in f922b01

Probably the fixed code will not work. When sizeof(T) is 1 or 2, values should be also bit-casted to uint32_t.

Please review the updated patch. I think this problem has been fixed.

sycl/include/sycl/ext/intel/experimental/esimd/memory.hpp

vmustya · 2022-02-09T17:18:36Z

sycl/include/sycl/ext/intel/experimental/esimd/memory.hpp

+/// @tparam BlockHeight is the block height in number of elements.
+/// @tparam NBlocks is the number of blocks.
+/// @tparam Transposed is the transposed version or not.
+/// @tparam Transformed is apply VNNI transform or not.


VNNI transform is only supported when data size is U8 or U16

I have added static asserts to check supported data sizes in e9e39f7

vmustya · 2022-02-09T17:19:22Z

sycl/include/sycl/ext/intel/experimental/esimd/memory.hpp

+/// @tparam BlockWidth is the block width in number of elements.
+/// @tparam BlockHeight is the block height in number of elements.
+/// @tparam NBlocks is the number of blocks.
+/// @tparam Transposed is the transposed version or not.


Transposed messages are only supported for U32 and U64 (with some constraints)

I have added static asserts to check supported data sizes when Transposed is true in e9e39f7

sycl/include/sycl/ext/intel/experimental/esimd/memory.hpp

kbobrovs · 2022-02-09T18:02:26Z

@bader, @vladimirlaz - CUDA testing stalled. This patch does can't affect CUDA - could you please merge?

I need to take this request back - few more comments arrived.

… with regular atomics

…ed messages

vmustya · 2022-02-11T15:11:31Z

sycl/include/sycl/ext/intel/experimental/esimd/memory.hpp

+///
+/// @tparam T is element type.
+/// @tparam NElts is the number of elements to load per address.
+/// @tparam DS is the data size.


All the Transposed (block) messages support only u32 and u64 values for DS

Added static assert to all Transposed functions in f922b01

…lsc-api

kbobrovs · 2022-03-02T01:36:17Z

The precommit failures is unrelated - timeouts on two tests:
AtomicRef/sub.cpp
AtomicRef/add.cpp

Probably caused by enormous amount of warning messages.
So, I'm merging the patch.

[SYCL][ESIMD] Add support for lsc mem access APIs

7ed1be9

Signed-off-by: Sergey Dmitriev <[email protected]>

sndmitriev requested review from kbobrovs and kychendev February 8, 2022 15:03

sndmitriev requested a review from a team as a code owner February 8, 2022 15:03

kbobrovs previously approved these changes Feb 8, 2022

View reviewed changes

petercad reviewed Feb 9, 2022

View reviewed changes

sycl/include/sycl/ext/intel/experimental/esimd/common.hpp Outdated Show resolved Hide resolved

vmustya reviewed Feb 9, 2022

View reviewed changes

petercad reviewed Feb 9, 2022

View reviewed changes

sycl/include/sycl/ext/intel/experimental/esimd/memory.hpp Outdated Show resolved Hide resolved

vmustya reviewed Feb 9, 2022

View reviewed changes

sycl/include/sycl/ext/intel/experimental/esimd/memory.hpp Show resolved Hide resolved

sndmitriev added 4 commits February 10, 2022 02:23

Removed XeHP_SDV from the list of supported platforms

0082466

Removed DG2 from the list of supported platforms for 2d intrinsics

3ecffc7

Removed cache hints from user-visible lsc SLM APIs

bd45f43

Replaced "flat-address" with "USM pointer"

b06759b

sndmitriev dismissed kbobrovs’s stale review via b06759b February 10, 2022 11:17

sndmitriev added 5 commits February 10, 2022 19:51

Removed Transposed and Transformed params from lsc_store2d template

200feff

Removed L1 cache hint from atomic operations

e32e85f

Removed NElts from atomic operations

2c2aadd

Reordered parameters for lsc atomic templates to make them consistent…

7963e86

… with regular atomics

Added static asserts to check data sizes for Transformed and Transpos…

e9e39f7

…ed messages

vmustya reviewed Feb 11, 2022

View reviewed changes

Applied suggestions from code review

f922b01

sndmitriev marked this pull request as draft February 17, 2022 09:22

sndmitriev added 4 commits February 20, 2022 21:51

Added checks for allowed cache hints

8015cad

Add special handling for u8 and u16 data types

21ec38d

Remove 'Transposed' and 'Transformed' perameters from prefetch 2d

e687516

Merge remote-tracking branch 'intel_llvm/sycl' into sndmitriev/esimd-…

5698713

…lsc-api

sndmitriev marked this pull request as ready for review February 25, 2022 15:22

kbobrovs approved these changes Mar 1, 2022

View reviewed changes

Merge remote-tracking branch 'intel_llvm/sycl' into sndmitriev/esimd-…

e5efe83

…lsc-api

kbobrovs merged commit 4bd50e7 into intel:sycl Mar 2, 2022

sndmitriev deleted the sndmitriev/esimd-lsc-api branch March 2, 2022 02:53

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[SYCL][ESIMD] Add support for lsc mem access APIs #5512

[SYCL][ESIMD] Add support for lsc mem access APIs #5512

sndmitriev commented Feb 8, 2022

kbobrovs left a comment

kbobrovs Feb 8, 2022

kbobrovs Feb 8, 2022

kbobrovs Feb 8, 2022

kbobrovs commented Feb 9, 2022

vmustya Feb 9, 2022

sndmitriev Feb 15, 2022

vmustya Feb 15, 2022

sndmitriev Feb 25, 2022

vmustya Feb 9, 2022

sndmitriev Feb 11, 2022

vmustya Feb 9, 2022

sndmitriev Feb 11, 2022

kbobrovs commented Feb 9, 2022

vmustya Feb 11, 2022

sndmitriev Feb 15, 2022

kbobrovs commented Mar 2, 2022

[SYCL][ESIMD] Add support for lsc mem access APIs #5512

[SYCL][ESIMD] Add support for lsc mem access APIs #5512

Conversation

sndmitriev commented Feb 8, 2022

kbobrovs left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

kbobrovs commented Feb 9, 2022

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

kbobrovs commented Feb 9, 2022

Choose a reason for hiding this comment

Choose a reason for hiding this comment

kbobrovs commented Mar 2, 2022