
fix bugs in embedding converter #403


Merged — 1 commit merged into pytorch:master on Mar 22, 2021

Conversation

@ruoqianguo (Contributor):

Signed-off-by: Ruoqian Guo [email protected]

Description

In my practice, the indicesTensor in aten::embedding is sometimes an IValue, so I replaced ITensor() with ITensorOrFreeze().
According to the documentation, after calling setType the type of a tensor is unchanged unless the tensor is a network input tensor or is marked as an output tensor or shape output tensor. So I replaced the setType call with addIdentity, to avoid changing the input data type while still ensuring indicesTensor is cast to INT32.

In addition, I used the linters to ensure that my code matches the style guidelines. The linters changed the entire file (core/conversion/converters/impl/select.cpp), but I only modified the aten::embedding part of select.cpp.
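The idea of the fix — cast the indices to INT32 *inside* the network (identity layer) and gather on axis 0, instead of retyping the network input itself — can be sketched outside TensorRT. This is a minimal numpy analogue with hypothetical variable names, not the TRTorch code:

```python
import numpy as np

# Analogue of the converter's logic: the embedding lookup is a gather on
# axis 0, and the indices are cast to int32 inside the "network" rather
# than retyping the network input.
weight = np.arange(24, dtype=np.float32).reshape(6, 4)       # embedding table
indices_in = np.array([0.0, 2.0, 5.0], dtype=np.float32)     # input stays float

# analogue of addIdentity + setOutputType(0, kINT32)
indices_i32 = indices_in.astype(np.int32)

# analogue of IGatherLayer on axis 0
out = np.take(weight, indices_i32, axis=0)
print(out.shape)  # (3, 4)
```

Note that `indices_in` keeps its float dtype throughout; only the internal copy is integer, which mirrors why the network's declared input type no longer changes.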

Fixes # (issue)

Type of change

Please delete options that are not relevant and/or add your own.

  • Bug fix (non-breaking change which fixes an issue)
  • Breaking change (fix or feature that would cause existing functionality to not work as expected)

Checklist:

  • My code follows the style guidelines of this project (You can use the linters)
  • I have performed a self-review of my own code
  • I have commented my code, particularly in hard-to-understand areas and hacks
  • I have made corresponding changes to the documentation
  • I have added tests to verify my fix or my feature
  • New and existing unit tests pass locally with my changes

Comment on lines 177 to 184 (core/conversion/converters/impl/select.cpp):

 {"aten::embedding(Tensor weight, Tensor indices, int padding_idx=-1, bool scale_grad_by_freq=False, bool sparse=False) -> (Tensor)",
  [](ConversionCtx* ctx, const torch::jit::Node* n, args& args) -> bool {
    auto embeddingTensor = args[0].ITensorOrFreeze(ctx);
-   auto indicesTensor = args[1].ITensor();
+   auto indicesTensor = args[1].ITensorOrFreeze(ctx);
    // Set datatype for indices tensor to INT32
-   indicesTensor->setType(nvinfer1::DataType::kINT32);
+   auto identity = ctx->net->addIdentity(*indicesTensor);
+   identity->setOutputType(0, nvinfer1::DataType::kINT32);
+   indicesTensor = identity->getOutput(0);

    // IGatherLayer takes in input tensor, the indices, and the axis of input tensor to take indices from
    auto gather_layer = ctx->net->addGather(*embeddingTensor, *indicesTensor, 0);
@ruoqianguo (Contributor, Author):
I only modified this part in select.cpp.

@@ -106,7 +106,7 @@ TEST(Converters, ATenEmbeddingConvertsCorrectly) {
auto jit_results = trtorch::tests::util::RunGraph(g, params, {jit_in});

// Run TensorRT
- auto options_trt = torch::TensorOptions().device(torch::kCUDA, 0).dtype(torch::kI32);
+ auto options_trt = torch::TensorOptions().device(torch::kCUDA, 0).dtype(torch::kFloat);
Collaborator:
Do the indices only come in float? Can they come in other types? We should maybe duplicate the test instead of modifying it.

@ruoqianguo (Contributor, Author) — Mar 18, 2021:
When using the setType function in the embedding converter, the input data type is changed from kFloat to kInt32. Since I have replaced setType with addIdentity, the input data type no longer changes. If I still use kInt32 in the test, it fails with an error like: "Expected inputs[pyt_idx].dtype() == expected_type to be true but got false. Expected input tensors to have type Float, found type int".
In my opinion, a single operation in one layer of the network should not directly change the input data type of the entire network. Also, issue #388 may be resolvable by setting all input data types to kFloat.

Collaborator:

Yeah, this is a good catch. I was just wondering about the test and why it wasn't duplicated. I think it's not a massive deal; other than that this patch lgtm.

@narendasan (Collaborator):

When I run the linter on my machine on your branch it switches everything back. How are you running the linter? do you perhaps have a global clang-format or something?

@ruoqianguo (Contributor, Author):

> When I run the linter on my machine on your branch it switches everything back. How are you running the linter? do you perhaps have a global clang-format or something?

Previously I used clang-format 10.0; now I have changed it to clang-format 9.0.1-12, and everything in select.cpp is back to the original formatting.

Besides, after I run the command "bazel run //tools/linter:cpplint -- //..." with clang-format 9.0.1-12, some other files are changed as well:
[screenshot: list of modified files]
I am curious why the other files are also modified. Is it because of my clang-format version?

@narendasan (Collaborator):

Hmm, that is odd. I did not realize that different clang-format versions would give different lintings. I will look into how the canonical linter (the one that runs on GitHub for PRs) is set up and make it standard.

@narendasan (Collaborator) left a review comment:

lgtm

@narendasan merged commit 9439059 into pytorch:master on Mar 22, 2021