
✨[Feature] Setting the input data type of models, such as INT32, is not supported #388


Closed
ruoqianguo opened this issue Mar 4, 2021 · 5 comments · Fixed by #510
Labels: feature request (New feature or request)

@ruoqianguo
Contributor

Is your feature request related to a problem? Please describe.

When I tried to use TRTorch to convert PyTorch models whose inputs have data type INT32, such as BERT, it failed during model.forward(inputs). The error message is below. It appears that the TRT model's input data type is tied to op_precision.

Describe the solution you'd like

I would like to add an interface for specifying the data type of each input.
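For illustration, a minimal sketch of what such an interface could look like, assuming a per-input dtype field on the compile spec (the field name input_dtypes is hypothetical, not an existing TRTorch API):

// Hypothetical sketch: extending the compile spec with per-input dtypes.
// `input_dtypes` does not exist in TRTorch; it is illustrative only.
trtorch::CompileSpec spec(
    std::vector<trtorch::CompileSpec::InputRange>{in1.sizes().vec(), in2.sizes().vec()});
spec.input_dtypes = {torch::kInt32, torch::kInt32};  // hypothetical field
auto trt_mod = trtorch::CompileGraph(mod, spec);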

Describe alternatives you've considered

Additional context

// Build two INT32 input tensors (e.g. BERT-style token ids).
torch::Tensor in1 = torch::randint(0, 4, {128, 203}, torch::kCUDA).to(torch::kInt32);
torch::Tensor in2 = torch::randint(0, 4, {128, 203}, torch::kCUDA).to(torch::kInt32);
std::vector<at::Tensor> inputs_trt;
inputs_trt.push_back(in1);
inputs_trt.push_back(in2);

// Wrap the tensors as IValues for Module::forward().
std::vector<torch::jit::IValue> inputs_trt_;
for (const auto& in : inputs_trt) {
  inputs_trt_.push_back(torch::jit::IValue(in.clone()));
}

torch::jit::Module mod;
try {
  // Deserialize the ScriptModule from a file using torch::jit::load().
  mod = torch::jit::load(path);
} catch (const c10::Error& e) {
  std::cerr << "error loading the model\n";
}
mod.eval();
mod.to(torch::kCUDA);

// Compile with TRTorch; only input shapes can be specified here, not dtypes.
auto trt_mod = trtorch::CompileGraph(
    mod, std::vector<trtorch::CompileSpec::InputRange>{in1.sizes().vec(), in2.sizes().vec()});
auto trt_out = trt_mod.forward(inputs_trt_);

When executing trt_mod.forward(), the following error appears:
[screenshot of the runtime error message]

ruoqianguo added the feature request label on Mar 4, 2021
@peri044
Collaborator

peri044 commented Mar 5, 2021

@narendasan
I discovered the same bug while writing test cases for aten::cast and only had a workaround for those tests. TRTorch assumes the input is float via ctx->input_type.

I agree we need to provide an interface for specifying the data types of input tensors.

Also, TRT supports the INT32 input data type. When I had two inputs (float32, int32) to the graph, TRT worked fine. I have not encountered a case with only int32 inputs yet, but I believe TRT supports it: https://docs.nvidia.com/deeplearning/tensorrt/api/c_api/classnvinfer1_1_1_i_network_definition.html#a06a61f560bdf6197afd3368937f62025
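For reference, a minimal sketch of declaring an INT32 network input through the raw TensorRT C++ API (the tensor name "input_ids" and the dimensions are made up for illustration):

#include "NvInfer.h"

// Declare an INT32-typed input on an existing INetworkDefinition.
// The name and shape here are illustrative only.
nvinfer1::ITensor* input_ids = network->addInput(
    "input_ids",                     // tensor name
    nvinfer1::DataType::kINT32,      // requested input dtype
    nvinfer1::Dims2{128, 203});      // batch size x sequence length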

@narendasan
Collaborator

I think those (float32, int32) cases are typically ones where the int32 inputs are shape tensors. I am not sure there is actual input data in int32 (I could be completely wrong).

@borisfom
Collaborator

Oh yes, there is actual input data in INT32 (and even INT64) for BERT and other transformers.
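For instance, BERT-style inputs are token ids and attention masks, i.e. real integer data rather than shape tensors (a small libtorch sketch; the vocabulary size and shapes are illustrative):

// BERT-style inputs: token ids and an attention mask, both integer data.
torch::Tensor input_ids = torch::randint(0, 30522, {1, 128}, torch::kInt64);  // vocab ids
torch::Tensor attention_mask = torch::ones({1, 128}, torch::kInt64);          // valid-token mask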

@narendasan
Collaborator

#412

@ncomly-nvidia
Contributor

Comment added by Nick Comly in Aha!

If Input provided: Use specified precision

Else: use engine precision (INT8 uses FP32 input)
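A minimal sketch of that resolution rule as code (a hypothetical helper, assuming the spec carries an optional per-input dtype):

#include <NvInfer.h>
#include <c10/util/Optional.h>

// Hypothetical helper: pick the dtype for a network input.
nvinfer1::DataType resolve_input_dtype(
    c10::optional<nvinfer1::DataType> user_dtype,  // from the proposed Input spec
    nvinfer1::DataType engine_precision) {
  if (user_dtype) {
    return *user_dtype;  // Input provided: use the specified precision
  }
  // Otherwise fall back to engine precision; INT8 engines still take FP32 inputs.
  return engine_precision == nvinfer1::DataType::kINT8
      ? nvinfer1::DataType::kFLOAT
      : engine_precision;
}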
