[RFC] Adding support for non-constant dims for aten.view #1131

Closed
gpetters94 opened this issue Aug 1, 2022 · 13 comments

@gpetters94
Collaborator

I'm working on lowering OPT, and I'm running into the following:

error: failed to legalize operation 'torch.aten.view' that was explicitly marked illegal
note: see current operation: %932 = "torch.aten.view"(%898, %931) : (!torch.vtensor<[1,12,7,64],f32>, !torch.list<int>) -> !torch.vtensor<[12,7,64],f32>

Inspecting the lowering of aten.view, it looks like the output shape is [-1, -1, 64] because the first two input dims aren't constants. The solution I'd like to write is to recursively follow the dims up the tree, verifying that all the ops are either constants, no-ops (e.g. NumToTensor), or math ops (e.g. multiplication, addition), and then performing the math statically to determine the output shape. Does this sound like how we want to implement this?
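
Roughly, I'm imagining a helper along these lines (a hypothetical sketch only; tryFoldSizeToConstant is an invented name and the op/API spellings are from memory, not existing torch-mlir code):

#include <optional>
#include "torch-mlir/Dialect/Torch/IR/TorchOps.h"

// Hypothetical sketch of the proposal: walk the defining ops of a size value
// and fold it to a constant when everything feeding it is statically known.
static std::optional<int64_t> tryFoldSizeToConstant(mlir::Value size) {
  namespace Torch = mlir::torch::Torch;
  int64_t c;
  // Base case: the value is already a torch.constant.int.
  if (mlir::matchPattern(size, Torch::m_TorchConstantInt(&c)))
    return c;
  mlir::Operation *def = size.getDefiningOp();
  if (!def)
    return std::nullopt;
  // No-ops: look through scalar<->tensor conversions.
  if (mlir::isa<Torch::PrimNumToTensorScalarOp, Torch::AtenIntTensorOp>(def))
    return tryFoldSizeToConstant(def->getOperand(0));
  // Math ops: fold e.g. multiplication when both inputs fold.
  if (mlir::isa<Torch::AtenMulTensorOp>(def)) {
    auto lhs = tryFoldSizeToConstant(def->getOperand(0));
    auto rhs = tryFoldSizeToConstant(def->getOperand(1));
    if (lhs && rhs)
      return *lhs * *rhs;
  }
  return std::nullopt;
}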

@gpetters94 changed the title from "[RFC] Adding support for multiple unknown dims for aten.view" to "[RFC] Adding support for non-constant dims for aten.view" on Aug 1, 2022
@silvasean
Contributor

The output shape looks like [12, 7, 64] in your snippet and not [-1, -1, 64]. Can you show the actual IR snippet you are dealing with?

@gpetters94
Collaborator Author

In the actual processing of aten.view, it checks whether each input dim is a constant; if not, it assigns kUnknownDim to it, and in this case the first two inputs are not constants. The code is here.
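
Paraphrasing, the check is shaped roughly like this (a sketch, not the exact code; getStaticSizeOrUnknown is an invented name and kUnknownDim here is just a stand-in for the lowering's sentinel):

#include "torch-mlir/Dialect/Torch/IR/TorchOps.h"

constexpr int64_t kUnknownDim = -1; // stand-in for the lowering's sentinel

// Paraphrased sketch of the per-element check in the aten.view lowering:
// only sizes that come from a torch.constant.int become static output dims.
static int64_t getStaticSizeOrUnknown(mlir::Value sizeValue) {
  int64_t size;
  if (mlir::matchPattern(sizeValue,
                         mlir::torch::Torch::m_TorchConstantInt(&size)))
    return size;
  return kUnknownDim; // the first two sizes in the op above take this path
}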

@silvasean
Contributor

Can you show the IR before the pass?

@silvasean
Contributor

(for future reference, it's usually important to show a reduced, fully valid IR example with any bug reports like this)

@gpetters94
Collaborator Author

Here's the IR after failure: https://gist.github.com/gpetters94/af96b032acb0e6c6274af9aff62ec5e3

The relevant part is:

  %136 = torch.aten.mul.Tensor %123, %71 : !torch.vtensor<[],si64>, !torch.vtensor<[],si64> -> !torch.vtensor<[],si64>
  %137 = torch.aten.Int.Tensor %136 : !torch.vtensor<[],si64> -> !torch.int
  %138 = torch.aten.Int.Tensor %136 : !torch.vtensor<[],si64> -> !torch.int
  %139 = torch.aten.Int.Tensor %136 : !torch.vtensor<[],si64> -> !torch.int
  %140 = torch.prim.ListConstruct %int1, %int7, %int12, %int64 : (!torch.int, !torch.int, !torch.int, !torch.int) -> !torch.list<int>
  %141 = torch.aten.view %126, %140 : !torch.vtensor<[1,7,768],f32>, !torch.list<int> -> !torch.vtensor<[1,7,12,64],f32>
  %142 = torch.aten.transpose.int %141, %int1, %int2 : !torch.vtensor<[1,7,12,64],f32>, !torch.int, !torch.int -> !torch.vtensor<[1,12,7,64],f32>
  %143 = torch.aten.contiguous %142, %int0 : !torch.vtensor<[1,12,7,64],f32>, !torch.int -> !torch.vtensor<[1,12,7,64],f32>
  %144 = torch.aten.numel %143 : !torch.vtensor<[1,12,7,64],f32> -> !torch.int
  %145 = torch.prim.NumToTensor.Scalar %144 : !torch.int -> !torch.vtensor<[],si64>
  %146 = torch.aten.div.Tensor_mode %145, %136, %str : !torch.vtensor<[],si64>, !torch.vtensor<[],si64>, !torch.str -> !torch.vtensor<[],si64>
  %147 = torch.aten.div.Tensor_mode %146, %70, %str : !torch.vtensor<[],si64>, !torch.vtensor<[],si64>, !torch.str -> !torch.vtensor<[],si64>
  %148 = torch.aten.Int.Tensor %147 : !torch.vtensor<[],si64> -> !torch.int
  %149 = torch.prim.ListConstruct %139, %148, %int64 : (!torch.int, !torch.int, !torch.int) -> !torch.list<int>
  %150 = torch.aten.view %143, %149 : !torch.vtensor<[1,12,7,64],f32>, !torch.list<int> -> !torch.vtensor<[12,7,64],f32>

@gpetters94
Collaborator Author

Here's the distilled version:

func.func @forward(%arg0: !torch.vtensor<[1,12,7,64],f32>) -> !torch.vtensor<[12,7,64],f32> {
  %str = torch.constant.str "floor"
  %int7 = torch.constant.int 7
  %int12 = torch.constant.int 12
  %int64 = torch.constant.int 64
  %144 = torch.aten.numel %arg0 : !torch.vtensor<[1,12,7,64],f32> -> !torch.int
  %145 = torch.prim.NumToTensor.Scalar %144 : !torch.int -> !torch.vtensor<[],si64>
  %tensor7 = torch.prim.NumToTensor.Scalar %int7 : !torch.int -> !torch.vtensor<[],si64>
  %tensor64 = torch.prim.NumToTensor.Scalar %int64 : !torch.int -> !torch.vtensor<[],si64>
  %146 = torch.aten.div.Tensor_mode %145, %tensor7, %str : !torch.vtensor<[],si64>, !torch.vtensor<[],si64>, !torch.str -> !torch.vtensor<[],si64>
  %147 = torch.aten.div.Tensor_mode %146, %tensor64, %str : !torch.vtensor<[],si64>, !torch.vtensor<[],si64>, !torch.str -> !torch.vtensor<[],si64>
  %148 = torch.aten.Int.Tensor %147 : !torch.vtensor<[],si64> -> !torch.int
  %149 = torch.prim.ListConstruct %int12, %148, %int64 : (!torch.int, !torch.int, !torch.int) -> !torch.list<int>
  %150 = torch.aten.view %arg0, %149 : !torch.vtensor<[1,12,7,64],f32>, !torch.list<int> -> !torch.vtensor<[12,7,64],f32>
  return %150 : !torch.vtensor<[12,7,64],f32>
}

@silvasean
Contributor

silvasean commented Aug 2, 2022

It looks like we have already done all the shape math statically, because the result shape is inferred as !torch.vtensor<[12,7,64],f32>. So I don't want to do any special local logic here for that.

You should be able to extend #935 for torch.aten.div.Tensor_mode to do more folding here if that is useful as well.

@gpetters94
Collaborator Author

So should I just rewrite aten.view to use the statically-inferred output shape when the current logic fails?

@silvasean
Contributor

> So should I just rewrite aten.view to use the statically-inferred output shape when the current logic fails?

That would make sense to me. Actually, I would add a canonicalization that replaces the view sizes operand with a constant list if the result shape is static (and the operand is not already a constant list).
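
A rough sketch of what such a pattern could look like (sketch only, not the code that eventually landed; accessor names like getSize()/getSelf() and the builder signatures are from memory and may differ across versions):

#include "mlir/IR/PatternMatch.h"
#include "torch-mlir/Dialect/Torch/IR/TorchOps.h"

using namespace mlir;
using namespace mlir::torch::Torch;

namespace {
// Sketch: if shape inference already produced a fully static result type for
// aten.view, replace its size operand with a constant int list.
struct FoldViewSizesToConstants : public OpRewritePattern<AtenViewOp> {
  using OpRewritePattern<AtenViewOp>::OpRewritePattern;
  LogicalResult matchAndRewrite(AtenViewOp op,
                                PatternRewriter &rewriter) const override {
    // Nothing to do if the size operand is already a constant int list.
    SmallVector<int64_t> alreadyConstant;
    if (matchPattern(op.getSize(), m_TorchListOfConstantInts(alreadyConstant)))
      return failure();
    // Only fires when the inferred result shape has no unknown dims.
    auto resultType = op.getType().dyn_cast<ValueTensorType>();
    if (!resultType || !resultType.hasSizes())
      return failure();
    ArrayRef<int64_t> sizes = resultType.getSizes();
    if (llvm::any_of(sizes, [](int64_t s) { return s == kUnknownSize; }))
      return failure();
    // Materialize the inferred shape as torch.constant.int values and swap in
    // a new size list built from them.
    Location loc = op.getLoc();
    SmallVector<Value> sizeValues;
    for (int64_t s : sizes)
      sizeValues.push_back(
          rewriter.create<ConstantIntOp>(loc, rewriter.getI64IntegerAttr(s)));
    Value newSizeList = rewriter.create<PrimListConstructOp>(
        loc, ListType::get(IntType::get(op.getContext())), sizeValues);
    rewriter.replaceOpWithNewOp<AtenViewOp>(op, op.getType(), op.getSelf(),
                                            newSizeList);
    return success();
  }
};
} // namespace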

@gpetters94
Collaborator Author

Sure, I can do that. Where are canonicalizations added?

@silvasean
Contributor

TorchOps.cpp -- you need to add let hasCanonicalizer = 1 to the ODS definition.
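
Concretely, once hasCanonicalizer is set, the hook in TorchOps.cpp would be along these lines (sketch; FoldViewSizesToConstants is the hypothetical pattern name from the sketch above, not an existing class):

// Definition of the declaration that hasCanonicalizer = 1 generates for the op.
void AtenViewOp::getCanonicalizationPatterns(RewritePatternSet &patterns,
                                             MLIRContext *context) {
  patterns.add<FoldViewSizesToConstants>(context);
}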

@silvasean
Contributor

See here for more info: https://mlir.llvm.org/docs/Canonicalization/

silvasean added a commit that referenced this issue Aug 17, 2022
This introduces a new pass LowerToBackendContract (better name very
welcome) which performs the bulk of the simplifications that we do,
such as
- shape refinement
- dtype refinement
- maximizing value semantics
- inlining global slots
- decomposing complex ops

The key difference from before is that it iterates the set of
transformations, which can help to break a number of "catch-22" issues
where one simplification depends on another, the latest example being
here:
#1131

This also exposed that RefineTypes was sometimes crashing/asserting for
certain inputs. This commit hardens it a bit.
@gpetters94
Collaborator Author

Implemented this in #1337

qedawkins pushed a commit to nod-ai/torch-mlir that referenced this issue Oct 3, 2022