Remove rarely used shape utilities #1016

Merged
merged 2 commits into pymc-devs:main on Oct 7, 2024

Conversation

ricardoV94
Member

This PR cleans up the use of obscure shape utilities, removing them from Unique.infer_shape and other places where they are not needed.

@ricardoV94 ricardoV94 removed the request for review from Armavica October 6, 2024 08:49

codecov bot commented Oct 6, 2024

Codecov Report

Attention: Patch coverage is 90.00000% with 3 lines in your changes missing coverage. Please review.

Project coverage is 81.73%. Comparing base (fa0ab9d) to head (59ff115).
Report is 108 commits behind head on main.

Files with missing lines Patch % Lines
pytensor/tensor/extra_ops.py 88.46% 1 Missing and 2 partials ⚠️

@@            Coverage Diff             @@
##             main    #1016      +/-   ##
==========================================
- Coverage   81.75%   81.73%   -0.03%     
==========================================
  Files         183      183              
  Lines       47756    47734      -22     
  Branches    11620    11611       -9     
==========================================
- Hits        39044    39016      -28     
- Misses       6519     6525       +6     
  Partials     2193     2193              
Files with missing lines Coverage Δ
pytensor/tensor/rewriting/shape.py 82.74% <100.00%> (ø)
pytensor/tensor/shape.py 90.75% <ø> (-1.94%) ⬇️
pytensor/tensor/subtensor.py 89.32% <100.00%> (+<0.01%) ⬆️
pytensor/tensor/extra_ops.py 87.77% <88.46%> (+0.13%) ⬆️

@ricardoV94 ricardoV94 marked this pull request as ready for review October 6, 2024 09:45
@ricardoV94 ricardoV94 requested review from twiecki and Armavica October 6, 2024 09:45
Comment on lines +1197 to +1198
if axis is not None and axis < 0:
raise ValueError("Axis cannot be negative.")
Member

Why remove this possibility?

Member Author

Because it simplifies the logic in the Op. The user-facing helper pt.unique handles the negative axis and passes a non-negative one to the Op; users don't typically create Ops themselves.
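For illustration, here is a minimal sketch of the normalization the helper performs before handing the axis to the Op (assumed behavior, mirroring NumPy's normalize_axis_index; the function name and error message are hypothetical):

```python
def normalize_axis(axis: int, ndim: int) -> int:
    """Map a possibly-negative axis to its non-negative equivalent,
    so the Op itself only ever sees axis >= 0."""
    if not -ndim <= axis < ndim:
        raise ValueError(f"axis {axis} is out of bounds for an array of dimension {ndim}")
    # For a valid negative axis, axis % ndim yields the positive index,
    # e.g. -1 -> ndim - 1.
    return axis % ndim

print(normalize_axis(-1, 3))  # → 2
```

With this split, the Op can simply reject negative axes, as in the snippet above.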

ret[0] = tuple(
fgraph.shape_feature.shape_ir(i, node.outputs[0]) for i in range(ndim)
)
[x_shape] = i0_shapes
Member

I am not sure I understand what is happening in this function, but just to check, shouldn't there be a case for return_index and return_counts as well?

Member Author

There is. i0_shapes are the input dimensions, so those don't change with the number of outputs. return_index/counts are outputs, and they are always vectors.

We set out_shapes = [out.shape[0] for out in node.outputs] by default, which will always work for return_index and return_counts. Then we have special logic for the main output when axis is not None, and for return_inverse, which is not just out.shape[0].
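The claim that return_index and return_counts are always vectors can be checked with numpy.unique, whose outputs mirror the Op's (a sketch for intuition, not the PyTensor code itself):

```python
import numpy as np

x = np.array([[1, 0], [1, 0], [2, 3]])
values, index, counts = np.unique(x, axis=0, return_index=True, return_counts=True)

# index and counts are always 1-D, one entry per unique row, so a
# shape graph of out.shape[0] is always valid for them.
assert index.ndim == 1 and counts.ndim == 1

# The main output is the exception: it keeps the non-axis dimensions
# of the input, (n_unique, 2) here.
assert values.shape == (2, 2)
```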

Member Author

@ricardoV94 ricardoV94 Oct 7, 2024

The big picture is that the function tries to return a graph for the shape of the outputs given the input shapes (and possibly values, which can be retrieved from node.inputs). The default shape graph is just output.shape, which we try to avoid when possible, since we would rather not compute the Op just to find out its shape.

For unique we can do that for some of the output dimensions, but not all (we only know how many repeated values there are if we evaluate Unique).

This method combines the dims we can infer from the input shapes with those we can only get after computing the outputs, via out.shape[0] or out.shape[x].

@@ -1293,6 +1269,9 @@ def unique(
* the number of times each unique value comes up in the input array

"""
ar = as_tensor_variable(ar)
if axis is not None:
axis = normalize_axis_index(axis, ar.ndim)
Member Author

@Armavica here is where we allow negative axis for the user

@ricardoV94 ricardoV94 merged commit 1c2bc8f into pymc-devs:main Oct 7, 2024
60 of 61 checks passed