Parametrizations tutorial #1444
Looks like this requires PyTorch 1.9, which we don't build on yet. You can remove the `_tutorial` from the file name to add it without running the code. If you want to wait for 1.9 to publish, I have some other options for testing this.
```python
###############################################################################
# We can then use this idea to implement a linear layer with symmetric weights:
class LinearSymmetric(nn.Module):
```
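(The quoted hunk is cut off here. For orientation, a hedged sketch of what such a layer could look like follows; the `symmetric` helper and the constructor signature are assumptions, not necessarily the tutorial's exact code.)

```python
import torch
import torch.nn as nn

def symmetric(X):
    # Build a symmetric matrix from the upper-triangular part of X
    return X.triu() + X.triu(1).transpose(-1, -2)

class LinearSymmetric(nn.Module):
    def __init__(self, n_features):
        super().__init__()
        # Unconstrained parameter; only its upper triangle is effectively used
        self.weight = nn.Parameter(torch.rand(n_features, n_features))

    def forward(self, x):
        A = symmetric(self.weight)
        return x @ A

layer = LinearSymmetric(3)
out = layer(torch.rand(8, 3))
```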
Less of a straw-man (for this simple case) would be a module deriving from `Linear` and adding a `weight` property.
That one's tricky. If you think about it, it'd go as something like this, but with a bit more flair:

```python
import torch.nn as nn

class MyLin(nn.Linear):
    def __init__(self, *args, **kwargs):
        super().__init__(*args, **kwargs)
        self._weight = self.weight  # Why does this line work?
        delattr(self, "weight")

    @property
    def weight(self):
        return self._weight

lin = MyLin(3, 4)
```
The line above works because of how `nn.Module` handles `__getattr__` and `__setattr__`. One could almost say that it works "out of pure chance". What happens in that line is:

1. It calls `__getattribute__`.
2. `__getattribute__` finds the property `self.weight` and calls it.
3. The property looks for `self._weight`. At that time it does not exist, so it raises an `AttributeError`.
4. Since `__getattribute__` got an `AttributeError`, `nn.Module.__getattr__` is called.
5. `nn.Module.__getattr__` finds the `self.weight` that was created in `nn.Linear.__init__` and returns it.
This is quite a mess really. That's why I went for the simpler method here.
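As an aside, the same fallback chain can be reproduced in plain Python, without PyTorch; the `Fallback` class below is a made-up sketch for illustration only:

```python
class Fallback:
    @property
    def weight(self):
        # self._weight does not exist, so this raises AttributeError
        return self._weight

    def __getattr__(self, name):
        # Called only after normal attribute lookup (__getattribute__) fails,
        # mimicking what nn.Module.__getattr__ does for registered parameters
        print(f"__getattr__ called for {name!r}")
        if name == "weight":
            return "parameter stored elsewhere"
        raise AttributeError(name)

obj = Fallback()
print(obj.weight)
# __getattr__ called for '_weight'
# __getattr__ called for 'weight'
# parameter stored elsewhere
```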
I had used `register_parameter('weight', None)` instead: https://gist.github.com/vadimkantorov/4f34fe60d2ef00e72dcad16512d224af, seems to work for `Conv1d`. And maybe even directly `delattr(self, 'weight')` could work?
Ah, the trick is that you still want to access the old `_weight`, while I didn't need it.
That works for exactly the same convoluted reason, but with `__setattr__` playing the role of `__getattr__` and `__getattribute__`. Again, it works, but it's tricky to know why it works.
```python
# are properly registered as submodules of the original module. As such, the same rules
# for registering parameters in a module apply to register a parametrization.
# For example, if a parametrization has parameters, these will be moved from CPU
# to CUDA when calling ``model = model.cuda()``.
```
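A hedged sketch of what that quoted paragraph means in practice (assuming PyTorch 1.9+ with `torch.nn.utils.parametrize` and an available GPU; `Scale` is a made-up parametrization, not from the tutorial):

```python
import torch
import torch.nn as nn
import torch.nn.utils.parametrize as P

class Scale(nn.Module):
    """Toy parametrization that owns a learnable parameter of its own."""
    def __init__(self):
        super().__init__()
        self.scale = nn.Parameter(torch.tensor(2.0))

    def forward(self, X):
        return self.scale * X

model = nn.Linear(4, 4)
P.register_parametrization(model, "weight", Scale())

if torch.cuda.is_available():
    model = model.cuda()
    # The parametrization is a submodule, so its own parameter moved to the GPU too
    print(model.parametrizations.weight[0].scale.device)  # cuda:0
```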
It may be good to add a note explaining how reparametrization magic is implemented under the hood
I think that would be too technical (it uses dynamically generated classes). Something that could be done is to stress more what happens after you call `register_parametrization`: in particular, the fact that it creates a `ModuleDict` under `module.parametrizations`, that each of those entries is a `ParametrizationList`, and so on.

In fact, that is all `register_parametrization` does, modulo the dynamically-generated-classes magic. This would also help clarify why it can be used with `nn.Module`s but not with plain old functions.
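To make that concrete, here is a minimal sketch (assuming PyTorch 1.9+; `Symmetric` is just an example parametrization) of what `register_parametrization` leaves behind on the module:

```python
import torch
import torch.nn as nn
import torch.nn.utils.parametrize as P

class Symmetric(nn.Module):
    def forward(self, X):
        # Reflect the upper triangle to produce a symmetric matrix
        return X.triu() + X.triu(1).transpose(-1, -2)

layer = nn.Linear(3, 3)
P.register_parametrization(layer, "weight", Symmetric())

print(type(layer.parametrizations))         # an nn.ModuleDict
print(type(layer.parametrizations.weight))  # a ParametrizationList
print(layer.parametrizations.weight[0])     # the Symmetric() module we registered
print(P.is_parametrized(layer, "weight"))   # True
```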
I think it's still better to mention it (even if technical); then there wouldn't be a surprise at some generated class name if there is an exception (and especially when users debug interactively: does it affect interactive debugging in an IDE? That's a valid question). It would also be a useful reminder for users not to do any similar magic of their own if they fear interference.
I do agree that such an explanation would be nice, but I'm not sure this tutorial is the right place for it.
A note in the docs, similar to the ones we have about autograd here for example, sounds more appropriate.
```python
# matrices. Using these two facts, we may reuse the parametrizations
class MatrixExponential(nn.Module):
    def forward(self, X):
        return torch.matrix_exp(X)
```
Could `torch.matrix_exp` be directly used instead of `MatrixExponential()`? In both cases, I think this should be discussed explicitly.
No, it could not. At the moment, parametrizations are defined to be `nn.Module`s, so they do not support the functional API.

I do not know whether it is necessary to discuss this, as the whole tutorial makes it clear that a parametrization is just a plain `nn.Module`. What do you think @albanD?
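(If one really wanted to use `torch.matrix_exp` here, a small wrapper module would do. This is only a sketch under the current API; the `Wrap` helper below is made up, not part of PyTorch.)

```python
import torch
import torch.nn as nn
import torch.nn.utils.parametrize as P

class Wrap(nn.Module):
    """Hypothetical adapter that turns a plain function into a parametrization module."""
    def __init__(self, fn):
        super().__init__()
        self.fn = fn

    def forward(self, X):
        return self.fn(X)

layer = nn.Linear(3, 3)
# torch.matrix_exp cannot be registered directly, but a module wrapping it can
P.register_parametrization(layer, "weight", Wrap(torch.matrix_exp))
print(layer.weight)  # the matrix exponential of the underlying unconstrained parameter
```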
My reasoning: in many places in PyTorch one expects both functions and module objects to work (especially in older areas of PyTorch). Whenever this is violated (quantization, module tracing, etc.) I'm always suspicious of undeclared magic :)

In this case it begs the question, since the module here is just a wrapper.
That is indeed an interesting question.
But for now everything is expected to be an `nn.Module`, and that sounds good enough to me.
If we want to relax that in the future we might be able to do so, but that would be out of scope for this tutorial.
```python
        return A

###############################################################################
# In this case, it is not true that ``forward(right_inverse(X)) == X``. This is
```
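(The quoted hunk cuts off mid-sentence. For context, the situation it describes could look like the following sketch, where an exact right inverse, the matrix logarithm, is skipped in favour of the identity; this is an assumption about the surrounding tutorial code, not a verbatim quote.)

```python
import torch
import torch.nn as nn

class MatrixExponential(nn.Module):
    def forward(self, X):
        return torch.matrix_exp(X)

    def right_inverse(self, A):
        # A true right inverse would be the matrix logarithm, which is expensive.
        # Returning A unchanged is cheap, but then
        # forward(right_inverse(A)) == matrix_exp(A) != A in general.
        return A
```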
Would reparametrizations work for double-backward? A related question was asked here: pytorch/pytorch#55368
That is a good point. At the moment, that example also breaks with the same error message when using

```python
import torch.nn.utils.parametrize as P

model = P.register_parametrization(torch.nn.Linear(2, 5), "weight", torch.nn.ReLU())
```

That being said, it smells like there's a problem in the implementation of `register_parametrization` (?). Perhaps @albanD can give a bit more insight into what's going on.
There is no reason it wouldn't.
Your implementation of the reparametrization will need to be double differentiable though.
I guess it's also worth an explicit discussion somewhere
Not sure what you mean about the ReLU part though. This works fine on Colab for me:

```python
# !pip uninstall --y torch
# !pip install --pre torch -f https://download.pytorch.org/whl/nightly/cpu/torch_nightly.html
import torch
import torch.nn.utils.parametrize as P

model = P.register_parametrization(torch.nn.Linear(2, 5), "weight", torch.nn.ReLU())

inp = torch.rand(1, 2, requires_grad=True)
out = model(inp)
g = torch.autograd.grad(out.sum(), model.parameters(), create_graph=True)
print(g)
g[1].exp().sum().backward()
print(inp.grad)
```
I meant that this example (which is a modification of the one in that PR) breaks:

```python
import torch
import torch.nn.utils.parametrize as P

model = P.register_parametrization(torch.nn.Linear(2, 5), "weight", torch.nn.ReLU())
opt1 = torch.optim.SGD(model.parameters(), lr=1e-3)
opt2 = torch.optim.SGD(model.parameters(), lr=1e-3)

output = model(torch.randn(7, 2))
loss = output.abs().mean()
opt1.zero_grad(); loss.backward(retain_graph=True); opt1.step()  # first propagation
opt2.zero_grad(); loss.backward(); opt2.step()  # second
```
Looks quite good to me.
Only phrasing and minor comments.
@albanD I addressed the points that you raised and I corrected a few other things (the code does not break now... I had forgotten to check that...). Even so, there were no major changes in the text.
Just a minor update and it looks mostly good to me.
Add Alban's suggestions. Correct the code. Better spacing after enumeration.

* Parametrizations tutorial
* Add remove_parametrization
* Correct name
* minor
* Proper version number
* Fuzzy spellcheck
* version
* Remove _tutorial from name
* Forgot to add the file...
* Rename parametrizations_tutorial to parametrizations everywhere; add Alban's suggestions; correct the code; better spacing after enumeration
* Minor
* Add more comments
* Minor
* Prefer unicode over math
* Minor
* minor
* Corrections

Co-authored-by: Brian Johnson <[email protected]>
Creates the tutorial for the parametrizations functionality. This was discussed in issue pytorch/pytorch#7313 and implemented in PR pytorch/pytorch#33344.
cc @albanD @IvanYashchuk @toshas