Do not use deepcopy to copy nn.modules (including entire models) #2177
cc: @mthrok

Thanks for creating this issue @mikekgfb. Will submit a PR with the fixes soon.
@svekars I don't think this issue should be closed. The PR I sent out fixed the issue for https://pytorch.org/tutorials/beginner/transformer_tutorial.html, but deepcopy is still used in several other places, e.g. https://github.com/search?q=repo%3Apytorch%2Ftutorials%20copy.deepcopy(&type=code
Can the owners of the tutorials please take a look and submit corrections? If any of these tutorials describe an obsolete feature, please submit a PR to remove them.

For quantization, we were assuming that
cc @qihqi |
Q: Are you sure deepcopy is not supposed to be defined for nn.Module? https://github.com/pytorch/pytorch/blob/master/torch/optim/swa_utils.py appears to be using deepcopy.
Thanks for the information. |
copy.deepcopy() is not defined for nn.Module and does not reliably copy an nn.Module hierarchy, such as a model or partial model. Our tutorials should not use copy.deepcopy(), as this will induce our users to make incorrect use of the primitive. Please update https://pytorch.org/tutorials/beginner/transformer_tutorial.html and other tutorials to avoid it. The recommended way to snapshot a trained model is torch.save(). To create multiple clones of an untrained model, construct each copy from first principles from the model's architectural parameters.
cc @pytorch/team-text-core @Nayef211
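The recommendation above can be sketched roughly as follows. The `make_model` helper and its dimensions are hypothetical stand-ins for a real model's architectural parameters, and an in-memory buffer stands in for a checkpoint file:

```python
# Sketch of the alternatives to copy.deepcopy(model) suggested above:
# torch.save() for snapshotting, and fresh construction for untrained clones.
import io

import torch
import torch.nn as nn


def make_model(in_features: int = 4, out_features: int = 2) -> nn.Module:
    # Rebuild the model "from first principles" from its architectural
    # parameters instead of deep-copying an existing instance.
    # (nn.Linear is a hypothetical placeholder for a real model.)
    return nn.Linear(in_features, out_features)


model = make_model()

# Snapshotting a trained model: save its state dict with torch.save().
# A BytesIO buffer is used here instead of a file path for illustration.
buf = io.BytesIO()
torch.save(model.state_dict(), buf)

# Restore the snapshot into a freshly constructed model.
buf.seek(0)
restored = make_model()
restored.load_state_dict(torch.load(buf))

# Multiple untrained clones: construct each one from the architecture,
# rather than deep-copying a single instance.
clones = [make_model() for _ in range(3)]
```

Because `restored` is built by the same constructor and then loaded from the saved state dict, it carries the same parameters as `model` without ever deep-copying the module object itself.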