adapt attention.py to torch 2.0 #2483

Closed · wants to merge 1 commit

Conversation

@caiqi commented on Feb 24, 2023

The AttentionBlock is not adapted to torch 2.0. When using StableDiffusionLatentUpscalePipeline with 768x768 images, it raises an OOM error on a 16GB GPU. This PR uses F.scaled_dot_product_attention to decrease the memory usage. I tested on Colab that this PR fixes the issue: https://colab.research.google.com/drive/1qMwzjweWSUHsYeG932OCECAeA-qkyUjb?usp=sharing
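For readers unfamiliar with the torch 2.0 API, here is a minimal sketch of the kind of change this PR proposes (function names and tensor shapes are illustrative, not the actual diffusers code): F.scaled_dot_product_attention can dispatch to a fused kernel and avoid materializing the full attention matrix that the manual softmax path allocates.

```python
# Minimal sketch, not the actual diffusers patch. Contrasts the manual
# softmax(QK^T)V computation in AttentionBlock with torch 2.0's fused op.
import torch
import torch.nn.functional as F

def attention_manual(query, key, value):
    # Pre-2.0 style: materializes the full (seq_len x seq_len) attention
    # matrix, which dominates memory at large spatial resolutions.
    scale = query.shape[-1] ** -0.5
    attn = torch.softmax(query @ key.transpose(-2, -1) * scale, dim=-1)
    return attn @ value

def attention_fused(query, key, value):
    # torch 2.0: dispatches to a memory-efficient / flash kernel when
    # available, so the seq_len x seq_len matrix is never materialized.
    return F.scaled_dot_product_attention(query, key, value)

if __name__ == "__main__":
    # Shapes are (batch, heads, seq_len, head_dim); seq_len stands for the
    # number of spatial positions (H*W), so the manual path's seq_len^2
    # buffer is what blows up for high-resolution inputs.
    q, k, v = (torch.randn(1, 1, 4096, 64) for _ in range(3))
    print(torch.allclose(attention_manual(q, k, v),
                         attention_fused(q, k, v), atol=1e-4))
```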

@HuggingFaceDocBuilderDev commented on Feb 24, 2023

The documentation is not available anymore as the PR was closed or merged.

@patrickvonplaten (Contributor) commented

Hey @caiqi,

Thanks for the PR - I believe that we already merged PyTorch 2.0's fast attention support in this PR: #2303

Think we can close this one, no? Very sorry for not replying earlier.

@caiqi (Author) commented on Mar 8, 2023

> Hey @caiqi,
>
> Thanks for the PR - I believe that we already merged PyTorch 2.0's fast attention support in this PR: #2303
>
> Think we can close this one, no? Very sorry for not replying earlier.

@patrickvonplaten Thanks! It seems PR #2303 updates cross_attention.py but not attention.py?
[screenshot of attention.py]

I hit the memory issue in the image decoder part, which relies on AttentionBlock.
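For context, a reproduction along the lines of the linked Colab might look like the sketch below. This is an assumption on my part, not code taken from this thread: the checkpoint name, call arguments, and whether the pipeline accepts a PIL image directly are all hypothetical, and the Colab above remains the authoritative repro. The point is that decoding the upscaled result runs the VAE decoder, whose AttentionBlock builds a full attention matrix over all spatial positions and runs out of memory on a 16GB GPU without the fused attention path.

```python
# Hypothetical reproduction sketch (not the exact Colab code). Assumes the
# stabilityai/sd-x2-latent-upscaler checkpoint and that `image` accepts a
# PIL image; adjust to match the actual notebook if needed.
import torch
from PIL import Image
from diffusers import StableDiffusionLatentUpscalePipeline

pipe = StableDiffusionLatentUpscalePipeline.from_pretrained(
    "stabilityai/sd-x2-latent-upscaler", torch_dtype=torch.float16
).to("cuda")

# Placeholder 768x768 input; only the resolution matters for the OOM.
low_res = Image.new("RGB", (768, 768))

upscaled = pipe(
    prompt="a photo of a cat",
    image=low_res,
    num_inference_steps=20,
    guidance_scale=0,
).images[0]
```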

@patrickvonplaten (Contributor) commented

Hey @caiqi,

Yeah, the naming is not great here; attention.py makes use of cross_attention.py in all its attention computations.

@caiqi (Author) commented on Mar 9, 2023

@patrickvonplaten I have tested the latest diffusers code, and it seems that attention.py uses its own attention code. The following is the stack trace:
[screenshot of the stack trace]

I tested it in this Colab notebook: https://colab.research.google.com/drive/1qMwzjweWSUHsYeG932OCECAeA-qkyUjb?usp=sharing

@patrickvonplaten (Contributor) commented

cc @williamberman, we should clean up this attention logic to avoid confusion.

@williamberman (Contributor) commented on Mar 16, 2023

Appreciate it @caiqi :) We're in the process of deprecating AttentionBlock; see #1880.
