Dreambooth class sampling to use xformers if enabled #3312


Closed · wants to merge 2 commits

Conversation

@mu94-csl (Contributor) commented May 2, 2023

Currently, training can use xFormers, but the inference pass used for class sampling before training cannot.
The prior-preservation class sampling of ~200 images is a major bottleneck right now (~6x the actual training time).
This PR allows xFormers to be used for both training and inference, for both full and LoRA DreamBooth.
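
For context, the change is conceptually small. Here is a minimal sketch (not the exact diff) of applying the existing xFormers flag to the pipeline that generates the class images; the model id is a placeholder and the flag name stands in for the argument the training path already uses:

```python
# Minimal sketch: honor the same xFormers flag on the class-sampling pipeline
# that the training loop already respects.
import torch
from diffusers import StableDiffusionPipeline

pipeline = StableDiffusionPipeline.from_pretrained(
    "runwayml/stable-diffusion-v1-5",  # placeholder model id
    torch_dtype=torch.float16,
)
pipeline.to("cuda")

enable_xformers = True  # stands in for args.enable_xformers_memory_efficient_attention
if enable_xformers:
    # Same call the training loop makes on the UNet, here applied to the whole pipeline.
    pipeline.enable_xformers_memory_efficient_attention()

# ... then generate the ~200 prior-preservation class images with `pipeline(prompt)` as usual ...
```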

@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint.

@patrickvonplaten (Contributor)

Hmm, given that much higher speed-ups can be gained with PT 2.0 now, should we maybe just advertise PT 2.0? cc @sayakpaul

@sayakpaul (Member)

> Hmm, given that much higher speed-ups can be gained with PT 2.0 now, should we maybe just advertise PT 2.0?

I think the community still uses xFormers a lot. Maybe it's better to make it clear in the docs that if someone is using PyTorch 2.0, the efficient attention processor is used by default and they shouldn't have to enable xFormers for that.

`AttnProcessor2_0() if hasattr(F, "scaled_dot_product_attention") and scale_qk else AttnProcessor()`

WDYT?
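
For reference, a minimal sketch of the check from the user's side (assuming only standard PyTorch): if `scaled_dot_product_attention` exists, diffusers already picks `AttnProcessor2_0` by default, so enabling xFormers is unnecessary.

```python
# Sketch: detect whether the PyTorch 2.0 SDPA path applies.
import torch.nn.functional as F

if hasattr(F, "scaled_dot_product_attention"):
    print("PyTorch 2.0 SDPA available: AttnProcessor2_0 is used by default.")
else:
    print("No SDPA: consider pipeline.enable_xformers_memory_efficient_attention().")
```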

@mu94-csl (Contributor, Author) commented May 4, 2023

> Hmm, given that much higher speed-ups can be gained with PT 2.0 now, should we maybe just advertise PT 2.0?

Actually yes, I found no difference between PT 2.0 and xFormers on Turing and Ampere GPUs.

My PR was 'misled' by my experiments on a Pascal-generation GPU (still usable :D), for which PT 2.0 perhaps does not launch the correct kernels, whereas xFormers helps tremendously.

Also, the current DreamBooth script fails on PT 2.0 with some CUDA errors but works with xFormers, suggesting that the PT backend is still quirky; see #3325.

@patrickvonplaten (Contributor)

> Hmm, given that much higher speed-ups can be gained with PT 2.0 now, should we maybe just advertise PT 2.0?
>
> I think the community still uses xFormers a lot. Maybe it's better to make it clear in the docs that if someone is using PyTorch 2.0, the efficient attention processor is used by default and they shouldn't have to enable xFormers for that.
>
> `AttnProcessor2_0() if hasattr(F, "scaled_dot_product_attention") and scale_qk else AttnProcessor()`
>
> WDYT?

Yes, good idea. Maybe we could throw a warning at init if we detect that both PT 2.0 and xFormers are installed?
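
A hypothetical sketch of such a warning (not existing diffusers behavior); `is_xformers_available` is the existing helper in `diffusers.utils`:

```python
# Sketch: warn when both the PT 2.0 SDPA path and xFormers are available,
# since explicitly enabling xFormers is then usually unnecessary.
import warnings
import torch.nn.functional as F
from diffusers.utils import is_xformers_available

if hasattr(F, "scaled_dot_product_attention") and is_xformers_available():
    warnings.warn(
        "PyTorch 2.0's scaled_dot_product_attention is available; "
        "explicitly enabling xFormers memory-efficient attention is usually unnecessary."
    )
```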

@github-actions (bot) commented Jun 1, 2023

This issue has been automatically marked as stale because it has not had recent activity. If you think this still needs to be addressed please comment on this thread.

Please note that issues that do not follow the contributing guidelines are likely to be ignored.

github-actions bot added the `stale` label (Issues that haven't received updates) on Jun 1, 2023
github-actions bot closed this on Jun 9, 2023