Implement .swap() against diffusers 0.12 #2385

Conversation

@damian0815 (Contributor) commented Jan 21, 2023

Re-implementation of .swap() for diffusers 0.12's new CrossAttnProcessor API.

Requires diffusers 0.12: pip install https://github.com/huggingface/diffusers

Currently only tested/working on Mac CPU (invoke.py --always_use_cpu).

Todo:

  • Sliced version
  • Test on Linux and Windows
  • Correctly reinstate the proper CrossAttnProcessor after the swap finishes: it should automatically fall back to e.g. xformers if that is what was in place before the .swap() (see the sketch after this list)
  • Remove errors about missing monkeypatching
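
For orientation, here is a minimal sketch of the processor hand-off pattern that the new diffusers 0.12 API enables; it is not the PR's actual code. SwapCrossAttnProcessor and install_swap_processors are hypothetical names, and the sketch assumes the 0.12 set_attn_processor / attn_processors interface and the CrossAttnProcessor call signature.

```python
# A minimal sketch of the diffusers 0.12 processor hand-off this PR builds on;
# not the PR's actual code. SwapCrossAttnProcessor and install_swap_processors
# are hypothetical names.
from diffusers.models.cross_attention import CrossAttnProcessor


class SwapCrossAttnProcessor:
    """Hypothetical stand-in: delegates to the stock processor, but this is the
    hook point where a .swap() implementation would intervene in the attention."""

    def __init__(self):
        self._default = CrossAttnProcessor()

    def __call__(self, attn, hidden_states, encoder_hidden_states=None, attention_mask=None):
        # A real implementation would record or substitute cross-attention maps here.
        return self._default(attn, hidden_states, encoder_hidden_states, attention_mask)


def install_swap_processors(unet):
    """Install the custom processor on every attention module, remembering what
    was there before (e.g. xformers) so it can be reinstated afterwards."""
    previous_processors = dict(unet.attn_processors)
    unet.set_attn_processor(SwapCrossAttnProcessor())
    return previous_processors
```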

@damian0815 force-pushed the diffusers_cross_attention_control_reimplementation branch from c2183b6 to 313b206 on January 22, 2023 at 17:13
@damian0815 (Contributor, Author)

MPS .swap is non-functional until kulinseth/pytorch#222 is merged

@damian0815 (Contributor, Author)

OK, I think this is good. Can I get some testing support on Windows and Linux, please?

  • Does it work with and without xformers?
  • If you're running xformers: does InvokeAI correctly return to using xformers after doing a .swap()?

@damian0815 marked this pull request as ready for review on January 25, 2023 at 22:06
@lstein (Collaborator) commented Jan 26, 2023

It seems to be working on Linux. How much change is expected in the non-swapped portions of the image? Here's a typical test:

mother and daughter having lunch
[image]

mother and daughter.swap(son) having lunch
[image]

mother and daughter.swap(son) having lunch.swap(dinner)
[image]

Memory usage was quite good: no apparent increase when swap is activated. Works with xformers as well.

@keturn (Contributor) commented Jan 28, 2023

> mother and daughter.swap(son) having lunch.swap(dinner)

Oh, do we get to use more than one operation now? The previous implementation was limited to one, I thought.

@lstein (Collaborator) commented Jan 28, 2023 via email

@keturn (Contributor) commented Jan 29, 2023

Testing on Linux, results seem poor. The image with swap is almost (though not quite) identical to the replacement on its own.

Starting with --no-xformers does not seem to improve matters.

a photo of the trunk of a car filled with soccer gear
[image: soccer gear]

a photo of the trunk of a car filled with (picnic supplies).swap(soccer gear)
[image: picnic supplies SWAP soccer gear]

a photo of the trunk of a car filled with picnic supplies
[image: picnic supplies]

Using SD 1.5, DDIM, 25 steps.

@damian0815 (Contributor, Author)

@keturn can you try adding , t_start=0.1 or 0.2 inside the swap? I.e. a photo of the trunk of a car filled with (picnic supplies).swap(soccer gear, t_start=0.1)

@JPPhoto (Contributor) commented Jan 30, 2023

@damian0815 Perhaps t_start should default to something like 0.2 so there's a visual difference?

@keturn (Contributor) commented Jan 30, 2023

Interesting. Setting t_start, even as low as 0.05 (of 25 steps), is enough to retain the shape of the "picnic" car instead of looking totally like the replacement prompt.

(picnic supplies).swap(soccer gear, t_start=0.05)
[image: t_start=0.05]

I guess this will probably all be more comprehensible once we get the attention map visualizations back, huh?

@keturn (Contributor) commented Jan 30, 2023

Same goes for the test prompt I got from hipsterusername a while back:

silhouette of a dancing (elvis).swap(frog)

With default settings [t_start=0], it is practically indistinguishable from the replacement prompt silhouette of a dancing frog. Effective values of t_start seem to be low but non-zero, which translates to a couple of steps, I assume.
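
To make the fraction-to-steps arithmetic concrete, here is a tiny illustration; swap_active_steps is a hypothetical helper, and the exact rounding and the before/after semantics in the PR may differ.

```python
# Hypothetical illustration of how fractional t_start / t_end values could map
# onto discrete denoising steps; the PR's actual rounding and semantics may differ.
def swap_active_steps(num_steps: int, t_start: float = 0.0, t_end: float = 1.0) -> range:
    """Step indices on which the .swap() edit would be applied."""
    first = int(round(num_steps * t_start))
    last = int(round(num_steps * t_end))
    return range(first, last)


# 25 steps at t_start=0.05: the edit would kick in from step 1, leaving the very
# first step to run un-edited and establish the overall composition.
print(list(swap_active_steps(25, t_start=0.05)))  # [1, 2, ..., 24]
```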

@keturn (Contributor) commented Jan 30, 2023

> Oh, do we get to use more than one operation now? The previous implementation was limited to one, I thought.

This warning still pops up in the log: "warning: cross-attention control options are not working properly for >1 edit"

But using multiple swaps definitely does do stuff. Is it warning us that you can have multiple edits but not independent values of t_start/t_end for them?

@damian0815 (Contributor, Author) commented Jan 30, 2023

Yep, t_start should default to something >= 1 step; probably 1 step would be fine. I wonder if t_start=0 should simply mean "after 1 step" then, since there's no longer an s_start.

@keturn

> But using multiple swaps definitely does do stuff. Is it warning us that you can have multiple edits but not independent values of t_start/t_end for them?

Yes, that's the warning. I do want to eventually address that; it should be clearer how to do so now that I've broken compel off as a separate lib (which I'd like to convert to an import instead of having local source).

…github.com:damian0815/InvokeAI into diffusers_cross_attention_control_reimplementation
@damian0815 (Contributor, Author)

OK, no: skipping the first step is a bad idea. I'll just make the default 0.1.

@damian0815 (Contributor, Author) commented Jan 30, 2023

Better now:

a photo of the trunk of a car filled with picnic supplies
[image: Screen Shot 2023-01-30 at 15 32 46]

a photo of the trunk of a car filled with soccer gear
[image: Screen Shot 2023-01-30 at 15 32 57]

a photo of the trunk of a car filled with (picnic supplies).swap(soccer gear)
[image: Screen Shot 2023-01-30 at 15 39 09]

@damian0815 (Contributor, Author)

I took the liberty of ticking the "Test on Linux and Windows" checkbox; I think we should be good to go on this.

@damian0815 (Contributor, Author)

@keturn I also took the liberty of "resolving" the concerns you raised re: the naming of the remove_cross_attention_control function (which is now called restore_default_cross_attention).

@keturn (Contributor) left a comment

I've now experimented with some high-RAM operations before and after running the swap to confirm that it does indeed put the memory-efficient attention settings back correctly, and that's working well for me both with xformers and without. ✔️

There are still minor details I'm unclear on (like why you can pass None to restore_default_cross_attention), but overall this is a huge improvement to the stability of the cross-attention code with diffusers 0.12 and I think it's good to merge. 👍
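
For readers following along, here is one plausible shape for the save/restore pattern being discussed, assuming the diffusers 0.12 set_attn_processor API; it mirrors the renamed helper in spirit only, and the None handling shown is an assumption rather than InvokeAI's actual behaviour.

```python
# Hypothetical sketch only, assuming the diffusers 0.12 set_attn_processor API;
# this is not InvokeAI's actual restore_default_cross_attention.
from diffusers.models.cross_attention import CrossAttnProcessor


def restore_default_cross_attention(unet, restore_processors=None):
    """Reinstate whatever processors were recorded before the .swap();
    passing None falls back to the stock CrossAttnProcessor."""
    if restore_processors is not None:
        unet.set_attn_processor(restore_processors)  # e.g. the saved xformers processors
    else:
        unet.set_attn_processor(CrossAttnProcessor())
```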
