[wip] attention refactor #2143

williamberman · 2023-01-27T21:13:14Z

When AttentionBlock's forward is called, dynamically create a CrossAttention module with the same parameters and call its forward method instead.
If an AttentionBlock is constructed, we log a deprecation warning with instructions for converting the model. To de-duplicate deprecation messages, we can use a context manager in attention.py that manages a one-off logging method.

HuggingFaceDocBuilderDev · 2023-01-27T21:18:28Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint.

williamberman · 2023-01-27T23:34:56Z

src/diffusers/models/attention.py

+        attn = CrossAttention(
+            self.channels,
+            heads=self.num_heads,
+            dim_head=dim_head,
+            bias=True,
+            upcast_softmax=True,
+            norm_num_groups=self.group_norm.num_groups,
+            processor=processor,
+            eps=self.group_norm.eps,
+            rescale_output_factor=self.rescale_output_factor,
+        )

-        # compute next hidden_states
-        hidden_states = self.proj_attn(hidden_states)
+        attn.group_norm = self.group_norm
+        attn.to_q = self.query
+        attn.to_k = self.key
+        attn.to_v = self.value
+        attn.to_out[0] = self.proj_attn

-        hidden_states = hidden_states.transpose(-1, -2).reshape(batch, channel, height, width)
+        hidden_states = attn(hidden_states)


Creating CrossAttention on the fly like this causes some of the mps tests that rely on reproducibility to fail. Could this have something to do with the mps warm-up passes? cc @pcuenca

williamberman · 2023-01-30T23:43:21Z

src/diffusers/pipelines/audio_diffusion/mel.py


 from ...configuration_utils import ConfigMixin, register_to_config
 from ...schedulers.scheduling_utils import SchedulerMixin


-warnings.filterwarnings("ignore")


This warnings filter silences all warnings. It needs to be removed so that the AttentionBlock deprecation warning is visible.
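
As an illustration of why the filter hides the message, here is a small sketch that assumes the deprecation is emitted through Python's `warnings` module. `construct_deprecated_block` and `count_warnings` are hypothetical helpers standing in for constructing an AttentionBlock:

```python
import warnings


def construct_deprecated_block():
    # Stand-in for constructing an AttentionBlock (hypothetical helper).
    warnings.warn("AttentionBlock is deprecated", DeprecationWarning, stacklevel=2)


def count_warnings(silence_all):
    """Return how many warnings are recorded while constructing the block."""
    with warnings.catch_warnings(record=True) as caught:
        # Start from a state where every warning would be recorded.
        warnings.simplefilter("always")
        if silence_all:
            # Same call as the line removed from mel.py: with no category or
            # module argument, it matches and drops every warning.
            warnings.filterwarnings("ignore")
        construct_deprecated_block()
    return len(caught)
```

Because `warnings.filterwarnings("ignore")` at import time changes the process-wide filter list, importing the module that contains it is enough to suppress warnings raised anywhere else in the program.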

williamberman force-pushed the attention_refactor branch 2 times, most recently from 2658f5a to c6709ed Compare January 27, 2023 22:01

williamberman force-pushed the attention_refactor branch 10 times, most recently from e11a51a to 52f16b8 Compare January 30, 2023 20:43

williamberman force-pushed the attention_refactor branch from cbec387 to ae18d1d Compare January 31, 2023 18:19

[wip] attention refactor

06f3e9b

williamberman force-pushed the attention_refactor branch from ae18d1d to 06f3e9b Compare January 31, 2023 23:25

williamberman closed this Feb 16, 2023
