Attention mask for transformer flux #10051

Closed

Conversation

christopher5106
The currently unused `attention_mask` parameter in the Flux transformer attention processors is useful for reweighting the prompt or image prompt.

Fixes #10025

@yiyixuxu @sayakpaul
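
For context, a minimal sketch of the reweighting idea in plain PyTorch (shapes are illustrative; Flux attends over concatenated text and image tokens, so a real mask would cover that joint sequence). A float `attn_mask` is added to the attention logits before the softmax, so a value of `log(w)` at a key token's position rescales its weight by `w`, and a large negative value suppresses it:

```python
import torch
import torch.nn.functional as F

# Illustrative shapes only, not Flux's actual dimensions.
batch, heads, seq, dim = 1, 2, 6, 8
query = torch.randn(batch, heads, seq, dim)
key = torch.randn(batch, heads, seq, dim)
value = torch.randn(batch, heads, seq, dim)

# Additive float mask: 0.0 leaves a key token's weight unchanged,
# log(w) multiplies its pre-softmax weight by w, and a large negative
# value effectively masks it out.
attention_mask = torch.zeros(batch, 1, seq, seq)
attention_mask[..., 2] = torch.log(torch.tensor(2.0))  # emphasize key token 2
attention_mask[..., 5] = -1e4                          # suppress key token 5

hidden_states = F.scaled_dot_product_attention(
    query, key, value, attn_mask=attention_mask, dropout_p=0.0, is_causal=False
)
```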

christopher5106 marked this pull request as draft on November 29, 2024 at 10:31
```diff
@@ -1762,7 +1762,9 @@ def __call__(
         query = apply_rotary_emb(query, image_rotary_emb)
         key = apply_rotary_emb(key, image_rotary_emb)

-        hidden_states = F.scaled_dot_product_attention(query, key, value, dropout_p=0.0, is_causal=False)
+        hidden_states = F.scaled_dot_product_attention(
```
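
The added call is cut off in the diff view above; given the PR's goal (and the reviewer comment below), the completed call presumably threads the processor's `attention_mask` argument through, along the lines of:

```python
# Presumed continuation of the truncated "+" lines above.
hidden_states = F.scaled_dot_product_attention(
    query, key, value, attn_mask=attention_mask, dropout_p=0.0, is_causal=False
)
```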
Collaborator


I see you added this to F.scaled_dot_product_attention, but I wonder how this would help? The mask is currently not passed down from the Flux transformer to the attention processor.
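
To make the concern concrete, here is a toy sketch of the forwarding that would be needed for a mask given to the model to actually reach the processor (class and argument names are illustrative, not diffusers' actual API):

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class ToyAttnProcessor:
    # Stand-in for the Flux attention processor: it accepts a mask,
    # but the mask only has an effect if every caller passes it along.
    def __call__(self, hidden_states, attention_mask=None):
        q = k = v = hidden_states  # projections elided for brevity
        return F.scaled_dot_product_attention(q, k, v, attn_mask=attention_mask)

class ToyTransformer(nn.Module):
    def __init__(self):
        super().__init__()
        self.processor = ToyAttnProcessor()

    def forward(self, hidden_states, attention_mask=None):
        # This explicit hand-off is the missing piece: without it, a mask
        # supplied to the transformer never reaches the processor.
        return self.processor(hidden_states, attention_mask=attention_mask)

x = torch.randn(1, 2, 6, 8)  # (batch, heads, tokens, head_dim)
mask = torch.zeros(1, 1, 6, 6)
mask[..., 5] = -1e4          # suppress attention to the last token
out = ToyTransformer()(x, attention_mask=mask)
```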


Successfully merging this pull request may close these issues:

- attention mask for transformer Flux (#10025)