Fix for breaking changes in xformers 0.0.21 #834
Conversation
LGTM! Thanks for the fix!
```
self.num_heads,
padded_len,
prompt_len,
```
What is the reason for the change from padded_len to prompt_len here?
Fixes #832
This PR fixes the breaking changes made in the latest release of xformers. Specifically, the xformers attention bias must now include a batch dimension, which is 1 in the current vLLM implementation.
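To illustrate the shape change, below is a minimal sketch of building an ALiBi-style attention bias with the leading batch dimension of 1 that xformers 0.0.21 expects. It is loosely modeled on vLLM's bias construction; the function name make_alibi_bias, its signature, and the pad-to-multiple-of-8 step are assumptions for illustration, not the PR's actual code.

```python
import torch


def make_alibi_bias(alibi_slopes: torch.Tensor,
                    num_heads: int,
                    prompt_len: int,
                    dtype: torch.dtype) -> torch.Tensor:
    # Hypothetical helper, loosely modeled on vLLM's ALiBi bias
    # construction; names and shapes here are assumptions.
    bias = torch.arange(prompt_len, dtype=dtype)
    # Relative distances between query and key positions:
    # bias[i, j] = j - i, shape (prompt_len, prompt_len).
    bias = bias[None, :] - bias[:, None]

    # Pad the key dimension to a multiple of 8 (an alignment commonly
    # required by memory-efficient attention kernels), then slice the
    # view back down so only prompt_len columns are filled.
    padded_len = (prompt_len + 7) // 8 * 8
    bias = torch.empty(
        1,            # leading batch dimension required by xformers 0.0.21
        num_heads,
        prompt_len,   # query length (the diff above changes padded_len
                      # to prompt_len here)
        padded_len,   # padded key length
        dtype=dtype,
    )[:, :, :, :prompt_len].copy_(bias)
    # Scale each head's bias by its ALiBi slope.
    bias.mul_(alibi_slopes[:, None, None])
    return bias


slopes = torch.ones(8, dtype=torch.float32)
bias = make_alibi_bias(slopes, num_heads=8, prompt_len=10,
                       dtype=torch.float32)
print(bias.shape)  # torch.Size([1, 8, 10, 10]) -- batch dimension is 1
```

The key point is that the returned bias is 4-D with a leading batch dimension of 1, where older xformers releases accepted a 3-D (heads, query, key) bias.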