
attn_bias not aligned & some questions regarding float16 #468


Closed
MM-IR opened this issue Jul 15, 2023 · 1 comment · Fixed by #834
Labels
bug Something isn't working

Comments


MM-IR commented Jul 15, 2023

Hi,

  1. When playing with the MPT-7B models, I frequently run into the "attn_bias not aligned" error when running with tensor_parallel_size = 2. How do I alleviate this issue? (A minimal repro sketch follows below.)

  2. Also, I noticed that your default model-loading scripts load the float16 weights. For a fair evaluation, is it necessary to switch to float32?
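
For context, here is roughly how I'm loading the model when the error shows up (a minimal sketch; the prompt and sampling settings are placeholders, not from my actual run):

```python
from vllm import LLM, SamplingParams

# Load MPT-7B sharded across two GPUs; this is the configuration
# that triggers "attn_bias not aligned" for me.
llm = LLM(
    model="mosaicml/mpt-7b",
    tensor_parallel_size=2,
    trust_remote_code=True,  # required for MPT's custom model code
)

sampling_params = SamplingParams(temperature=0.8, max_tokens=128)
outputs = llm.generate(["Hello, my name is"], sampling_params)
print(outputs[0].outputs[0].text)
```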

Thanks very much in advance!

@WoosukKwon
Collaborator

Hi @MM-IR, sorry for the very late response. I believe the bug you reported was fixed by #834.

For the second question, I think it's pretty common to use FP16 for evaluating models, because its impact on model accuracy is negligible. For example, the HF Open LLM Leaderboard does not support FP32; it only supports FP16, BF16, and 4/8-bit quantized formats.
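
If you do want to compare against full precision anyway, you can override the default with the `dtype` argument when constructing the engine (a minimal sketch; the model name is just an example):

```python
from vllm import LLM

# vLLM defaults to the checkpoint's dtype (FP16 for most MPT-7B weights);
# passing dtype="float32" forces full precision for an accuracy check.
llm_fp32 = LLM(
    model="mosaicml/mpt-7b",
    dtype="float32",
    trust_remote_code=True,
)
```

Expect roughly 2x the GPU memory footprint and lower throughput compared to FP16.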
