Skip to content

[Usage]: Multi lora inference support for llava v1.6 #13034

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Closed
1 task done
WeixuanXiong opened this issue Feb 10, 2025 · 1 comment
Closed
1 task done

[Usage]: Multi lora inference support for llava v1.6 #13034

WeixuanXiong opened this issue Feb 10, 2025 · 1 comment
Labels
usage How to use vllm

Comments

@WeixuanXiong
Copy link

Your current environment

The output of `python collect_env.py`

How would you like to use vllm

I tried to run multi-lora inference on vllm for llava-hf/llava-v1.6-mistral-7b-hf. But i found it not supported according to the doc here https://docs.vllm.ai/en/latest/models/supported_models.html#list-of-multimodal-language-models.

Is there any plan to support multi lora for more mllm in near future or any practical way to obtain it on llava-next?

Before submitting a new issue...

  • Make sure you already searched for relevant issues, and asked the chatbot living at the bottom right corner of the documentation page, which can answer lots of frequently asked questions.
@WeixuanXiong WeixuanXiong added the usage How to use vllm label Feb 10, 2025
@hmellor
Copy link
Member

hmellor commented Feb 17, 2025

You can check out https://docs.vllm.ai/en/latest/features/compatibility_matrix.html#feature-x-feature to see which features are compatible with each other.

The table indicates that multi-modal inputs are not yet compatible with LoRA (although, the pull request linked in the table is merged).

You can follow #4194 for updates on LoRA + multi-modality.

@hmellor hmellor closed this as completed Feb 17, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
usage How to use vllm
Projects
None yet
Development

No branches or pull requests

2 participants