[New Model]: Support Zyphra/Zamba2-7B #9382
Comments
+1
+1
+1
This issue has been automatically marked as stale because it has not had any activity within 90 days. It will be automatically closed if no further activity occurs within 30 days. Leave a comment if you feel this issue should remain open. Thank you!
Hey, Yury here from Zyphra. We have an internal version that works; going to open a PR sometime soon.
This is done now. Merged into the 0.8.x version of vLLM.
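For anyone landing here after the merge: with that support in place, offline inference through vLLM's standard API should look roughly like the sketch below. This is a minimal sketch, assuming the `Zyphra/Zamba2-7B-Instruct` checkpoint name from the links in this issue is what got registered.

```python
# Minimal sketch, assuming vLLM >= 0.8.x with the merged Zamba2 support.
from vllm import LLM, SamplingParams

llm = LLM(model="Zyphra/Zamba2-7B-Instruct")
params = SamplingParams(temperature=0.7, max_tokens=128)

outputs = llm.generate(["What is a state-space model?"], params)
print(outputs[0].outputs[0].text)
```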
The model to consider.
Announcement blog: https://www.zyphra.com/post/zamba2-7b
Base model: https://huggingface.co/Zyphra/Zamba2-7B
Instruct tuned: https://huggingface.co/Zyphra/Zamba2-7B-Instruct
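For quick experimentation outside vLLM, loading the checkpoint through Hugging Face transformers should look roughly like the sketch below. Minimal sketch, assuming a transformers build that includes the Zamba2 modeling code (e.g. Zyphra's fork linked at the end of this issue); the dtype and device settings are illustrative.

```python
# Minimal sketch: load the HF checkpoint with transformers.
# Assumes a transformers build that includes Zamba2 modeling code.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("Zyphra/Zamba2-7B-Instruct")
model = AutoModelForCausalLM.from_pretrained(
    "Zyphra/Zamba2-7B-Instruct", torch_dtype=torch.bfloat16, device_map="auto"
)

inputs = tokenizer("Hello from Zamba2!", return_tensors="pt").to(model.device)
out = model.generate(**inputs, max_new_tokens=32)
print(tokenizer.decode(out[0], skip_special_tokens=True))
```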
The closest model vLLM already supports.
Jamba, as it is a mixture of state-space and transformer blocks.
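For context, here is a toy sketch of the general hybrid pattern these models share: most decoder layers are state-space (Mamba) mixers, with attention blocks placed at certain depths. The layer factories and the interleaving schedule below are illustrative assumptions, not Jamba's or Zamba2's actual configuration (Zamba2 in particular reuses a shared attention block, as discussed under the next question).

```python
# Toy sketch of a hybrid SSM/attention decoder stack (illustrative only;
# the factories and the attn_every schedule are assumptions, not the
# actual Jamba or Zamba2 configuration).
import torch.nn as nn

class HybridDecoder(nn.Module):
    def __init__(self, num_layers: int, make_mamba, make_attention, attn_every: int = 6):
        super().__init__()
        # Mostly state-space mixer layers, with an attention block every
        # `attn_every` layers.
        self.layers = nn.ModuleList([
            make_attention() if (i + 1) % attn_every == 0 else make_mamba()
            for i in range(num_layers)
        ])

    def forward(self, hidden_states):
        for layer in self.layers:
            hidden_states = layer(hidden_states)
        return hidden_states
```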
What's your difficulty of supporting the model you want?
Should be easy once Mamba2 support lands in #9292; however, the `use_shared_attention_lora` case seems possibly complex. All of the HF-compatible modeling code can be found here: https://github.com/Zyphra/transformers_zamba2/tree/main/src/transformers/models/zamba2
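My reading of the Zamba2 materials is that a single attention block is reused at multiple depths, with a distinct LoRA adapter per use site so the shared weights can specialize; that is presumably what `use_shared_attention_lora` toggles. The sketch below only illustrates that pattern: all names are hypothetical, and the real adapters act on the block's internal projections rather than its input, so see Zyphra's modeling code linked above for the actual implementation.

```python
# Toy illustration of shared attention with per-use-site LoRA (the pattern
# `use_shared_attention_lora` appears to control). All names are hypothetical.
import torch.nn as nn

class LoRA(nn.Module):
    def __init__(self, dim: int, rank: int = 8):
        super().__init__()
        self.down = nn.Linear(dim, rank, bias=False)
        self.up = nn.Linear(rank, dim, bias=False)

    def forward(self, x):
        return self.up(self.down(x))

class SharedAttentionWithLoRA(nn.Module):
    """One attention block reused at several depths; each use site gets its
    own low-rank adapter. Simplification: the adapter is applied to the block
    input rather than the internal q/k/v projections."""
    def __init__(self, dim: int, num_use_sites: int, num_heads: int = 8):
        super().__init__()
        self.attn = nn.MultiheadAttention(dim, num_heads, batch_first=True)
        self.loras = nn.ModuleList([LoRA(dim) for _ in range(num_use_sites)])

    def forward(self, x, site: int):
        h = x + self.loras[site](x)  # per-site specialization of shared weights
        out, _ = self.attn(h, h, h)
        return out
```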
Before submitting a new issue...