[Feature]: Add support for attention score output #11365


Open
1 task done
WoutDeRijck opened this issue Dec 20, 2024 · 7 comments
Labels
feature request New feature or request unstale Received activity after being labelled stale

Comments

@WoutDeRijck

WoutDeRijck commented Dec 20, 2024

🚀 The feature, motivation and pitch

Problem

vLLM currently doesn't provide access to attention scores during inference, which are essential for model analysis and interpretability research. #11862

Feature Request

Add the ability to retrieve attention scores during model inference, similar to HuggingFace's output_attentions=True parameter.
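For reference, a minimal NumPy sketch of what "attention scores" denotes here: the softmax-normalised, scaled dot-product weights that each attention layer computes internally. This is purely illustrative (shapes, seed, and the `attention_scores` helper are made up for the example) and does not reflect vLLM internals:

```python
import numpy as np

def attention_scores(q, k):
    """Softmax-normalised scaled dot-product attention weights."""
    d = q.shape[-1]
    logits = q @ k.swapaxes(-2, -1) / np.sqrt(d)
    logits -= logits.max(axis=-1, keepdims=True)  # numerical stability
    w = np.exp(logits)
    return w / w.sum(axis=-1, keepdims=True)

rng = np.random.default_rng(0)
q = rng.standard_normal((4, 8))   # seq_len=4, head_dim=8
k = rng.standard_normal((4, 8))
scores = attention_scores(q, k)

# Row i tells you how much query token i attends to each key token.
print(scores.shape)                        # (4, 4)
print(np.allclose(scores.sum(-1), 1.0))    # True
```

These per-layer, per-head matrices are what HuggingFace returns in the `attentions` tuple when `output_attentions=True` is passed; the request is for vLLM to expose the equivalent.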

Motivation

  • Need to analyze token-level relationships in model outputs
  • Required for building visualization tools and debugging model behavior
  • Critical for research into attention mechanisms

Alternatives

No response

Additional context

No response

Before submitting a new issue...

  • Make sure you already searched for relevant issues, and asked the chatbot living at the bottom right corner of the documentation page, which can answer lots of frequently asked questions.
@WoutDeRijck WoutDeRijck added the feature request New feature or request label Dec 20, 2024
@Dineshkumar-Anandan-ZS0367

Are you asking about output_attentions=True or return_cross_attentions=True for getting coordinates?

Those are only provided by vision encoder-decoder models or cross-encoder models.

Which model are you using?

@WoutDeRijck
Author

WoutDeRijck commented Jan 9, 2025

I don't mean to get coordinates. I am using Llama-3.1-8b, let's say I want to extract data out of the input context, then I need the attention scores to be able to visualize where the model is looking. (Pure text-based, no vision)

These are of course also present in decoder-only models.
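Assuming per-layer attention matrices were exposed (as they are with HuggingFace's output_attentions=True), the kind of visualization described above could, for instance, average the last token's attention over heads to score each prompt token. A hypothetical sketch (the `input_attribution` helper and all shapes are illustrative, not part of any API):

```python
import numpy as np

def input_attribution(attn, n_input):
    """Given one layer's attention weights (heads, seq, seq), return the
    mean attention that the last token pays to each of the first
    n_input (prompt) tokens, renormalised over those tokens."""
    last_row = attn[:, -1, :n_input]      # (heads, n_input)
    w = last_row.mean(axis=0)             # average over heads
    return w / w.sum()

rng = np.random.default_rng(1)
raw = rng.random((8, 6, 6))               # 8 heads, seq_len 6
attn = raw / raw.sum(-1, keepdims=True)   # rows sum to 1, like softmax output
weights = input_attribution(attn, n_input=4)
print(weights.shape)                      # (4,)
```

The resulting per-prompt-token weights are what a heatmap overlay on the input text would plot.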

@Dineshkumar-Anandan-ZS0367

Apologies for my mistake. I have already integrated scores using the tensor logits. Thanks!

@WoutDeRijck
Author

I do not need the logits as well. I need the attention scores.

@HuiSiqi

HuiSiqi commented Jan 15, 2025

Any update of this? I also need to visualize the attention scores of decoder-based models.


This issue has been automatically marked as stale because it has not had any activity within 90 days. It will be automatically closed if no further activity occurs within 30 days. Leave a comment if you feel this issue should remain open. Thank you!

@github-actions github-actions bot added the stale Over 90 days of inactivity label Apr 16, 2025
@oneonlee

Any update on this?

@github-actions github-actions bot added unstale Received activity after being labelled stale and removed stale Over 90 days of inactivity labels Apr 22, 2025

4 participants