Skip to content

Commit 2f38518

Browse files
[Bugfix] Free cross attention block table for preempted-for-recompute sequence group. (#10013)
Signed-off-by: Kathy Yu <[email protected]>
1 parent 84c35c3 commit 2f38518

File tree

1 file changed

+1
-0
lines changed

1 file changed

+1
-0
lines changed

vllm/core/scheduler.py

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -1579,6 +1579,7 @@ def _preempt_by_recompute(
15791579
seq.status = SequenceStatus.WAITING
15801580
self.free_seq(seq)
15811581
seq.reset_state_for_recompute()
1582+
self._free_seq_group_cross_attn_blocks(seq_group)
15821583

15831584
def _preempt_by_swap(
15841585
self,

0 commit comments

Comments
 (0)