Skip to content

Commit 56a31a3

Browse files
ekagra-ranjanAlex4210987
authored andcommitted
[Spec Decode] Fix input triton kernel for eagle (vllm-project#15909)
Signed-off-by: xinyuxiao <[email protected]>
1 parent 606a702 commit 56a31a3

File tree

1 file changed

+1
-2
lines changed

1 file changed

+1
-2
lines changed

vllm/v1/spec_decode/eagle.py

Lines changed: 1 addition & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -250,13 +250,12 @@ def prepare_input_kernel(
250250
num_tokens = end_pos - start_pos
251251

252252
index_start = tl.load(cu_query_lens_ptr + pid)
253-
indices = index_start + tl.arange(0, BLOCK_SIZE)
254253

255254
num_blocks = tl.cdiv(num_tokens, BLOCK_SIZE)
256255
for i in tl.range(num_blocks):
257256
offset = i * BLOCK_SIZE + tl.arange(0, BLOCK_SIZE)
258257
tl.store(
259258
out_ptr + start_pos + offset,
260-
indices,
259+
index_start + offset,
261260
mask=offset < num_tokens,
262261
)

0 commit comments

Comments
 (0)