Skip to content

Commit 6007a75

Browse files
committed
add todo for split/join
1 parent b17af98 commit 6007a75

File tree

1 file changed

+3
-0
lines changed

1 file changed

+3
-0
lines changed

src/deepsparse/v2/text_generation/process_inputs.py

+3
Original file line numberDiff line numberDiff line change
@@ -121,5 +121,8 @@ def run(
121121
frequency_penalty=generation_config.repetition_penalty,
122122
)
123123

124+
# TODO: move this step to prep_for_prefill and add attention mask to the output
125+
# this will allow us to split/join more easily when processing multiple prompts
126+
# in parallel
124127
tokens = input_ids[attention_mask.nonzero()].tolist()
125128
return {"tokens": tokens}, inference_state_update

0 commit comments

Comments
 (0)