Skip to content

Commit 4be8a07

Browse files
committed
more lenient
1 parent 043e7c4 commit 4be8a07

File tree

1 file changed

+1
-1
lines changed

1 file changed

+1
-1
lines changed

src/deepsparse/transformers/pipelines/text_generation.py

+1-1
Original file line numberDiff line numberDiff line change
@@ -834,7 +834,7 @@ def engine_forward(
834834
generated_tokens.append(token)
835835
generated_logits.append(logits)
836836

837-
if session.total_num_processed_tokens >= session.capacity:
837+
if session.total_num_processed_tokens > session.capacity:
838838
# if the kv cache is full, stop generation
839839
finished_reason.append(FinishReason.CAPACITY)
840840
break

0 commit comments

Comments
 (0)