We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
1 parent 632c82d commit 4735fadCopy full SHA for 4735fad
vllm/v1/sample/tpu/metadata.py
@@ -31,8 +31,9 @@ class TPUSupportedSamplingMetadata:
31
32
all_greedy: bool = True
33
34
- # Maximum number of top logprobs requested in current batch.
35
- # TODO specify why bool
+ # Whether logprobs are to be gathered in this batch of request. To balance
+ # out compile time and runtime, a fixed `max_number_logprobs` value is used
36
+ # when gathering logprobs, regardless of the values specified in the batch.
37
logprobs: bool = False
38
39
# TODO No penalties for now
0 commit comments