We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
1 parent b12f085 commit 288fbcbCopy full SHA for 288fbcb
vllm/v1/core/kv_cache_manager.py
@@ -299,9 +299,7 @@ def get_num_common_prefix_blocks(
299
300
While all scheduled requests must be in the RUNNING state, the inverse
301
is not necessarily true. There may be RUNNING requests that are not
302
- scheduled in the current step. As of 1/1/2025, the scheduler does not
303
- allow this case, but it is possible in the future, as we allow more
304
- flexible scheduling.
+ scheduled in the current step.
305
306
This can result in an edge case where the number of common prefix blocks
307
is 0, even though all scheduled requests share a common prefix. This
0 commit comments