You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: docs/source/design/v1/prefix_caching.md
+2Lines changed: 2 additions & 0 deletions
Original file line number
Diff line number
Diff line change
@@ -92,6 +92,8 @@ To improve privacy in shared environments, vLLM supports isolating prefix cache
92
92
93
93
With this setup, cache sharing is limited to users or requests that explicitly agree on a common salt, enabling cache reuse within a trust group while isolating others.
94
94
95
+
> **Note:** Cache isolation is not supported in engine V0.
96
+
95
97
## Data Structure
96
98
97
99
The prefix caching in vLLM v1 is implemented in the KV cache manager. The basic building block is the “Block” data class (simplified):
0 commit comments