You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Hi !
I tried lots of models, quants and not, and the result are always the same as this video:
bug_vllm.mp4
As you can see, when the second stream prompt is also called corrupted tokens starts
I also tried curl and js fetch requests and the result (concurrents) are the same.
reacted with thumbs up emoji reacted with thumbs down emoji reacted with laugh emoji reacted with hooray emoji reacted with confused emoji reacted with heart emoji reacted with rocket emoji reacted with eyes emoji
-
Hi !
I tried lots of models, quants and not, and the result are always the same as this video:
bug_vllm.mp4
As you can see, when the second stream prompt is also called corrupted tokens starts
I also tried curl and js fetch requests and the result (concurrents) are the same.
Any hints?
Beta Was this translation helpful? Give feedback.
All reactions