As documented in #550, the default vLLM configuration could be improved and better documented. A startupProbe on /health is the right default for vLLM, because vLLM does not start its server until the model has finished loading, which can take a very long time. The right probe tunables may vary by deployment.
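To make the tunables concrete, here is a minimal sketch of how a startupProbe for /health might be sized. Kubernetes gives a container roughly `failureThreshold * periodSeconds` to pass its startup probe, so the threshold can be derived from the longest model load you want to tolerate. The helper name `startup_probe`, the 30-minute load budget, the 10-second period, and port 8000 are illustrative assumptions, not values taken from this issue or from #550.

```python
import math


def startup_probe(max_load_seconds: int, period_seconds: int = 10,
                  port: int = 8000) -> dict:
    """Build a Kubernetes startupProbe spec (as a plain dict) targeting /health.

    Hypothetical helper: the kubelet allows about
    failureThreshold * periodSeconds for startup before restarting the
    container, so the threshold is derived from the load-time budget.
    """
    if max_load_seconds <= 0 or period_seconds <= 0:
        raise ValueError("durations must be positive")
    # Round up so the total budget is never shorter than max_load_seconds.
    failure_threshold = math.ceil(max_load_seconds / period_seconds)
    return {
        "httpGet": {"path": "/health", "port": port},  # port is an assumption
        "periodSeconds": period_seconds,
        "failureThreshold": failure_threshold,
    }


# Assumed budget: tolerate up to 30 minutes of model loading.
probe = startup_probe(max_load_seconds=1800)
print(probe)
```

The resulting dict maps directly onto the `startupProbe` field of a container spec. Larger models or slower storage would call for a larger `max_load_seconds`.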