-
As per the title, what models/formats can vLLM actually serve? I am no longer going to iterate through documentation and have it fail, so someone please let me know what vLLM wants. safetensors error:
GGUF error:
|
Beta Was this translation helpful? Give feedback.
Replies: 2 comments
-
Could you plz provide your env details? |
Beta Was this translation helpful? Give feedback.
-
this worked well vllm seems to be running. |
Beta Was this translation helpful? Give feedback.
Could you plz provide your env details?
For
safetensors error
: seems to be that you are using BNB, you should append--quantization bitsandbytes --load-format bitsandbytes
when starting up