Skip to content

Commit c661f8a

Browse files
MathieuBorderewangxiyuan
authored andcommitted
config.py: Clarify that only local GGUF checkpoints are supported. (vllm-project#18623)
Signed-off-by: Mathieu Bordere <[email protected]>
1 parent 2b10ba7 commit c661f8a

File tree

2 files changed

+5
-1
lines changed

2 files changed

+5
-1
lines changed

vllm/model_executor/layers/fused_moe/layer.py

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -41,6 +41,7 @@
4141
from .pplx_prepare_finalize import PplxPrepareAndFinalize
4242
else:
4343
fused_experts = None # type: ignore
44+
FusedMoEPermuteExpertsUnpermute = None # type: ignore
4445
FusedMoEPrepareAndFinalize = None # type: ignore
4546
if is_rocm_aiter_moe_enabled():
4647
from vllm.model_executor.layers.fused_moe.rocm_aiter_fused_moe import ( # noqa: E501

vllm/transformers_utils/config.py

Lines changed: 4 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -300,7 +300,10 @@ def get_config(
300300
" - For Hugging Face models: ensure the presence of a "
301301
"'config.json'.\n"
302302
" - For Mistral models: ensure the presence of a "
303-
"'params.json'.\n").format(model=model)
303+
"'params.json'.\n"
304+
"3. For GGUF: pass the local path of the GGUF checkpoint.\n"
305+
" Loading GGUF from a remote repo directly is not yet "
306+
"supported.\n").format(model=model)
304307

305308
raise ValueError(error_message) from e
306309

0 commit comments

Comments
 (0)