[Config] Enhance ModelRecord #435

CharlieFRuan · 2024-05-30T04:41:00Z

There are three changes to ModelRecord this PR brings:

1. Update model ids to match HF repo name

We rename modelId in webllm.prebuiltAppConfig to be the exact same as the HF repo name. For most models, that means we simply append -MLC to the modelId. For the low-context version of the model, we would have {HF-repo}-1k, suggesting 1k context length.

As a result, we rename Phi2 and phi1.5 models since their modelId did not match with the repo name

Phi2-q4f32_1 → phi-2-q4f32_1-MLC
Phi1.5-q4f16_1 → phi-1_5-q4f16_1-MLC

2. Rename `model_url` and `model_lib_url` to `model` and `model_lib`

To better match with other platforms of MLC-LLM (e.g. iOS, Android), we rename the ModelRecord fields.

3. Remove `resolve/main` from `model` URL

Instead of "https://huggingface.co/mlc-ai/Llama-3-8B-Instruct-q4f16_1-MLC/resolve/main/", we now make it "https://huggingface.co/mlc-ai/Llama-3-8B-Instruct-q4f16_1-MLC/"; note the trailing / will be appended by us if it is not there.

Example

As an example, we would have:

    {
      model: "https://huggingface.co/mlc-ai/Llama-3-8B-Instruct-q4f16_1-MLC",
      model_id: "Llama-3-8B-Instruct-q4f16_1-MLC",
      model_lib: "path/to/Llama-3-8B-Instruct-q4f16_1-ctx1k_cs1k-webgpu.wasm",
    },

instead of

    {
      model_url: "https://huggingface.co/mlc-ai/Llama-3-8B-Instruct-q4f16_1-MLC/resolve/main/",
      model_id: "Llama-3-8B-Instruct-q4f16_1",
      model_lib_url: "path/to/Llama-3-8B-Instruct-q4f16_1-ctx4k_cs1k-webgpu.wasm",
    },

Co-authored-by: Nestor Qin <[email protected]>

### Changes Main changes include: - New prebuilt models: - Phi3-mini - StableLM-2-zephyr-1.6B - Qwen1.5-1.8B - Hermes2-Pro-Llama-3-8B to prebuilt models - Updates on `ModelRecord` fields - For detail see: #435 - Update all WASMs - For detail see: #433 - Update all WASMs to v0.2.39 - Support grammar for Llama3, hence update examples/json-mode to use `Llama3` and `Hermes2-pro-Llama3-8B` for function calling in `examples/json-schema` - Use `loglevel` package: - For details see #427 - Fix `index.js.map` issue for Vite - #420 - Enhance error handling and ServiceWorker ### TVMjs TVMjs compiled at apache/tvm@71f7af7 - Main changes include: - apache/tvm#17031 - apache/tvm#17028 - apache/tvm#17021 ### WASM version - All wasms updated to 0.2.39 via mlc-ai/binary-mlc-llm-libs#123 for new MLC-LLM runtime (mainly grammar)

There are three changes to `ModelRecord` this PR brings: ### 1. Update model ids to match HF repo name We rename `modelId` in `webllm.prebuiltAppConfig` to be the exact same as the HF repo name. For most models, that means we simply append `-MLC` to the `modelId`. For the low-context version of the model, we would have `{HF-repo}-1k`, suggesting 1k context length. As a result, we rename Phi2 and phi1.5 models since their `modelId` did not match with the repo name - `Phi2-q4f32_1` → `phi-2-q4f32_1-MLC` - `Phi1.5-q4f16_1` → `phi-1_5-q4f16_1-MLC` ### 2. Rename `model_url` and `model_lib_url` to `model` and `model_lib` To better match with other platforms of MLC-LLM (e.g. iOS, Android), we rename the `ModelRecord` fields. ### 3. Remove `resolve/main` from `model` URL Instead of `"https://huggingface.co/mlc-ai/Llama-3-8B-Instruct-q4f16_1-MLC/resolve/main/"`, we now make it `"https://huggingface.co/mlc-ai/Llama-3-8B-Instruct-q4f16_1-MLC/"`; note the trailing `/` will be appended by us if it is not there. ### Example As an example, we would have: ```typescript { model: "https://huggingface.co/mlc-ai/Llama-3-8B-Instruct-q4f16_1-MLC", model_id: "Llama-3-8B-Instruct-q4f16_1-MLC", model_lib: "path/to/Llama-3-8B-Instruct-q4f16_1-ctx1k_cs1k-webgpu.wasm", }, ``` instead of ```typescript { model_url: "https://huggingface.co/mlc-ai/Llama-3-8B-Instruct-q4f16_1-MLC/resolve/main/", model_id: "Llama-3-8B-Instruct-q4f16_1", model_lib_url: "path/to/Llama-3-8B-Instruct-q4f16_1-ctx4k_cs1k-webgpu.wasm", }, ``` --------- Co-authored-by: Nestor Qin <[email protected]>

### Changes Main changes include: - New prebuilt models: - Phi3-mini - StableLM-2-zephyr-1.6B - Qwen1.5-1.8B - Hermes2-Pro-Llama-3-8B to prebuilt models - Updates on `ModelRecord` fields - For detail see: mlc-ai#435 - Update all WASMs - For detail see: mlc-ai#433 - Update all WASMs to v0.2.39 - Support grammar for Llama3, hence update examples/json-mode to use `Llama3` and `Hermes2-pro-Llama3-8B` for function calling in `examples/json-schema` - Use `loglevel` package: - For details see mlc-ai#427 - Fix `index.js.map` issue for Vite - mlc-ai#420 - Enhance error handling and ServiceWorker ### TVMjs TVMjs compiled at apache/tvm@71f7af7 - Main changes include: - apache/tvm#17031 - apache/tvm#17028 - apache/tvm#17021 ### WASM version - All wasms updated to 0.2.39 via mlc-ai/binary-mlc-llm-libs#123 for new MLC-LLM runtime (mainly grammar)

CharlieFRuan and others added 3 commits May 29, 2024 23:20

Update model ids to match HF repo name

5cc81b9

Co-authored-by: Nestor Qin <[email protected]>

[ModelRecord] Update model_lib_url to model_lib, and model_url to model

45f41e6

Remove resolve/main from model record input

c5adc8c

CharlieFRuan changed the title ~~[ModelRecord] Enhance ModelRecord~~ [Config] Enhance ModelRecord May 30, 2024

CharlieFRuan mentioned this pull request May 30, 2024

[Config] update model ids to match hf url #428

Closed

CharlieFRuan merged commit 896b012 into mlc-ai:main May 30, 2024
1 check passed

This was referenced May 30, 2024

[Tracking][WebLLM] Runtime updates #429

Closed

[Version] Bump version to 0.2.39, update prebuilt WASMs #436

Merged

bdpoff mentioned this pull request May 31, 2024

cleanModelUrl breaks the ability to specify branch names other than 'main' #443

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Config] Enhance ModelRecord #435

[Config] Enhance ModelRecord #435

CharlieFRuan commented May 30, 2024

[Config] Enhance ModelRecord #435

[Config] Enhance ModelRecord #435

Conversation

CharlieFRuan commented May 30, 2024

1. Update model ids to match HF repo name

2. Rename model_url and model_lib_url to model and model_lib

3. Remove resolve/main from model URL

Example

2. Rename `model_url` and `model_lib_url` to `model` and `model_lib`

3. Remove `resolve/main` from `model` URL