epic: `cortex models pull <ID_or_URL>` #1077

Van-QA · 2024-08-16T09:57:49Z

Goal

Pull from Jan's curated models
Pull from Huggingface based on URL

Tasklist

Bugs

bug: cortex pull from hf times out #1017
bug: Model corruption during download leads to loading failure #1139
bug: cortex run or pull redownloads existing model multiple times #1344
bug: Download models/engine failed cause api server stop #1010 (API server process crash edge case)
#777

Out-of-scope

Pause Functionality feat: Add pause functionality to model downloader jan#3519 (likely Sprint 21)
Multi-mirror? (e.g. if HF down)
Built in models to move back to Jan huggingface? (i.e. update URL)

The text was updated successfully, but these errors were encountered:

namchuai · 2024-08-21T00:47:26Z

Updating a flow chart on fetch and downloading model from Hugging Face

namchuai · 2024-08-21T00:49:00Z

After downloading model, should we automatically check for engine and init engine (if neccessary)?

dan-menlo · 2024-09-06T08:09:01Z

@namchuai Can I check, does your implementation of cortex pull cover the case of a Huggingface URL to a specific GGUF model?

cortex pull https://huggingface.co/BafS/gemma-2-2b-it-Q4_K_M-GGUF/blob/main/gemma-2-2b-it-q4_k_m.gguf

dan-menlo · 2024-09-06T08:10:35Z

After downloading model, should we automatically check for engine and init engine (if neccessary)?

@namchuai From a UX perspective, I think we should not automatically init a model:

We should show useful feedback to pre-empt problems
e.g. "To run this model, you will need to initialize the TensorRT-LLM engine, i.e. cortex engines init tensorrt-llm

namchuai · 2024-09-06T08:40:29Z

@namchuai Can I check, does your implementation of cortex pull cover the case of a Huggingface URL to a specific GGUF model?
cortex pull https://huggingface.co/BafS/gemma-2-2b-it-Q4_K_M-GGUF/blob/main/gemma-2-2b-it-q4_k_m.gguf

Currently, we are not cover this case. This will be supported in upcoming release, I think.

gabrielle-ong · 2024-10-05T10:04:56Z

Marking as complete! 🎉

vansangpfiev mentioned this issue Sep 2, 2024

discussion: https library #1080

Closed

2 tasks

vansangpfiev assigned namchuai Aug 19, 2024

imtuyethan transferred this issue from another repository Sep 2, 2024

freelerobot changed the title ~~Download models~~ feat: Download models Sep 6, 2024

freelerobot changed the title ~~feat: Download models~~ feat: cortex models pull <> Sep 6, 2024

freelerobot changed the title ~~feat: cortex models pull <>~~ feat: cortex models pull <ID_or_URL> Sep 6, 2024

freelerobot added the category: model management Model pull, yaml, model state label Sep 6, 2024

dan-menlo mentioned this issue Sep 6, 2024

feat: cortex pull supports full HF URL download #1004

Closed

namchuai mentioned this issue Sep 6, 2024

feat: Download model directly from HF url #1145

Closed

dan-menlo added the type: epic A major feature or initiative label Sep 8, 2024

dan-menlo mentioned this issue Sep 10, 2024

epic: Fix Local Engine issues (llama.cpp) menloresearch/jan#3614

Closed

10 tasks

dan-menlo changed the title ~~feat: cortex models pull <ID_or_URL>~~ epic: cortex models pull <ID_or_URL> Sep 26, 2024

gabrielle-ong closed this as completed Oct 5, 2024

gabrielle-ong moved this from Review + QA to Completed in Menlo Oct 5, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

epic: `cortex models pull <ID_or_URL>` #1077

epic: `cortex models pull <ID_or_URL>` #1077

Van-QA commented Aug 16, 2024 •

edited by gabrielle-ong

Loading

namchuai commented Aug 21, 2024

namchuai commented Aug 21, 2024

dan-menlo commented Sep 6, 2024

dan-menlo commented Sep 6, 2024 •

edited

Loading

namchuai commented Sep 6, 2024 •

edited

Loading

gabrielle-ong commented Oct 5, 2024

epic: cortex models pull <ID_or_URL> #1077

epic: cortex models pull <ID_or_URL> #1077

Comments

Van-QA commented Aug 16, 2024 • edited by gabrielle-ong Loading

Goal

Tasklist

Bugs

Out-of-scope

namchuai commented Aug 21, 2024

namchuai commented Aug 21, 2024

dan-menlo commented Sep 6, 2024

dan-menlo commented Sep 6, 2024 • edited Loading

namchuai commented Sep 6, 2024 • edited Loading

gabrielle-ong commented Oct 5, 2024

epic: `cortex models pull <ID_or_URL>` #1077

epic: `cortex models pull <ID_or_URL>` #1077

Van-QA commented Aug 16, 2024 •

edited by gabrielle-ong

Loading

dan-menlo commented Sep 6, 2024 •

edited

Loading

namchuai commented Sep 6, 2024 •

edited

Loading