Replies: 1 comment 1 reply
-
Also, attach appropriate licenses as well as possibly a legal disclaimer regarding quantization - something similar to https://github.com/pytorch/executorch/tree/main/examples#disclaimer |
Beta Was this translation helpful? Give feedback.
1 reply
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
Uh oh!
There was an error while loading. Please reload this page.
-
These are the proposed guidelines for models uploaded to Hugging Face Community
One thing that's missing that we should nail down is a standard naming for quant schemes. Right now things aren't standardized enough for people to infer things. GGUF does a good job at this.
Guidelines
Repo Naming
Names should include model, number of parameters and target device
{Model}-{Params}-{target-device}
Example: executorch-community/Llama-3.2-1B-Instruct-ARM-CPU
Model File Naming
{Model}-{Params}-{quantscheme}-{bitsize}.pte
Example: llama-3.2-1B-SQ-4-8.pte
Quant Lookup
General format
{Scheme as Abbreviation}-{weight bit size}-{activation bit size}
Beta Was this translation helpful? Give feedback.
All reactions