-
Notifications
You must be signed in to change notification settings - Fork 414
[Feature]: DeepSeek-R1-Distill-Qwen or similar distilled DeepSeek gguf support #1059
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Comments
No I don't think so, not yet. We're currently about 5 weeks out of date. Glancing through the PRs, I can see these two which looks relevant:
The next version update is already in the works, I think there's just one major issue left to resolve which I hope to get to this weekend. |
OK! The last cpp version should add support. Thank you! |
Pretty excited to see the new binaries added. Not just deepseek, but it'll be an opportunity to try and incorporate the changes they've made to support jinja templates and as of a few days ago the tooling support. |
Is the new and improved Jinja support in the core llama.cpp, or just in the examples (e.g. common.cpp)? A lot of big changes like that land in examples (which LLamaSharp does not include) before they make their way into the main project. |
it's in core. A pretty big one too, I think some of the features related to Deepthink might require it. I know the tooling stuff does ggml-org/llama.cpp#11016 |
Martin, do we have support for DeepSeek-R1-Distill-Qwen gguf or similar already (distilled version of deepseek)?
Thank you.
The text was updated successfully, but these errors were encountered: