Add workflow for tensorcore CUDA wheels #40

Open: wants to merge 2 commits into main

oobabooga

This adds a workflow for llama-cpp-python CUDA wheels with tensor core support and appends it to the "Build Wheels for New Release with Prioritization" workflow.

I have been having tremendous difficulty compiling those wheels, because I get flooded with errors saying

API rate limit exceeded for installation ID

that prevent some of the wheels from being uploaded to GitHub. As a result, I have to run the same workflow several times for each new release.
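
One possible way to cut down on the manual reruns described above is to retry the failing upload step with exponential backoff instead of restarting the whole workflow. The sketch below only illustrates that idea: `RateLimitError`, `retry_with_backoff`, and the wrapped upload callable are hypothetical names, not part of this PR or of any GitHub API, and the delays would likely need to be much longer in practice if the rate limit only resets after a fixed window.

```python
import time
from typing import Callable, TypeVar

T = TypeVar("T")


class RateLimitError(Exception):
    """Hypothetical stand-in for an 'API rate limit exceeded' failure."""


def retry_with_backoff(
    action: Callable[[], T],
    max_attempts: int = 5,
    base_delay: float = 1.0,
    sleep: Callable[[float], None] = time.sleep,
) -> T:
    """Call `action`, retrying on RateLimitError with exponentially growing delays."""
    for attempt in range(max_attempts):
        try:
            return action()
        except RateLimitError:
            if attempt == max_attempts - 1:
                raise  # out of attempts: surface the error to the caller
            # Wait base_delay, then 2x, 4x, ... before the next attempt.
            sleep(base_delay * 2 ** attempt)
    raise ValueError("max_attempts must be at least 1")
```

A caller would wrap whatever function performs the wheel upload, for example `retry_with_backoff(lambda: upload_wheel(path))`, where `upload_wheel` is again a hypothetical placeholder for the real upload step.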

Including this additional workflow will increase the compilation time for your actions, so I understand if you would prefer not to include it in the repository. In that case, I can keep maintaining those additional wheels myself.

PS: macOS wheels have been building successfully for all versions, so they can be added back if you haven't done so already.

@Smartappli

Please create a batch file and integrate your script into build-wheels-full-release.yml. Also adapt the PowerShell scripts (.ps1 files).
