Add workflow for tensorcore CUDA wheels #40
This adds a workflow for llama-cpp-python CUDA wheels with tensor cores support, and appends it to the `Build Wheels for New Release with Prioritization` workflow.
`llama_cpp_python_cuda_tensorcores`
I have been having tremendous difficulty compiling those wheels: I get flooded with errors that cause some of the wheels not to be uploaded to GitHub, so I end up having to run the same workflow several times for each new release.
Including this additional workflow will increase the compilation time for your actions, so I understand if you would rather not include it in the repository. In that case, I can keep maintaining those additional wheels myself.
PS: macOS wheels have been building successfully for all versions, so they can be added back if you haven't already.