-
Notifications
You must be signed in to change notification settings - Fork 125
Implement urKernelGetSuggestedLocalWorkSize #1385
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Implement urKernelGetSuggestedLocalWorkSize #1385
Conversation
Do we need to add some conformance tests? |
Yang has asked me about this, but I can't think of anything meaningful. Do you have some ideas for adding tests for this? |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
CUDA and HIP adapters LGTM
Kindly ping @oneapi-src/unified-runtime-level-zero-write , please help review the PR, thank you very much! |
Kindly ping @oneapi-src/unified-runtime-level-zero-write , please help review the PR, thank you very much! |
Hi @nrspruit, @raiyanla, @winstonzhang-intel, could you please review this PR. Thanks! |
Hi @nrspruit, @raiyanla, @winstonzhang-intel, could you please review this PR. Thanks! |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM for level zero
Hi @kbenzie, I got all the approval needed. How do you think I should proceed? |
I've added the ready to merge label, its now in the merge queue so we'll merge this soon. |
Thank you! |
Its unclear why the CodeQL Windows job is failing, its not an issue with this PR so I'll merge anyway. |
UR PR: oneapi-src/unified-runtime#1385 --------- Co-authored-by: Kenneth Benzie (Benie) <[email protected]>
This PR try to implement the API
urKernelGetSuggestedLocalWorkSize
, discussed in #1270.SYCLOS PR: intel/llvm#12902
Also fix:
LocalWorkSize
is provided,urEnqueueKernelLaunch()
will readLocalWorkSize
without respectingworkDim
.