Skip to content

[CANN]Support Opt CONV_TRANSPOSE_1D and ELU #12786

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged
merged 7 commits into from
Apr 9, 2025

Conversation

noemotiovon
Copy link
Contributor

Why is this PR needed?

Optimize the CONV_TRANSPOSE_1D , ELU operator in the CANN backend with the aclnn acceleration library.

Test

  # CONV_TRANSPOSE_1D
  Backend 1/2: CANN0
  Device description: Ascend910B3
  Device memory: 62432 MB (62164 MB free)

  CONV_TRANSPOSE_1D(ne_input=[197,32,1,1],ne_kernel=[16,32,32,1],s0=1,p0=0,d0=1): OK
  CONV_TRANSPOSE_1D(ne_input=[3,2,1,1],ne_kernel=[2,3,2,1],s0=3,p0=0,d0=1): OK
  CONV_TRANSPOSE_1D(ne_input=[3,2,1,1],ne_kernel=[2,3,2,1],s0=2,p0=0,d0=1): OK
  CONV_TRANSPOSE_1D(ne_input=[3,2,1,1],ne_kernel=[2,3,2,1],s0=1,p0=0,d0=1): OK
  CONV_TRANSPOSE_1D(ne_input=[3,2,1,1],ne_kernel=[3,2,2,1],s0=2,p0=0,d0=1): OK
  CONV_TRANSPOSE_1D(ne_input=[3,2,1,1],ne_kernel=[3,2,2,1],s0=1,p0=0,d0=1): OK
  CONV_TRANSPOSE_1D(ne_input=[3,2,1,1],ne_kernel=[3,1,2,1],s0=1,p0=0,d0=1): OK
  CONV_TRANSPOSE_1D(ne_input=[2,1,1,1],ne_kernel=[3,1,1,1],s0=1,p0=0,d0=1): OK
  5294/5294 tests passed
  Backend CANN0: OK

Backend 2/2: CPU
  Skipping
2/2 backends passed
OK
  
  # ELU
Backend 1/2: CANN0
  Device description: Ascend910B3
  Device memory: 62432 MB (62164 MB free)

  ELU(type=f16,ne_a=[128,2,2,2],v=0): OK
  ELU(type=f16,ne_a=[5,7,11,13],v=0): OK
  ELU(type=f16,ne_a=[128,2,2,2],v=1): OK
  ELU(type=f16,ne_a=[5,7,11,13],v=1): OK
  ELU(type=f32,ne_a=[128,2,2,2],v=0): OK
  ELU(type=f32,ne_a=[5,7,11,13],v=0): OK
  ELU(type=f32,ne_a=[128,2,2,2],v=1): OK
  ELU(type=f32,ne_a=[5,7,11,13],v=1): OK
  5294/5294 tests passed
  Backend CANN0: OK

Backend 2/2: CPU
  Skipping
2/2 backends passed
OK

@github-actions github-actions bot added the ggml changes relating to the ggml tensor library for machine learning label Apr 7, 2025
@hipudding
Copy link
Collaborator

Need rebase.

@hipudding hipudding assigned hipudding and noemotiovon and unassigned hipudding Apr 7, 2025
@hipudding hipudding added the Ascend NPU issues specific to Ascend NPUs label Apr 7, 2025
noemotiovon added 3 commits April 8, 2025 03:43
@noemotiovon
Copy link
Contributor Author

The source of openEuler OS is not stable these days. I have tested this PR in my local environment.

noemotiovon added 2 commits April 8, 2025 10:58
@noemotiovon noemotiovon requested a review from ngxson as a code owner April 9, 2025 02:20
@github-actions github-actions bot added the devops improvements to build systems and github actions label Apr 9, 2025
@noemotiovon
Copy link
Contributor Author

CI issue has been resolved!

@hipudding hipudding merged commit 6e1c4ce into ggml-org:master Apr 9, 2025
51 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Ascend NPU issues specific to Ascend NPUs devops improvements to build systems and github actions ggml changes relating to the ggml tensor library for machine learning
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants