Skip to content

Naming convention of operator in etdump #11089

Answered by GregoryComer
yujiaoliang asked this question in Q&A
Discussion options

You must be logged in to vote

Are you looking for the actual kernel implementation (in code)? Or the corresponding op in the original graph?

If the question is in regards to the kernel implementation, the kernel code lives in the XNNPACK library. The naming convention for XNNPACK ops in this case is (DimOrder, Activation DType, Output Activation DType, and Weight DType). QD8 means dynamically quantized activations, 8-bit. Output activation dtype is f32. And weights are 4-bit blockwise (groupwise) quantized.

The actual kernels in XNNPACK live under https://github.com/google/XNNPACK/tree/master/src/qd8-f32-qb4w-gemm. Which specific qb4w kernel depends on your hardware at runtime, with the kernel selection logic living in

Replies: 1 comment 1 reply

Comment options

You must be logged in to vote
1 reply
@yujiaoliang
Comment options

Answer selected by yujiaoliang
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
2 participants