We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
CutlassSegmentGEMMSM90
1 parent 39323cd commit fdda958Copy full SHA for fdda958
csrc/flashinfer_gemm_sm90_ops.cu
@@ -19,7 +19,7 @@ void CutlassSegmentGEMMSM90(at::Tensor float_workspace_buffer, at::Tensor int_wo
19
at::Tensor all_problems, at::Tensor x_ptr, at::Tensor w_ptr,
20
at::Tensor y_ptr, at::Tensor x_stride, at::Tensor weight_stride,
21
at::Tensor y_stride, at::Tensor empty_x_data, bool weight_column_major,
22
- at::Tensor plan_info_vec, int64_t cuda_stream);
+ int64_t cuda_stream);
23
24
TORCH_LIBRARY_FRAGMENT(TORCH_EXTENSION_NAME, m) {
25
// "Cutlass Segment GEMM operator for SM90"
0 commit comments