composable_kernel
composable_kernel copied to clipboard
Grouped convolution forward missed the instances of datatype fp32 and int8 for layout (NHWGC, GKYXC, NHWGK)
Grouped convolution forward missed the instances of datatype fp32, bfp16 and int8 for layout (NHWGC, GKYXC, NHWGK)
@iq136boy Could you specify which DeviceOp exactly?
@iq136boy Could you specify which DeviceOp exactly?
@zjing14, the one using in the client example: DeviceGroupedConvFwdMultipleD