composable_kernel

Composable Kernel: Performance Portable Programming Model for Machine Learning Tensor Operators

Results 276 composable_kernel issues

We found numerical errors in the convolution backward data kernel when running tests with the detectron2 data. I put the data and the error log file [here](http://ginger.amd.com/share/detectron2_data/). The log contains...

bug
urgency_blocker

- Remove 1d and 2d grouped convolution instances
- Move MakeDescriptor.... functions to transform headers

This PR sets "-mllvm -enable-post-misched=0" as a default compile option for CK. This option improves gemm_universal performance and ensures correctness.

quality

- Refactor f8_t and bf8_t
- Update conversion methods
- Update load method
- Add dynamic buffer custom types support
- Update threadwise conversion

Right now custom types are supported...

### Problem Description

    // clang-format off
    using DeviceGemmInstance1 = ck::tensor_operation::device::DeviceGemm_Xdl_CShuffle
        < ALayout,     // ALayout
          BLayout,     // BLayout
          CLayout,     // CLayout
          ADataType,   // ADataType
          BDataType,   // BDataType
          CDataType,   // CDataType
          AccDataType,...