composable_kernel
composable_kernel copied to clipboard
Added client example for bwd qloop profiling
Added client example for bwd qloop v1, v2, light v1 and light v2. Now we can do profiling for flash attention backward qloop.