add Conv singlestage
This commit set add singlestage conv for cutlass
@hwu36 Hi,
Could you help review this patch, whether this singlestage patch could be merged?
Thx, Lei
Thanks, I will take a look and run the tests. If any change is needed, I will do it myself and push to your branch.
@leiwen83 can you provide performance results to justify this PR?
@leiwen83 can you provide performance results to justify this PR?
Seem to me there is no performance gain for this single stage conv being added. So maybe we could close this PR for now.
This is a useful feature for T4 or maybe small ampere cards. I will work on it hopefully this month.
We do have singlestage mma pipeline for GEMMs. Do we have some use cases for T4 where single stage wins over 2-staged pipeline?
We do have singlestage mma pipeline for GEMMs. Do we have some use cases for T4 where single stage wins over 2-staged pipeline?
Yes, most kernels picked by cublas on t4 are single stage ones.
This PR has been labeled inactive-30d due to no recent activity in the past 30 days. Please close this PR if it is no longer required. Otherwise, please respond with a comment indicating any updates. This PR will be labeled inactive-90d if there is no activity in the next 60 days.
Hi @leiwen83 ,
Sorry for the delay. I am working on this one now. Have you tested your code in any way?
This PR has been labeled inactive-30d due to no recent activity in the past 30 days. Please close this PR if it is no longer required. Otherwise, please respond with a comment indicating any updates. This PR will be labeled inactive-90d if there is no activity in the next 60 days.
This PR has been labeled inactive-90d due to no recent activity in the past 90 days. Please close this PR if it is no longer required. Otherwise, please respond with a comment indicating any updates.