Runxin Zhong

Results 1 comments of Runxin Zhong

Same question. It seems that the current code is not the final version and the attention still uses dense methods not sparse. Will it be open-source? @HaochengWan @2020zhangcheng @zhangcheng828 Thanks...