Shen Zhuoran

Results 11 issues of Shen Zhuoran

I was debugging a data-parallel forward mismatch when using `megablocks` (DP and non-DP give different forward results). During debugging, I tried to reproduce such difference minimally, and found that simply...