Shen Zhuoran
Results
11
issues of
Shen Zhuoran
I was debugging a data-parallel forward mismatch when using `megablocks` (DP and non-DP give different forward results). During debugging, I tried to reproduce such difference minimally, and found that simply...