FBGEMM icon indicating copy to clipboard operation
FBGEMM copied to clipboard

redesign parallelism for small B

Open brad-mengchi opened this issue 1 year ago • 5 comments

Summary: As titled, redesign for small B (1< B < 64) and small T (T<=320) case, since current implementation only benefits large B * T.

Differential Revision: D54270887

brad-mengchi avatar Feb 28 '24 07:02 brad-mengchi