torchrec icon indicating copy to clipboard operation
torchrec copied to clipboard

add GPU sync tests

Open iamzainhuda opened this issue 1 year ago • 1 comments

Summary: Added GPU sync tests to simulate gathering metric states on to rank 0 and computing. Tests don't cover this case before, which has resulted in SEVs in the past as users aren't aware of how RecMetrics collects and computes metrics.

Reviewed By: henrylhtsang

Differential Revision: D59173140

iamzainhuda avatar Jun 28 '24 20:06 iamzainhuda