torchrec

base example

Open. YLGH opened this issue 3 years ago • 1 comment

Base training loop examples

Run command:

torchx run -s local_cwd dist.ddp -j 1x8 --script train_dlrm.py

Some TODO items:

  1. Add NE/QPS metrics checkpointing
  2. Show saving this model and then loading it in later for inference
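
For TODO item 2, a minimal sketch of the save-then-load-for-inference flow might look like the following. It uses only plain PyTorch `state_dict` saving and is an illustration, not what `train_dlrm.py` does: `TinyModel`, `save_checkpoint`, and `load_for_inference` are hypothetical names, the model is a small unsharded stand-in, and a sharded torchrec model would likely need a distributed-aware checkpointing path instead.

```python
import io

import torch
from torch import nn


class TinyModel(nn.Module):
    """Hypothetical stand-in for the trained model (not the DLRM from train_dlrm.py)."""

    def __init__(self) -> None:
        super().__init__()
        self.emb = nn.EmbeddingBag(100, 8, mode="sum")  # pooled sparse feature
        self.fc = nn.Linear(8, 1)

    def forward(self, ids: torch.Tensor, offsets: torch.Tensor) -> torch.Tensor:
        return self.fc(self.emb(ids, offsets))


def save_checkpoint(model: nn.Module, f) -> None:
    # Save only the parameters, not the pickled module object.
    torch.save(model.state_dict(), f)


def load_for_inference(f) -> nn.Module:
    # Rebuild the architecture, load the weights, and switch to eval mode.
    model = TinyModel()
    model.load_state_dict(torch.load(f, map_location="cpu"))
    model.eval()
    return model


trained = TinyModel()

# An in-memory buffer keeps the sketch self-contained; a file path works the same way.
buffer = io.BytesIO()
save_checkpoint(trained, buffer)
buffer.seek(0)
restored = load_for_inference(buffer)

# Two bags of ids: [1, 2] and [3, 4].
ids = torch.tensor([1, 2, 3, 4])
offsets = torch.tensor([0, 2])
with torch.no_grad():
    out_trained = trained(ids, offsets)
    out_restored = restored(ids, offsets)
```

The restored model should produce the same outputs as the one that was saved, which is a cheap sanity check to run before serving.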

YLGH, Jun 17 '22 17:06

@YLGH has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

facebook-github-bot, Jul 08 '22 01:07