
Move the reference model (for train correctness checks) to CPU before starting tests

Open davidberard98 opened this issue 2 years ago • 0 comments

Stack from ghstack:

  • -> #1275

On some OSS models, we see CUDA out-of-memory (OOM) errors when train correctness checks are enabled.

For certain models, we can prevent this OOM by copying the reference model to CPU before running the tests.
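
A minimal sketch of the idea, not the actual change in this PR: the helper name `make_cpu_reference` and its `device` parameter are hypothetical. It assumes the model exposes a `.to(device)` method that moves it in place and returns it, as `torch.nn.Module.to` does. The model is moved to CPU before it is copied, so that the copy is never allocated on the GPU.

```python
import copy


def make_cpu_reference(model, device="cuda"):
    """Return a CPU-resident copy of `model`, leaving `model` on `device`.

    Hypothetical helper for illustration. Assumes `model.to(device)` moves
    the model in place (the behavior of torch.nn.Module.to).
    """
    # Move to CPU first so the deep copy is created in host memory
    # rather than as a second copy in GPU memory.
    model.to("cpu")
    reference = copy.deepcopy(model)
    # Restore the model under test to its original device.
    model.to(device)
    return reference
```

With a real `torch.nn.Module`, this would be called as `reference = make_cpu_reference(model, device="cuda")` before the tests start. The trade-off is an extra host/device round trip for the model and slower reference computations on CPU, in exchange for keeping only one copy of the weights in GPU memory.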

davidberard98 · Nov 03 '22 01:11