DeepSpeedExamples



Fix RLHF loss metrics & single-gpu training script

Open li-plus opened this issue 2 years ago • 2 comments

This PR fixes:

the actor/critic mean loss calculation
the step-3 training script for the 1.3B model on a single GPU
some typos
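
The description above does not say what was wrong with the mean loss calculation. As a generic illustration only, a mean over training steps can be computed by summing the per-step actor and critic losses and dividing by the number of steps actually recorded. The helper `mean_losses` below is hypothetical and is not taken from the DeepSpeedExamples code.

```python
def mean_losses(step_losses):
    """Average per-step (actor_loss, critic_loss) pairs over all recorded steps.

    `step_losses` is a hypothetical list of (actor_loss, critic_loss) floats,
    one pair per training step; it does not mirror any DeepSpeed-Chat structure.
    """
    if not step_losses:
        raise ValueError("no steps recorded")

    actor_sum = 0.0
    critic_sum = 0.0
    for actor_loss, critic_loss in step_losses:
        actor_sum += actor_loss
        critic_sum += critic_loss

    # Divide by the number of steps that contributed to the sums,
    # not by a separately tracked counter that could drift out of sync.
    n = len(step_losses)
    return actor_sum / n, critic_sum / n
```

Deriving the divisor from the same list that produced the sums keeps the two from disagreeing, which is one common source of skewed averages in training logs.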

Apr 22 '23 17:04 li-plus

@microsoft-github-policy-service agree

Apr 22 '23 17:04 li-plus