Lev Kurilenko
Lev Kurilenko
Hi @ZJXNEFU, Can you please provide the reproduction command you ran for Step 3 training (training script, actor/critic models, num GPUs, zero stage, etc)? I ran the following Step 3...
Hi @senthilps8, Thank you for providing feedback about codegen models having issues. I'd like to reproduce this on my end. Can you please provide an example of a Step 3...
Hi @ZJXNEFU, I believe this is due to the generate function not being replaced when there isn't a corresponding inference policy for the model you're using: https://github.com/microsoft/DeepSpeed/blob/93a81b5362a83bacd7b40c838295909f347e37af/deepspeed/runtime/hybrid_engine.py#L359 The `generate()` function...