Switchsyj

Results 3 issues of Switchsyj

**Describe the bug** Hi, I am new to deepspeed and training my RL with an actor-critic model(codegen-6B/shared params) and a reference model(codegen-6B). However, I get `exits with return code =...

bug
training

As described in `docs/execution.md`, the entire evaluation process should be ready in a few second with greedy decoding. However, I noticed the process has kept test script running in 13...

Hi, thanks for your great work! I would like to point out a potential bug in this code: add_special_tokens without checking embedding size is very dangerous especially for llama. In...