Switchsyj issues

Repositories
Issues
Comments

Results 3 issues of


                                            Switchsyj

[BUG]exit -9 while training with Codegen-6B

**Describe the bug** Hi, I am new to deepspeed and training my RL with an actor-critic model(codegen-6B/shared params) and a reference model(codegen-6B). However, I get `exits with return code =...

bug

training

Need help: my eval process is running quite slow

As described in `docs/execution.md`, the entire evaluation process should be ready in a few second with greedy decoding. However, I noticed the process has kept test script running in 13...

resize embedding after add_special_tokens

Hi, thanks for your great work! I would like to point out a potential bug in this code: add_special_tokens without checking embedding size is very dangerous especially for llama. In...