Junbo Li issues

Results 5 issues of


                                            Junbo Li

Missing if-else logic

https://github.com/Yujun-Shi/FedCLS/blob/master/main.py#L142 Does not initialize 'argument_path' if 'args.log_file_name' is specified.

Speed decrease during training

We established the environment and preprocessed the data as per the provided instructions. However, while executing the command ```bash scripts/runs/run_pile_baseline120M.sh```, we noticed a sudden reduction in speed after loading specific...

How to specify the available GPUs?

I have only one spare GPU in one node. How to specify it? CUDA_VISIBLE_DEVICES does not work.

Support for PPO for PRM?

Does this support PPO with step-level PRM? Currently I only see scripts for PPO with token-level RM. Specifically, how can we train PPO with [OpenRLHF/Mistral-7b-PRM-Math-Shepherd](https://huggingface.co/OpenRLHF/Mistral-7b-PRM-Math-Shepherd)? Are there train codes and...

enhancement

SFT training objective

Are the SFT codes here trained on the whole chat or just the response (completions)?