Open-Assistant
Open-Assistant copied to clipboard
accelerator version issue
huggingface's accelerator updated from v0.19.0 to v0.20.0 and 'logging_dir' disappeared from init method in Accelerator class.
So the above error occurs.
https://github.com/huggingface/accelerate/blob/baebae3bbecbea05d721a50917f352cccd14811e/src/accelerate/accelerator.py#L242-L244
OA doesn't specify version https://github.com/LAION-AI/Open-Assistant/blob/0fcf3e08fe62295d4696e590005b0f33383342ea/model/pyproject.toml#L12-L13
However, if you go to the trlx library that actually runs the accelerator, it is versioned as shown below.
https://github.com/CarperAI/trlx/blob/0dce99d96b7d70b6a9114129d8e38bf6c80eb653/requirements.txt#L1-L2
Of course, it is true that the trlx library also has its own errors.
However, if OA will have a dependency on the trlx library, I think it is necessary to get the trlx requirement.txt and install it with the same version as specified there.
Could you pleas provide the command you were running?
Could you pleas provide the command you were running?
- Create a new virtual environment using Anaconda
- Command
pip install -e .inOpen-Assistant/modelfolder, So dependencies can be installed according topyproject.tomlfile - Run
trainer.pyfile. I used my own script. But I think example code provided inOpen-Assistant/model/model_trainingwould make the same error.python trainer_rl.py --configs defaults_rlhf --cache_dir $DATA_PATH --rank_model $REWARD_MODEL --sft_model $SFT_MODEL --output_dir $MODEL_PATH/rl_model - Because error is occured when calling
trainer.py-trlx.trainmethod https://github.com/LAION-AI/Open-Assistant/blob/8ea3f1464db0c18c4311a68275d28692660abc23/model/model_training/trainer_rl.py#L184-L191 trlxlibrary uses HuggingFace'sacceleratorlibrary when training RLHF using PPO.- And
trlxlibrary manage their ownacceleratorversion. (For example now is 0.18.0) - But
Open-Assistant'sacceleratorversion (installs latest version automatically, now is 0.20.0) is not compatible withtrlxcode that usesaccelerator. - To summarize,
Open-Assistantusestrlxandtrlxusesaccelerator.trlxuse older version ofaccelerator, butOpen-Assistant's dependency file installs latest version ofaccelerator. So error occurs when usingtrlx