RL4LMs
RL4LMs copied to clipboard
Any plans for Deepspeed/Accelerate integration?
Hi, great library! I'm wondering if you have any plans for deepspeed or accelerate integration to train larger models (e.g., GPT-J)?
Hey @Breakend Thanks! We are working on the integration of HF Accelerate. Will let you know once it is out!
Thanks! No pressure. Just curious is there any ETA?
@boblee22 Probably by end of this month :)
Hi, thanks for the great library! Curious whether there's an updated eta on this issue?
We are a bit delayed on this. But it is coming soon. Is there a particular LM that you would like to train (that needs multi node setup) ?
I was trying to train blenderbot-3B, but it doesn't have to be that specific model. I'm interested in training models with 3-5B parameters, which I can currently do with deepspeed + huggingface. Thanks for prompt response!
Any updates on this?
We have a branch for this. https://github.com/allenai/RL4LMs/tree/add-accelerate-support But we are still testing it thoroughly before rolling it out.
Perfect! Do you recommend me to try it out or should I wait until the release ?