Diego Fiori
Diego Fiori
Hi @ylassoued, examples of the config file and of dataset generation have been added to the new readme with #203 and #204. Feel free to open another issue if you...
Hi @Alla-Abdella, thank you for contacting us! We are currently working on refining our code and we will release some training data examples very soon.
Hi @MohamedAliRashad and @ming1523, thanks for pointing out the error. We are currently fixing multiple bugs, you can see the updates on #203.
Closing the issue since the problem has been solved in #203.
Fixed in #212. It should work now pulling the latest commit.
Closing the issue since examples have been provided in the new readme #204.
Hi @TonyZhanghm, thanks for asking 😄 We actually worked on it and we will merge a PR with the code for downloading (and formatting) the Anthropic and SHP datasets. We...
Closing the issue since it has been fixed in #203.
Hi @lonelydancer, thank you for reaching out. Today, we are gonna release a new readme with more extensive examples. Please let us know if you have any other feedback 😃
Data examples for reward training has been released with #203. For updates on distributed training see #242 and #229