What is the training data format for chatllama's actor_config/reward_config?

Open lonelydancer opened this issue 3 years ago • 2 comments

Can you release some data samples?

Mar 08 '23 10:03 lonelydancer

Hi @lonelydancer, thank you for reaching out. Today we are going to release a new README with more extensive examples. Please let us know if you have any other feedback 😃

Mar 08 '23 10:03 diegofiori

Please explain in detail how distributed training works.

Mar 08 '23 11:03 Vincent131499

Data examples for reward training have been released with #203. For updates on distributed training, see #242 and #229.

Mar 10 '23 16:03 diegofiori