nebuly
What is the training data format for chatllama's actor_config/reward_config?
Can you release some data samples?
Hi @lonelydancer, thank you for reaching out. Today we are going to release a new README with more extensive examples. Please let us know if you have any other feedback 😃
Please explain in detail how distributed training works.
Data examples for reward training have been released with #203. For updates on distributed training, see #242 and #229.