
Obtaining my own critic data

Open roynirmal opened this issue 2 years ago • 2 comments

First of all, congratulations on the impressive work! I am looking to extend SELF_RAG to long-context tasks and plan to create my own training data. I have three questions about obtaining my own critic data.

  1. What is the input file? Are the inputs obtained from the tasks themselves?
  2. How do we create the jsonl for the input files, and where do we get those parameters from? (There is no README in process_data, although one is mentioned in data_creation/critic/gpt4_reward/README.md.) Does each type of critic data need a separate input file?
  3. Once I have obtained the critic data for each of the four tokens, how do I combine them into one file for training the critic model?

roynirmal, Dec 04 '23 18:12

Yes! There is no README in process_data, although one is mentioned in data_creation/critic/gpt4_reward/README.md.

AllenShow, Feb 25 '24 17:02

Hi, thank you so much for opening an issue! Sorry that the README is not easy to follow. I have addressed some of the questions briefly here and will try to improve the documentation in the coming weeks.

AkariAsai, Mar 19 '24 21:03
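
On question 3 (combining the per-token critic data into one file), the thread does not record the maintainers' procedure. The sketch below is only one generic way to do it, assuming each token type's critic data is already a JSONL file with one JSON object per line. The function name `merge_jsonl`, the added `source` field, and the file names in the usage example are all hypothetical and are not taken from the repository.

```python
import json
import random


def merge_jsonl(inputs, output_path, seed=0):
    """Merge several JSONL files into one shuffled JSONL file.

    `inputs` maps a label (hypothetical, e.g. one per reflection-token
    type) to the path of that label's JSONL file. Returns the number of
    records written.
    """
    records = []
    for label, path in inputs.items():
        with open(path, encoding="utf-8") as f:
            for line in f:
                line = line.strip()
                if not line:  # skip blank lines
                    continue
                record = json.loads(line)
                # Assumption: tag each record with its origin so the
                # token type stays recoverable after shuffling.
                record["source"] = label
                records.append(record)

    # Shuffle with a fixed seed so the merged file is reproducible.
    random.Random(seed).shuffle(records)

    with open(output_path, "w", encoding="utf-8") as f:
        for record in records:
            f.write(json.dumps(record, ensure_ascii=False) + "\n")
    return len(records)
```

A call might look like `merge_jsonl({"retrieval": "retrieval.jsonl", "utility": "utility.jsonl"}, "critic_train.jsonl")`, where the labels and paths are placeholders. Whether the actual critic training script expects this layout, or a specific set of fields per record, would need to be checked against the repository.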