self-rag Obtaining my own critic data

First of all, congratulations on the impressive work! I am looking to extend Self-RAG to long-context tasks and am planning to create my own training data. I have three questions about obtaining my own critic data.

What is the input file? Are the inputs obtained from the tasks themselves?
How do we create the JSONL for the input files, and where do those parameters come from? (There is no README in process_data, although data_creation/critic/gpt4_reward/README.md mentions one.) Does each type of critic data need a separate input file?
Once I have obtained the critic data for each of the four tokens, how do I combine them into one file for training the critic model?

Dec 04 '23 18:12 roynirmal

Yes! There is no README in process_data, although data_creation/critic/gpt4_reward/README.md mentions one.

Feb 25 '24 17:02 AllenShow

Hi, thank you so much for opening an issue! Sorry that the README is not easy to follow. I have addressed some of the questions briefly here and will try to improve the documentation in the coming weeks.

Mar 19 '24 21:03 AkariAsai