InstructZero
data split
Hi, after reading your code, I'd like to confirm my understanding of how you split your dataset:
- 5 samples are generated from prompt_gen_data to induce the instruction from the open-source LLM
- 20 samples are generated from eval_data to evaluate the quality of the induced instruction during BO iterations
- 100 samples are generated from test_data to evaluate the quality of the proposed instruction after BO iterations

Thanks!
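To make the question concrete, here is a minimal sketch of the split as I understand it. This is not taken from the repository: the function name `sample_split`, its parameters, and the assumption that each pool is a list of (input, output) pairs are all hypothetical, and only the sizes 5 / 20 / 100 come from my reading above.

```python
import random

def sample_split(prompt_gen_data, eval_data, test_data,
                 n_demos=5, n_eval=20, n_test=100, seed=0):
    """Hypothetical helper: draw the three subsets described above.

    Each argument is assumed to be a list of (input, output) pairs.
    """
    rng = random.Random(seed)

    def take(pool, n):
        # Sample without replacement; cap at the pool size if it is smaller.
        return rng.sample(pool, min(n, len(pool)))

    return {
        # Demonstrations shown to the open-source LLM to induce an instruction.
        "demos": take(prompt_gen_data, n_demos),
        # Examples used to score each candidate instruction during BO.
        "eval": take(eval_data, n_eval),
        # Examples used to score the final instruction after BO.
        "test": take(test_data, n_test),
    }
```

If this matches what the code does, then the three subsets come from disjoint pools and the test examples never influence the BO loop. Please correct me if any of the sizes or roles are wrong.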