AgentGym
Questions about example training scripts
Hi, I have some questions about the parameters in the AgentEvol training script:
- Where can we get the training data? The script sets this path:
iter_data_path="./iter_data/iter_data_all_task"
- What's the difference between the test step and the inference step?
- Where can we get the inference data? The script builds the file path like this:
inference_file_list=("webshop.json" "alfworld.json" "textcraft.json" "sciworld.json" "sqlgym.json" "wordle.json" "maze.json" "babyai.json" "weather.json" "movie.json" "todo.json" "academia.json" "sheet.json" "webarena.json")
cur_inference_file=./small_exploration_data/${inference_file_list[$index]}
- What do I need to change to train a model that does NOT use the Llama 2 Chat template?
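To make the inference data question concrete, here is a minimal sketch of how I understand the path to be resolved. It assumes `index` is a zero-based integer set by the surrounding loop in the script (I have not confirmed this); the value 2 and the shortened list are only for illustration.

```shell
#!/usr/bin/env bash
# Sketch only: `index` is assumed to be a zero-based integer set by the
# surrounding loop in the training script; 2 is an arbitrary example value.
inference_file_list=("webshop.json" "alfworld.json" "textcraft.json")  # shortened list
index=2

# Bash arrays are zero-indexed, so index=2 selects the third entry.
cur_inference_file=./small_exploration_data/${inference_file_list[$index]}
echo "$cur_inference_file"

# Report the file as missing if it is not present on disk.
if [ ! -f "$cur_inference_file" ]; then
    echo "missing: $cur_inference_file" >&2
fi
```

With these values the path resolves to `./small_exploration_data/textcraft.json`, which is one of the files I cannot find.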
Thanks,