GALAXY
GALAXY copied to clipboard
Two questions about the evaluation
Hi,
Great thanks for providing this fantastic repo!
I have two questions about the evaluations:
- How many random seeds did you use to get the main evaluation results on the MultiWOZ2.0 dataset, e.g., Table 3 in your AAAI paper? If more than one seed is used, what are the other seeds except the
SEED=10
inGALAXY/scripts/multiwoz2.0/train.sh
? - In
GALAXY/scripts/multiwoz2.0/infer.sh
there is a commandLOAD_MODEL_NAME=state_epoch_7
. May I ask how you select this checkpoint (the 7-th/60 training epochs)? Is there a way that we can automatically select the best checkpoint?
Looking forward to hearing from you!