Could I confirm the performance results for the WikiSmall dataset?
In Table 1 of your paper, I see that performance on WikiSmall is much lower than on WikiLarge. The table indicates that the three results are based on different test sets.
I know that for WikiLarge you use the test set with 8 references. For WikiSmall, which test set are you using? Is it PWKP_108016.tag.80.aner.test.src/PWKP_108016.tag.80.aner.test.dst (containing 100 test samples) in the WikiSmall folder?