UBAR-MultiWOZ

MultiWOZ evaluation

Open: unbiarirang opened this issue 4 years ago • 1 comment

Hi, thank you for sharing the code! I appreciate your work and I have a question.

The original MultiWOZ `evaluate.py` calculates the INFORM score by checking whether the first offered venue is correct:

    elif len(venue_offered[domain]) > 0 and venue_offered[domain][0] in goal_venues:
        match += 1

However, your evaluation code checks whether there is any overlap between the offered venues and the goal venues:

    elif len(venue_offered[domain]) > 0 and len(set(venue_offered[domain])& set(goal_venues))>0:
        match += 1

Doesn't this lead to a higher score than the one obtained with the original evaluation code?
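To make the difference concrete, here is a minimal sketch that isolates the two conditions. The function names and the venue IDs are hypothetical and only for illustration; they are not taken from either evaluation script.

```python
def first_offer_match(venues_offered, goal_venues):
    """Stricter check: only the first offered venue counts."""
    return len(venues_offered) > 0 and venues_offered[0] in goal_venues


def any_overlap_match(venues_offered, goal_venues):
    """Looser check: any offered venue that is also a goal venue counts."""
    return len(venues_offered) > 0 and len(set(venues_offered) & set(goal_venues)) > 0


# Hypothetical venue IDs: the first offer is wrong, the second is right.
offered = ["19210", "19214"]
goal = ["19214", "19220"]

print(first_offer_match(offered, goal))
print(any_overlap_match(offered, goal))
```

In this case only the overlap check counts a match. Whenever the first offered venue is in the goal venues, the two sets also overlap, so the overlap-based count can never be lower than the first-venue count.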

unbiarirang, Apr 20 '21 10:04

The same question. @TonyNemo

SkyAndCloud, Aug 25 '21 09:08