BEGIN-dataset
BEGIN-dataset copied to clipboard
A benchmark dataset for evaluating dialog system and natural language generation metrics.
Results
2
BEGIN-dataset issues
Sort by
recently updated
recently updated
newest added
I think this isn't fully attributable because there is no way to decide that "it" in the evidence refers to Graceland, but the answer does resolve to it. it is...
This one isn't fully attributable (because of equivalent... "Yes it is"): doha wow It was created by Joy Whitby. Is that like the equivalent of Barney in the US? Yes...