Evidence sources for dataset?

Open dwadden opened this issue 4 years ago • 1 comments

Hi, thanks for creating this dataset!

In sec. 4.1 of your paper, it looks like you use a pipeline where you select relevant sentences from an evidence document, and then use BERT to predict the relation between the selected sentences and the claim. Does the main_text field in the data you make available for download correspond to the input evidence document?

What exactly is the relationship between the main_text and the sources? Is the main_text just the concatenation of the text from all the sources - and if so, what's going on in the cases where there is no source listed?

Thanks for the clarification!

Dave

Apr 05 '21 23:04 dwadden

Any updates here? Seems even there are some cases the sentences of main_text are not from the sources, e.g., 100th cases in testing split,

Nov 06 '21 03:11 yfqiu98