Evidence sources for dataset?
Hi, thanks for creating this dataset!
In sec. 4.1 of your paper, it looks like you use a pipeline where you select relevant sentences from an evidence document, and then use BERT to predict the relation between the selected sentences and the claim. Does the main_text field in the data you make available for download correspond to the input evidence document?
What exactly is the relationship between the main_text and the sources? Is the main_text just the concatenation of the text from all the sources - and if so, what's going on in the cases where there is no source listed?
Thanks for the clarification!
Dave
Any updates here? Seems even there are some cases the sentences of main_text are not from the sources, e.g., 100th cases in testing split,