DuReader icon indicating copy to clipboard operation
DuReader copied to clipboard

what should I do if I want to use my data?

Open JYZ122 opened this issue 6 years ago • 1 comments

I want to know what the various keys of the json data set represent. For example, ‘is_selected’, ‘answer_spans’, and ‘match_scores’. And I see that there are no such keys in the raw data.

JYZ122 avatar Mar 22 '19 13:03 JYZ122

  1. make your data look like dureader-raw
  2. segmented them into segmented-xxx
  3. using the preprocess.py script to generate preprocess-data

menghuu avatar Aug 09 '19 09:08 menghuu