DuReader
What should I do if I want to use my own data?
I want to know what the various keys in the JSON dataset represent, for example 'is_selected', 'answer_spans', and 'match_scores'. I also see that these keys are not present in the raw data.
- Make your data look like dureader-raw.
- Segment it into segmented-xxx fields.
- Use the preprocess.py script to generate the preprocessed data.
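A rough sketch of the first two steps, in case it helps. The field names (`question_id`, `question`, `documents`, `title`, `paragraphs`, `answers`, and the `segmented_*` counterparts) are assumptions and should be checked against an actual dureader-raw sample. The helper names are hypothetical, and the whitespace tokenizer is only a placeholder: Chinese text needs a real word segmenter passed in as `tokenize`.

```python
import json


def to_raw_like(question_id, question, documents, answers):
    """Build one record shaped like a raw sample (field names are assumed)."""
    return {
        "question_id": question_id,
        "question": question,
        "documents": [
            {"title": d["title"], "paragraphs": list(d["paragraphs"])}
            for d in documents
        ],
        "answers": list(answers),
    }


def segment_record(record, tokenize=str.split):
    """Return a copy of the record with segmented_* fields added.

    `tokenize` maps a string to a list of tokens. The default splits on
    whitespace and is only a placeholder for a real word segmenter.
    """
    out = dict(record)
    out["segmented_question"] = tokenize(record["question"])
    out["segmented_answers"] = [tokenize(a) for a in record["answers"]]
    out["documents"] = [
        {
            **doc,
            "segmented_title": tokenize(doc["title"]),
            "segmented_paragraphs": [tokenize(p) for p in doc["paragraphs"]],
        }
        for doc in record["documents"]
    ]
    return out


def dump_jsonl(records, path):
    """Write one JSON object per line, keeping non-ASCII text readable."""
    with open(path, "w", encoding="utf-8") as f:
        for r in records:
            f.write(json.dumps(r, ensure_ascii=False) + "\n")
```

After writing the segmented file with `dump_jsonl`, the third step is to run preprocess.py on it. Check the script itself for how it expects to receive its input.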