Chinese-Grammatical-error-diagnosis icon indicating copy to clipboard operation
Chinese-Grammatical-error-diagnosis copied to clipboard

CGED DATASET

Open ylinlinz opened this issue 3 years ago • 3 comments

您好,请问是否会share CGED2020 dataset?

ylinlinz avatar Jun 29 '21 00:06 ylinlinz

不太清楚欸,我也没CGED2020的数据集

Happleasei avatar Jun 29 '21 12:06 Happleasei

不太清楚欸,我也没CGED2020的数据集

@ HelloHeChengXi 您好,看您这里面提供的bio格式的数据,我发现有两个连续B-M的情况存在,请问这种情况的标注正确么?

BCWang93 avatar Oct 28 '21 07:10 BCWang93

追问一下,CGED2020 的数据有单独的训练集合吗?

我看文章说提供了

We provide 1129 training units with a total of 2,909 grammatical errors, categorized as redundant (678 instances), missing (801), word selection (1228) and word ordering (201).

目前只在https://github.com/blcuicall/cged_datasets 下面找到了测试集合。

Vimos avatar Jul 27 '22 04:07 Vimos