clang8 icon indicating copy to clipboard operation
clang8 copied to clipboard

cLang-8 is a dataset for grammatical error correction.

Results 10 clang8 issues
Sort by recently updated
recently updated
newest added

Hi, if there is any scripts for GEC pretraining, like adding noises to the original sentences?

@ekQ Training the model from scratch is time consuming, require knowledge not everyone has, hardware few people have and is error prone. It would be really great if you could...

Hey, I was wondering if it was possible to use the pre-trained GEC model here on sentences directly to improve text style transfer for a paper.

Hi, I'm working in GEC for a low resource language and wanted to create datasets myself. I have some question if you can answer i will be thankful. 1) I...

I wanted to ask if there are any conditions under which you would grant a commercial license? We are interested to integrate your model into our inclusive writing assistant https://www.witty.works/

Hi. I am interested in the grammar error correction task and I tried to reproduce your result, but I got only 64 F0.5 score for conll2014 when tuning the T5-large...

I‘d like to know why the size of cLang-8 is larger than the original Lang-8. cLang-8 contains 2372119 English sent-pairs, while Lang-8 contains only 1037561 English sent-pairs.

Hi, thanks for your great work. Then, I run the following command, it works. ``` echo "Running a test..." python -m prepare_clang8_dataset_test ``` However, when I run the following command,...

Dear authors of clang-8, I want to use clang-8 to fine tune the T5 model, however, I dont know what is the task prefix (e.g. for translation, the prefix could...

Can you tell me what hyperparameters were used for the beam search at inference time and anything concerning penalty for length and repetition? Thanks!