
Mistake in pre-processing

Open bnaman50 opened this issue 2 years ago • 0 comments

Hey,

I believe there is a mistake in the pre-processing function (here). As you mention yourself in the comments, sentences should be joined with [SEP] [CLS], but the code (line 18) adds [CLS] [SEP] instead.
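
To make the difference concrete, here is a minimal sketch of the two separator orders. It uses plain string joining only; `join_sentences` and the sample sentences are hypothetical, and I am assuming the whole input is wrapped in a leading [CLS] and a trailing [SEP], which may not match the repository's actual code.

```python
CLS, SEP = "[CLS]", "[SEP]"

def join_sentences(sentences, separator):
    """Join sentences with `separator` and wrap the result in [CLS] ... [SEP].

    Hypothetical helper for illustration; not the repository's function.
    """
    return f"{CLS} {separator.join(sentences)} {SEP}"

sentences = ["first sentence .", "second sentence ."]  # placeholder text

# Order described in the code comment: close a sentence, then open the next.
as_commented = join_sentences(sentences, f" {SEP} {CLS} ")

# Order reported at line 18: the two markers are swapped.
as_coded = join_sentences(sentences, f" {CLS} {SEP} ")

print(as_commented)
print(as_coded)
```

With the swapped separator, the second sentence no longer starts with [CLS], so there is no per-sentence [CLS] position in front of it.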

I also checked the training code, and it seems to do the same.

This is also confirmed by the tokens of the datasets (given in the training repo). Here {101: '[CLS]', 102: '[SEP]'}.
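
A quick way to check the order on a tokenized example is to look at adjacent pairs of special-token IDs. This is only a sketch: `boundary_pairs` is a hypothetical helper, and the ID sequences below are made-up placeholders rather than values copied from the dataset; only the mapping of 101 and 102 comes from the note above.

```python
CLS_ID, SEP_ID = 101, 102  # 101 -> [CLS], 102 -> [SEP]

def boundary_pairs(token_ids):
    """Return adjacent (left, right) pairs where both IDs are special tokens."""
    special = {CLS_ID, SEP_ID}
    return [
        (left, right)
        for left, right in zip(token_ids, token_ids[1:])
        if left in special and right in special
    ]

# Placeholder sequences; 7592 and 2088 stand in for ordinary word tokens.
sep_then_cls = [101, 7592, 102, 101, 2088, 102]
cls_then_sep = [101, 7592, 101, 102, 2088, 102]

print(boundary_pairs(sep_then_cls))
print(boundary_pairs(cls_then_sep))
```

A boundary of (102, 101) corresponds to [SEP] [CLS], while (101, 102) corresponds to the swapped [CLS] [SEP] order.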

Could you please confirm it?

Thanks, Naman

bnaman50 • Apr 27 '22 23:04