doccano-transformer icon indicating copy to clipboard operation
doccano-transformer copied to clipboard

Not writing all entities in to_conll2003

Open rjuez00 opened this issue 2 years ago • 1 comments

How to reproduce the behaviour

I can't share the data because its confidential but some entities simply aren't written when using that function over documents!

rjuez00 avatar Apr 12 '22 19:04 rjuez00

Having pushed a little the analysis following the loss of many data, I realized that there were spaces (or -) included at the beginning or end of annotation empeding a correct tokenization and the associated labeling.

rayondemiel avatar Jul 07 '22 16:07 rayondemiel