Original vs annotated alignment

Open • florpi opened this issue 3 years ago • 0 comments

Hello! Thank you for making this really cool dataset publicly available :)

I'm trying to align the annotations with the original text. Could you please specify which tokenizer was used to produce the dataset? So far I can't get the alignment quite right. Or is there perhaps an easier way to align the original texts and the annotations that I'm missing? Thanks in advance!

florpi • Mar 03 '21 17:03