cogcomp-nlp icon indicating copy to clipboard operation
cogcomp-nlp copied to clipboard

json format deserialization won't work with transformed text

Open mssammon opened this issue 8 years ago • 0 comments

https://github.com/CogComp/cogcomp-nlp/blob/master/core-utilities/src/main/java/edu/illinois/cs/cogcomp/core/utilities/JsonSerializer.java#L463

This method requires that strings contain only characters corresponding to tokens. For a transformed text, this means a version of that transformation must be maintained and used here, or some alternative method for computing token offsets must be used (e.g., serializing them explicitly)

mssammon avatar Jul 21 '17 16:07 mssammon