webanno
Use codepoint offsets instead of UTF-16 offsets
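Is your feature request related to a problem? Please describe.
When working with TSV3 files in other programming languages, it is a bit annoying that offsets count Java UTF-16 code units rather than Unicode code points. The two differ as soon as the text contains a character outside the Basic Multilingual Plane (an emoji, for example), because Java stores such a character as a surrogate pair that occupies two offsets.
Describe the solution you'd like
TSV3 files should use code point offsets, which would be more interoperable across programming languages.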
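To illustrate the mismatch, here is a minimal Python sketch of the conversion a consumer currently has to do by hand. It assumes the document text is already loaded as a Python `str` (which is indexed by code point) and that the offsets read from the TSV3 file count UTF-16 code units, as described above. The function names `utf16_to_codepoint_offsets` and `convert_span` are hypothetical and not part of WebAnno or any library.

```python
from typing import List, Tuple


def utf16_to_codepoint_offsets(text: str) -> List[int]:
    """Build a table mapping each UTF-16 code unit offset to a code point offset."""
    mapping: List[int] = []
    for cp_index, ch in enumerate(text):
        # Characters above U+FFFF need two UTF-16 code units (a surrogate pair);
        # both units map back to the same code point index.
        units = 2 if ord(ch) > 0xFFFF else 1
        mapping.extend([cp_index] * units)
    # Final entry so that an end offset equal to the text length resolves too.
    mapping.append(len(text))
    return mapping


def convert_span(text: str, begin: int, end: int) -> Tuple[int, int]:
    """Convert a (begin, end) span given in UTF-16 code units to code point offsets."""
    mapping = utf16_to_codepoint_offsets(text)
    return mapping[begin], mapping[end]


# "a", an emoji outside the BMP (U+1F600), then "b".
text = "a\U0001F600b"

# In UTF-16 code units, "b" sits at 3-4 because the emoji occupies offsets 1-3.
begin, end = convert_span(text, 3, 4)
print(begin, end, text[begin:end])
```

With code point offsets in the file, this lookup table would be unnecessary in languages whose strings are indexed by code point.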