universal-data-tool
universal-data-tool copied to clipboard
Respecting formating when importing text data
Hi,
I was wondering whether there is a way to preserve the text formating when importing text data. I'm currently working on a NER problem where it would be nice if the structure of the text could be preserved. For example, '\n' is simply displayed as '\ n' in UDT, even though I'd like it to be interpreted as a line break. Is there a way to change that?
Thanks!
Yes with wordSplitRegex! More info on the format page
I think to fix it you'll want to set "wordSplitRegex" to [a-zA-ZÀ-ÿ\\]+
Related to #374