export entire corpus (in any format)

Open jonorthwash opened this issue 5 years ago • 4 comments

It doesn't seem possible to, e.g., upload a corpus in CG format and export it in CoNLL-U. Perhaps this functionality is only available through the backend?

jonorthwash avatar May 16 '19 20:05 jonorthwash

I marked this as "regression" because one used to be able to download corpora (at least in CoNLL-U format), but that no longer seems possible.

EDIT: actually, it does seem possible, sometimes. Downloading functionality needs thorough testing.

jonorthwash avatar Jun 01 '19 22:06 jonorthwash

Where can I find a corpus in CG format, for testing?

yaskevich avatar Jun 26 '19 19:06 yaskevich

Try these: https://github.com/apertium/apertium-kaz/tree/master/texts

IIRC, notatrix also has some tests.

jonorthwash avatar Jun 26 '19 20:06 jonorthwash

I tested it with kdt.tagged.txt from the repo you provided. It seems to work; the file format of the download depends on the active tab of the editor. In the next commit, file extensions will be added according to the file format.
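
For illustration only, here is a minimal sketch of how a download filename could take its extension from the active format tab. It is not ud-annotatrix's actual code: the function name `export_filename`, the format keys, and the extensions are all assumptions made for this example.

```python
# Hypothetical mapping from the editor's active format tab to a file extension.
# Both the keys and the extensions are illustrative assumptions, not taken
# from the ud-annotatrix source.
EXTENSIONS = {
    "conll-u": ".conllu",
    "cg3": ".cg3.txt",
    "sd": ".sd.txt",
    "brackets": ".brackets.txt",
    "plain text": ".txt",
}


def export_filename(corpus_name: str, active_format: str) -> str:
    """Build a download filename whose extension follows the active format."""
    key = active_format.strip().lower()
    # Fall back to a generic .txt extension for formats not listed above.
    return corpus_name + EXTENSIONS.get(key, ".txt")


if __name__ == "__main__":
    print(export_filename("kdt.tagged", "CoNLL-U"))
    print(export_filename("kdt.tagged", "CG3"))
```

Keeping the mapping in one table means that adding a format only requires a new entry, and the fallback keeps the download usable for formats that are not listed.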

yaskevich avatar Jun 27 '19 17:06 yaskevich