Pytorch-POS-Tagger
data preprocess
Can you tell me how you process the data into the format of sentences.txt and tags.txt? Thank you very much.
sentences.txt contains one sentence per line, written as a space-separated list of tokens. Similarly, tags.txt contains one line per sentence, written as a space-separated list of tags, with one tag for each token on the corresponding line of sentences.txt. Once you have sentences.txt and tags.txt ready, you can run preprocess.sh to preprocess them for use by the model.
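As a rough illustration of the format described above, here is a minimal sketch that writes the two files from a tagged corpus held in memory. It assumes the corpus is already available as a list of sentences, each a list of (token, tag) pairs; the function name `write_parallel_files` and the sample data are hypothetical and are not part of this repository.

```python
from pathlib import Path


def write_parallel_files(tagged_sentences, sentences_path, tags_path):
    """Write tokens and tags to two line-aligned, space-separated files.

    tagged_sentences: iterable of sentences, each a list of (token, tag) pairs.
    Returns the number of sentences written.
    """
    sentence_lines = []
    tag_lines = []
    for sentence in tagged_sentences:
        if not sentence:
            continue  # skip empty sentences so both files stay aligned
        tokens = [token for token, _ in sentence]
        tags = [tag for _, tag in sentence]
        # An empty item or one containing whitespace would break the
        # space-separated format, so reject it instead of writing a bad line.
        if any(len(item.split()) != 1 for item in tokens + tags):
            raise ValueError(f"token or tag is empty or contains whitespace: {sentence!r}")
        sentence_lines.append(" ".join(tokens))
        tag_lines.append(" ".join(tags))

    # One sentence per line; line i of the tags file matches line i of the sentences file.
    Path(sentences_path).write_text(
        "".join(line + "\n" for line in sentence_lines), encoding="utf-8"
    )
    Path(tags_path).write_text(
        "".join(line + "\n" for line in tag_lines), encoding="utf-8"
    )
    return len(sentence_lines)


# Hypothetical sample data, for illustration only.
EXAMPLE_CORPUS = [
    [("The", "DT"), ("cat", "NN"), ("sleeps", "VBZ"), (".", ".")],
    [("Dogs", "NNS"), ("bark", "VBP"), (".", ".")],
]

if __name__ == "__main__":
    write_parallel_files(EXAMPLE_CORPUS, "sentences.txt", "tags.txt")
```

How you obtain the (token, tag) pairs depends on your dataset's original format; once both files exist, preprocess.sh can be run as described above.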