bran icon indicating copy to clipboard operation
bran copied to clipboard

Pre trained Embedding

Open pranoy-k opened this issue 6 years ago • 1 comments

How did you pretrain the embeddings, I am confused because they are based on byte pair encoding?

pranoy-k avatar Feb 14 '19 18:02 pranoy-k

Sorry for the delayed response. I trained word2vec using bytepair tokenized data. In reality, I noticed very little difference in performance using the pretrained vs randomly initialized byte pair embeddings.

patverga avatar Apr 29 '19 14:04 patverga